QLoRA：大规模语言模型微调的量化工具

版权申诉

5星 · 超过95%的资源 102 浏览量更新于2024-10-24 收藏 50.81MB ZIP 举报

资源摘要信息:"QLoRA是专门设计用于对大规模语言模型（LLM）进行量化微调的工具。在深度学习和自然语言处理领域，大型语言模型如GPT和BERT等已经证明了它们在理解和生成人类语言方面的强大能力。然而，这些模型往往拥有数十亿甚至数万亿的参数，导致它们计算量大、存储需求高，且在部署时对硬件资源的要求严苛。量化技术作为一种优化手段，通过减少模型参数的精度来降低计算量和存储需求，提高运行速度和效率。 QLoRA工具的出现，为研究者和工程师们提供了一种有效的方式来微调这些经过量化的大型语言模型。它通过特殊设计的量化和微调策略，使得模型在精度损失最小的情况下，依然能够在特定任务上展现出良好的适应性和表现。这种微调方法使得经过量化的语言模型能够在特定任务上保持甚至提升性能，同时大大减少了运行和部署的成本。量化微调的实践涉及对原始浮点数模型参数进行四舍五入或剪切以减少精度，将它们转换为较低比特宽度的表示形式（如INT8而不是FP32）。这不仅减少了模型的大小，还能显著提升计算速度，因为整数运算通常比浮点运算更快，且更易在各种硬件平台上进行优化。 QLoRA的工作原理涉及到一系列高级算法和策略，包括但不限于： 1. 知识蒸馏：通过将一个大型的、预先训练好的模型的输出作为软标签，来训练一个更小的模型，使得小模型在学习过程中能够保留大模型的知识。 2. 权重映射：将浮点数权重映射为定点数表示，同时调整模型架构和训练策略，以最小化由权重量化引起的性能下降。 3. 适应性微调：在量化模型的基础上进行微调，以便模型能够在特定任务上进行优化，通常涉及对模型的前几层或者最后几层进行微调，因为这些层包含了与特定任务最相关的特征。 QLoRA工具的开发，标志着量化技术在自然语言处理领域的进一步成熟。通过提供一个强大的平台来研究和应用这些量化策略，它有望进一步降低大规模语言模型的应用门槛，推动这些模型在更广泛的场景中的应用，比如移动设备、边缘计算等资源受限的环境。 QLoRA的出现，为研究人员和开发者提供了一个新的优化途径，允许他们以更少的资源消耗来微调和部署高性能的自然语言处理模型。随着深度学习和机器学习技术的不断发展，量化技术及其工具如QLoRA，正在成为推动这一领域发展的重要力量。" 【文件名称列表】中的"qlora-main"可能指向了QLoRA项目的主代码库或主要组成部分。这通常包括了工具的核心功能、使用示例、API文档以及可能的测试脚本等，使得用户可以方便地下载、安装和使用QLoRA工具进行量化微调实验。

收起资源包目录

量化LLM微调工具：用于量化微调大规模语言模型(LLM)的工具（274个子文件）

gpt-3.5-oa-generations-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1011KB

13b-alpaca-oa-generations-topp0.9-temp0.7.jsonl 4.2MB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1014KB

65b-unnatural-instructions-oa-generations-topp0.9-temp0.7.jsonl 3.97MB

65b-longform-oa-generations-topp0.9-temp0.7.jsonl 4.7MB

7b-alpaca-oa-generations-topp0.9-temp0.7.jsonl 4.9MB

vicuna-13b-oa-generations-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

gpt-3.5-oa-generations-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.02MB

13b-hh-rlhf-oa-generations-topp0.9-temp0.7.jsonl 7.85MB

30b-guanaco-vicuna-generations-topp0.9-temp0.7.jsonl 282KB

vicuna-13b-oa-generations.jsonl 5.31MB

gpt-3.5-oa-generations.jsonl 3.99MB

65b-chip2-oa-generations-topp0.9-temp0.7.jsonl 4.12MB

30b-self-instruct-oa-generations-topp0.9-temp0.7.jsonl 4.57MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1.01MB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1MB

vicuna-13b-oa-generations-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1018KB

gpt-4-oa-generations-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1.02MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1019KB

gpt-4-oa-generations-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1022KB

13b-unnatural-instructions-oa-generations-topp0.9-temp0.7.jsonl 4.19MB

gpt-3.5-oa-generations-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

generations_qualitative_comparison_guanaco65b_vs_gpt35.ipynb 327KB

30b-unnatural-instructions-oa-generations-topp0.9-temp0.7.jsonl 4.25MB

7b-longform-vicuna-generations-topp0.9-temp0.7.jsonl 177KB

guanaco_7B_demo_colab.ipynb 15KB

13b-hh-rlhf-vicuna-generations-topp0.9-temp0.7.jsonl 253KB

zero_shot_mmlu_val.json 936KB

65b-longform-vicuna-generations-topp0.9-temp0.7.jsonl 213KB

gpt-3.5-oa-generations-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1020KB

7b-self-instruct-oa-generations-topp0.9-temp0.7.jsonl 5.38MB

65b-guanaco-oa-generations-topp0.9-temp0.7.jsonl 5.91MB

13b-longform-vicuna-generations-topp0.9-temp0.7.jsonl 180KB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.02MB

65b-flan-oa-generations-topp0.9-temp0.7.jsonl 3.68MB

gpt-3.5-oa-generations-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1004KB

vicuna-13b-oa-generations-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 1004KB

7b-hh-rlhf-vicuna-generations-topp0.9-temp0.7.jsonl 254KB

13b-guanaco-oa-generations-topp0.9-temp0.7.jsonl 6.19MB

7b-hh-rlhf-oa-generations-topp0.9-temp0.7.jsonl 8.07MB

five_shot_mmlu_test.json 40.41MB

answer_gpt4.jsonl 174KB

7b-chip2-oa-generations-topp0.9-temp0.7.jsonl 4.65MB

.gitignore 3KB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1018KB

13b-flan-oa-generations-topp0.9-temp0.7.jsonl 3.95MB

65b-guanaco-vicuna-generations-topp0.9-temp0.7.jsonl 290KB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 1010KB

7b-flan-oa-generations-topp0.9-temp0.7.jsonl 3.57MB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1018KB

vicuna-13b-oa-generations-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1MB

gpt-4-oa-generations-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.02MB

13b-guanaco-oa-generations-topp0.9-temp0.7-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

gpt-4-oa-generations-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1014KB

oa_questions.jsonl 3.87MB

30b-flan-oa-generations-topp0.9-temp0.7.jsonl 3.54MB

13b-guanaco-vicuna-generations-topp0.9-temp0.7.jsonl 262KB

30b-longform-oa-generations-topp0.9-temp0.7.jsonl 4.32MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.03MB

65b-hh-rlhf-oa-generations-topp0.9-temp0.7.jsonl 5.95MB

13b-self-instruct-oa-generations-topp0.9-temp0.7.jsonl 4.93MB

7b-guanaco-oa-generations-topp0.9-temp0.7.jsonl 6.12MB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1017KB

65b-alpaca-oa-generations-topp0.9-temp0.7.jsonl 4.24MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

vicuna_benchmark_human_annotations.csv 21.02MB

13b-longform-oa-generations-topp0.9-temp0.7.jsonl 6.65MB

gpt-3.5-oa-generations-vs-vicuna-13b-oa-generations-gpt-4-reviewer-threeclass.jsonl 1018KB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

7b-unnatural-instructions-oa-generations-topp0.9-temp0.7.jsonl 4.65MB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1.03MB

7b-alpaca-vicuna-generations-topp0.9-temp0.7.jsonl 153KB

vicuna-13b-oa-generations-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1MB

30b-hh-rlhf-vicuna-generations-topp0.9-temp0.7.jsonl 257KB

30b-chip2-oa-generations-topp0.9-temp0.7.jsonl 3.89MB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 1003KB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-30b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1020KB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-65b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1017KB

five_shot_mmlu_val.json 4.43MB

gpt-4-oa-generations-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 996KB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-4-oa-generations-gpt-4-reviewer-threeclass.jsonl 1.02MB

30b-hh-rlhf-oa-generations-topp0.9-temp0.7.jsonl 7.75MB

7b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 1003KB

7b-longform-oa-generations-topp0.9-temp0.7.jsonl 6.69MB

gpt-4-oa-generations-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

gpt-4-oa-generations.jsonl 4.73MB

zero_shot_mmlu_test.json 8.35MB

mturk_ui.html 4KB

13b-chip2-oa-generations-topp0.9-temp0.7.jsonl 4.19MB

65b-guanaco-oa-generations-topp0.9-temp0.7-vs-7b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.01MB

30b-guanaco-oa-generations-topp0.9-temp0.7.jsonl 5.87MB

vicuna-13b-oa-generations-vs-13b-guanaco-oa-generations-topp0.9-temp0.7-gpt-4-reviewer-threeclass.jsonl 1.02MB

30b-guanaco-oa-generations-topp0.9-temp0.7-vs-gpt-3.5-oa-generations-gpt-4-reviewer-threeclass.jsonl 1008KB

65b-self-instruct-oa-generations-topp0.9-temp0.7.jsonl 5.11MB

30b-alpaca-oa-generations-topp0.9-temp0.7.jsonl 4.16MB

7b-guanaco-vicuna-generations-topp0.9-temp0.7.jsonl 288KB

65b-hh-rlhf-vicuna-generations-topp0.9-temp0.7.jsonl 205KB

共 274 条

UnknownToKnown

粉丝: 1w+
资源: 773

QLoRA：大规模语言模型微调的量化工具

大语言模型LLM：微调、量化、推理.zip

巨型语言模型的 8 位量化：LLM.int8() 中文版论文

大语言模型LLM微调、量化、推理技术详解

大语言模型量化-对LLMs进行量化以进行搞笑Finetuning微调-附项目源码-优质项目分享.zip

大模型微调，使用intel资源微调chatglm

在您自己的数据上预训练、微调、部署 20+ LLM

关于使用且功能强大的NLP和LLM库令人惊叹的模型应用

关于举办《企业级生成式人工智能LLM大模型技术、算法及案例实战》线上高级研修讲座.pdf

LLM-Custome.zip

大语言模型班的作业.zip

最新资源