Llama3-chinese: 大幅改进Llama3 中文能力

本文主要是介绍Llama3-chinese: 大幅改进Llama3 中文能力，希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

中文｜ English

请添加图片描述

介绍

Llama3-Chinese是以Meta-Llama-3-8B为底座，使用 DORA + LORA+ 的训练方法，在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。

Github:https://github.com/seanzhang-zhichen/llama3-chinese

请添加图片描述

模型下载

Model	Download
Meta-Llama-3-8B	🤗 HuggingFace 🤖 ModelScope
Llama3-Chinese-Lora	🤗 HuggingFace 🤖 ModelScope
Llama3-Chinese (合并好的模型)	🤗 HuggingFace 🤖 ModelScope

合并LORA模型（可跳过）

1、下载 Meta-Llama-3-8B

git clone https://www.modelscope.cn/LLM-Research/Meta-Llama-3-8B.git

2、下载Llama3-Chinese-Lora

From ModelScope

git lfs install
git clone https://www.modelscope.cn/seanzhang/Llama3-Chinese-Lora.git

From HuggingFace

git lfs install
git clone https://huggingface.co/zhichen/Llama3-Chinese-Lora

3、合并模型

python merge_lora.py \--base_model path/to/Meta-Llama-3-8B \--lora_model path/to/lora/Llama3-Chinese-Lora  \--output_dir ./Llama3-Chinese

下载 Llama3-Chinese（合并好的模型）

From ModelScope

git lfs install
git clone https://www.modelscope.cn/seanzhang/Llama3-Chinese.git

From HuggingFace

git lfs install
git clone https://huggingface.co/zhichen/Llama3-Chinese

vllm web 推理

1、使用vllm部署模型

python -m vllm.entrypoints.openai.api_server --served-model-name Llama3-Chinese --model ./Llama3-Chinese(换成你自己的合并后的模型路径)

2、在命令行执行

python vllm_web_demo.py --model Llama3-Chinese

训练数据集

匠数科技大模型sft数据集

LICENSE

本项目仅可应用于研究目的，项目开发者不承担任何因使用本项目（包含但不限于数据、模型、代码等）导致的危害或损失。详细请参考免责声明。

Llama3-Chinese项目代码的授权协议为 The Apache License 2.0，代码可免费用做商业用途，模型权重和数据只能用于研究目的。请在产品说明中附加Llama3-Chinese的链接和授权协议。

Citation

如果你在研究中使用了Llama3-Chinese，请按如下格式引用：

@misc{Llama3-Chinese,title={Llama3-Chinese},author={Zhichen Zhang, Xin LU, Long Chen},year={2024},howpublished={\url{https://github.com/seanzhang-zhichen/llama3-chinese}},
}

Acknowledgement

meta-llama/llama3

hiyouga/LLaMA-Factory

这篇关于Llama3-chinese: 大幅改进Llama3 中文能力的文章就介绍到这儿，希望我们推荐的文章对编程师们有所帮助！

Llama3-chinese: 大幅改进Llama3 中文能力

介绍

模型下载

合并LORA模型（可跳过）

下载 Llama3-Chinese（合并好的模型）

vllm web 推理

训练数据集

LICENSE

Citation

Acknowledgement

相关文章

一文教你解决Python不支持中文路径的问题

Python爬虫selenium验证之中文识别点选+图片验证码案例(最新推荐)

Java下载文件中文文件名乱码的解决方案(文件名包含很多%)

Go语言实现将中文转化为拼音功能

中文分词jieba库的使用与实景应用（一）

EasyPlayer.js网页H5 Web js播放器能力合集

vscode中文乱码问题，注释，终端，调试乱码一劳永逸版

一种改进的red5集群方案的应用、基于Red5服务器集群负载均衡调度算法研究

解决Office Word不能切换中文输入

sqlite不支持中文排序，采用java排序