Ollama本地部署Llama3-Chinese并进行测试-武穆逸仙 In November 2024

开源仓库

短短几天，开源社区 HuggingFace 上已经涌现了一批基于 Llama3 微调的中文版本。

Ollama 本地部署 Llama3-Chinese 并进行测试

llama3-chinese

GitHub 上 llama3-Chinese-chat 仓库也整理了目前各家微调的版本，包括：


联通微调版：https://www.modelscope.cn/models/UnicomAI/Unichat-llama3-Chinese/summary
Openbuddy 微调版：https://www.modelscope.cn/models/OpenBuddy/openbuddy-llama3-8b-v21.1-8k/summary
zhichen 微调版：https://github.com/seanzhang-zhichen/llama3-chinese
Rookie 微调版：https://github.com/Rookie1019/Llama-3-8B-Instruct-Chinese
shareAI-V1：https://opencsg.com/models/shareAI/llama3-Chinese-chat-8b
shareAI-V2：https://modelscope.cn/models/baicai003/Llama3-Chinese_v2/summary

Ollama 部署 Llama3-Chinese 模型

ollama 仓库也已经出现了 llama3-Chinese 的模型（基于 https://huggingface.co/zhichen/Llama3-Chinese 的 quantize q4_0 版本）。

但经本人测试效果比较差，不仅回答的不准确，而且提问第二个问题就没有任何响应了。所以，不推荐使用。

Ollama 本地部署 Llama3-Chinese 并进行测试

这里，我们使用 HuggingFace 社区的 zhouzr/Llama3-8B-Chinese-Chat-GGUF 模型，推荐下载使用 q4_k_m 版本：

地址：zhouzr/Llama3-8B-Chinese-Chat-GGUF at main (huggingface.co)

Ollama 本地部署 Llama3-Chinese 并进行测试

将下载的 GGUF 模型上传到指定位置，编写 Modelfile：


FROM ./Llama3-8B-Chinese-Chat.q4_k_m.GGUF
TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>

{{ .Response }}<|eot_id|>"""
PARAMETER stop "<|start_header_id|>"
PARAMETER stop "<|end_header_id|>"
PARAMETER stop "<|eot_id|>"
PARAMETER stop "<|reserved_special_token"

执行 ollama create llama3-Chinese:8B -f Modelfile 创建模型：



(base) [root@localhost my_model]# ollama create llama3-Chinese:8B -f Modelfile
transferring model data
creating model layer
creating template layer
creating parameters layer
creating config layer
using already created layer sha256:74db82a06a038230371e62740a9b430140e4df3a02c5ddcbe97c9bee76d6455e
writing layer sha256:8ab4849b038cf0abc5b1c9b8ee1443dca6b93a045c2272180d985126eb40bf6f
writing layer sha256:c0aac7c7f00d8a81a8ef397cd78664957fbe0e09f87b08bc7afa8d627a8da87f
writing layer sha256:109fb4827ddd6f21dd04a405dec5e1c9e39cf139e89b98536875a782938c02f5
writing manifest
success

执行 ollama list 查看：


(base) [root@localhost my_model]# ollama list
NAME                            ID              SIZE    MODIFIED
llama3-Chinese:8B               e45ad8ada59e    4.9 GB  33 seconds ago
qwen:14b-chat-v1.5-q5_K_M       ba0e61d66b27    10 GB   6 weeks ago

llama3-Chinese:8B 已经成功导入到 Ollama 中。