是否可以使用llama.cpp自己对huggingface上的FlagAlpha/Llama3-Chinese-8B-Instruct模型进行量化，下面是我遇到的问题 #344

ZhichengQian1 · 2024-06-01T16:00:38Z

我使用的指令如下：
$ python convert.py models/llama3-8b-chinese --outfile models/llama3-8b-chinese-f16.gguf --outtype f16
$ ./quantize models/llama3-8b-chinese-f16.gguf models/llama3-cn-q4_0.gguf q4_0
但是在使用ollama create这个模型并且使用的时候得到报错：Ollama: 500 Internal Server Error invalid unordered_map<K, T> key

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

是否可以使用llama.cpp自己对huggingface上的FlagAlpha/Llama3-Chinese-8B-Instruct模型进行量化，下面是我遇到的问题 #344

是否可以使用llama.cpp自己对huggingface上的FlagAlpha/Llama3-Chinese-8B-Instruct模型进行量化，下面是我遇到的问题 #344

ZhichengQian1 commented Jun 1, 2024

是否可以使用llama.cpp自己对huggingface上的FlagAlpha/Llama3-Chinese-8B-Instruct模型进行量化，下面是我遇到的问题 #344

是否可以使用llama.cpp自己对huggingface上的FlagAlpha/Llama3-Chinese-8B-Instruct模型进行量化，下面是我遇到的问题 #344

Comments

ZhichengQian1 commented Jun 1, 2024