
After fine-tuning GLM4v with LoRA, running the merged model raises an error #1163

Open
hyyuan123 opened this issue Jun 18, 2024 · 3 comments

Comments
@hyyuan123

Describe the bug
What the bug is, and how to reproduce, better with screenshots

Running inference with CUDA_VISIBLE_DEVICES=0 swift infer --ckpt_dir output/glm4v-9b-chat/v1-20240617-191301/checkpoint-150-merged/ --load_dataset_config true fails with an error:
[screenshot of the error]
However, running it directly against the unmerged checkpoint with CUDA_VISIBLE_DEVICES=0 swift infer --ckpt_dir output/glm4v-9b-chat/v1-20240617-191301/checkpoint-150 --load_dataset_config true works fine, with no error.
I previously used this framework to train internvl-v1.5 and both training and inference worked without issues. What is causing this problem, and how can I fix it?
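For context, here is a minimal sketch of what the "merge" step conceptually does, assuming the checkpoint is a standard PEFT LoRA adapter. This is only an illustration with the peft library, not the ms-swift implementation; the base-model ID is assumed, and the paths are copied from the commands above.

```python
# Illustrative only: fold LoRA adapter weights back into the base model with peft.
# The base-model ID ("THUDM/glm-4v-9b") is an assumption, not taken from the issue.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "THUDM/glm-4v-9b",           # assumed base model
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
)

# Load the LoRA adapter on top of the base model.
model = PeftModel.from_pretrained(
    base, "output/glm4v-9b-chat/v1-20240617-191301/checkpoint-150"
)

# Merge: W' = W + (alpha / r) * B @ A, then drop the adapter modules.
merged = model.merge_and_unload()
merged.save_pretrained("output/glm4v-9b-chat/v1-20240617-191301/checkpoint-150-merged")

# The tokenizer (and any custom code files for trust_remote_code models)
# must also be available in the merged directory for standalone loading.
tokenizer = AutoTokenizer.from_pretrained("THUDM/glm-4v-9b", trust_remote_code=True)
tokenizer.save_pretrained("output/glm4v-9b-chat/v1-20240617-191301/checkpoint-150-merged")
```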

hyyuan123 changed the title from "After fine-tuning GLMV4 with LoRA, running the merged model raises an error" to "After fine-tuning GLM4v with LoRA, running the merged model raises an error" on Jun 18, 2024
@tastelikefeet
Collaborator

Which version of swift are you using?

@hyyuan123
Author

Which version of swift are you using?

ms-swift == 2.1.0

@XuRui314

Was your training loss normal?

#1091 (comment)
