Describe the bug
GPU memory consumption is about 500 MB higher than normal.
Your hardware and system info
GPU: NVIDIA 4060 Ti 16GB
Additional context
Command used: CUDA_VISIBLE_DEVICES=0 swift sft --model_id_or_path qwen/Qwen-7B-Chat --custom_train_dataset_path identity.json --save_steps 500 --lora_target_modules ALL --learning_rate 5e-5 --gradient_accumulation_steps 8 --batch_size 2
The training fails to run, and the export merge-lora command fails to run as well.
Fixed. This was caused by using device_map=auto during initialization; that technique's GPU memory estimation is imprecise.
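As a hedged sketch of the workaround described above: instead of letting `device_map="auto"` estimate free memory per device (which can overshoot by a few hundred MB), the model can be pinned entirely to one GPU with an explicit device map. The `single_device_map` helper below is hypothetical, but the empty-string key placing the whole model on one device follows the accelerate/transformers convention.

```python
def single_device_map(device_index=0):
    """Build an explicit device_map dict that pins the entire model to one GPU.

    The empty-string key is the accelerate convention for "the whole model",
    so no per-layer free-memory estimation is performed at load time.
    """
    return {"": device_index}

# Usage with the transformers API (assumed; requires the model weights locally):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     "qwen/Qwen-7B-Chat",
#     device_map=single_device_map(0),  # whole model on GPU 0, no auto dispatch
# )
```

Whether this explicit map is exposed through the `swift sft` CLI depends on the ms-swift version; the sketch only illustrates the underlying loading behavior.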