LoRA trainable parameters #40

Open
MonkeyTB opened this issue Jun 6, 2023 · 3 comments
Labels: question (Further information is requested), wontfix (This will not be worked on)

Comments

MonkeyTB commented Jun 6, 2023

trainable params: 22021440 || all params: 6195307952 || trainable%: 0.35545351692954874

A quick question: after the update, why is the share of trainable LoRA parameters so much higher? It used to be only about 0.05%.

Output from the original, older LoRA version:
trainable params: 3670016 || all params: 6176956416 || trainable%: 0.05941463324063059
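The two log lines can be compared directly. The sketch below recomputes the `trainable%` figures from the reported counts and shows how much the trainable parameter count grew; the helper name `trainable_percent` is made up for illustration and is not part of any library.

```python
def trainable_percent(trainable: int, total: int) -> float:
    """Return the trainable share as a percentage, matching the log's trainable% field."""
    return 100 * trainable / total


# Counts copied from the two log lines in this thread.
new_pct = trainable_percent(22_021_440, 6_195_307_952)  # after the update
old_pct = trainable_percent(3_670_016, 6_176_956_416)   # older version

# How many times more parameters are trainable after the update.
growth = 22_021_440 / 3_670_016

print(f"new: {new_pct:.4f}%  old: {old_pct:.4f}%  growth: {growth:.2f}x")
```

The values are percentages, so the newer run trains roughly 0.36% of the model, about six times as many parameters as the older run.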

shibing624 (Owner) commented Jun 6, 2023

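Following the QLoRA paper, I changed the default value of lora_target_modules: it now targets all linear layers, so every dense layer takes part in training, which gives the best results.

To go back to the previous behavior, just set the lora_target_modules parameter yourself.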
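To see why targeting more layers raises the count, recall that LoRA adds two low-rank matrices to each adapted linear layer, for `rank * (d_in + d_out)` extra parameters per layer. The sketch below applies that formula. The dimensions (28 layers, hidden size 4096, a fused QKV projection, rank 8) are assumptions chosen for illustration; the thread does not state the model or the rank.

```python
def lora_param_count(shapes, rank):
    """Sum LoRA parameters over adapted linear layers.

    Each layer with weight shape (d_in, d_out) gets A (rank x d_in)
    and B (d_out x rank), i.e. rank * (d_in + d_out) parameters.
    """
    return sum(rank * (d_in + d_out) for d_in, d_out in shapes)


# Assumed, hypothetical model dimensions (not given in the thread).
HIDDEN = 4096
NUM_LAYERS = 28
RANK = 8

# Narrow target: only the fused query/key/value projection in each layer.
attention_only = [(HIDDEN, 3 * HIDDEN)] * NUM_LAYERS

# Wide target: every linear layer in each transformer block.
all_linear = [
    (HIDDEN, 3 * HIDDEN),   # fused QKV projection
    (HIDDEN, HIDDEN),       # attention output projection
    (HIDDEN, 4 * HIDDEN),   # MLP up projection
    (4 * HIDDEN, HIDDEN),   # MLP down projection
] * NUM_LAYERS

narrow = lora_param_count(attention_only, RANK)
wide = lora_param_count(all_linear, RANK)
print(narrow, wide, wide / narrow)
```

Under these assumed dimensions the attention-only count comes to 3670016, the same figure as the older log line. The all-linear count in this sketch does not reproduce the 22021440 from the newer log, because the actual set of adapted layers and the rank settings are not given in the thread.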

stale bot commented Dec 27, 2023

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. (The bot automatically closes issues after a long period of inactivity; feel free to ask again if needed.)

stale bot added the wontfix label Dec 27, 2023