-
Notifications
You must be signed in to change notification settings - Fork 253
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
关于qwen1.5长序列训练的疑问 #761
Comments
|
感谢回答,还有疑问是:
File "/nas/macheng.ma/projects/xtuner-main/xtuner/parallel/sequence/attention.py", line 65, in sequence_parallel_attn File "/nas/macheng.ma/projects/xtuner-main/xtuner/parallel/sequence/attention.py", line 23, in pre_process_for_sequence_parallel_attn File "/opt/conda/lib/python3.8/site-packages/torch/_tensor.py", line 426, in repr
|
好的,感谢回答。 |
使用A100-32卡,xtuner训练qwen1.5-32b长序列,有下面几个疑问想请教一下:
The text was updated successfully, but these errors were encountered: