Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: quantization with DeepSpeed HE #5624

Open
wants to merge 5 commits into
base: master
Choose a base branch
from
Open

Conversation

Atry
Copy link

@Atry Atry commented Jun 6, 2024

When the model is quantized, the hidden sizes cannot be determined from ds_shape and shape, because they are 1 dimensional. This PR fixes the bug by determining hidden sizes from in_features and out_features.

This PR fixes #5398

When the model is quantized, the hidden size cannot be determined from `ds_shape` and `shape`, because they are 1 dimensional. This PR fixes the bug by determining hidden size from `in_features` and `out_features`.
@Atry
Copy link
Author

Atry commented Jun 10, 2024

@lekurile The CI error is transient:

huggingface_hub.utils._errors.HfHubHTTPError: 429 Client Error: Too Many Requests for url: https://huggingface.co/api/models?cursor=eyJfaWQiOnsiJGd0IjoiNjYzNGRlNzA2MzRhNjdjNGVmMmE4YWEwIn19

Could you retry CI?

@loadams
Copy link
Contributor

loadams commented Jun 26, 2024

@lekurile The CI error is transient:

huggingface_hub.utils._errors.HfHubHTTPError: 429 Client Error: Too Many Requests for url: https://huggingface.co/api/models?cursor=eyJfaWQiOnsiJGd0IjoiNjYzNGRlNzA2MzRhNjdjNGVmMmE4YWEwIn19

Could you retry CI?

Retrying CI.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Question] Deepspeed inference stage 3 + quantization
4 participants