Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ROCm test artifacts not being published on HUD #5298

Open
jithunnair-amd opened this issue Jun 4, 2024 · 2 comments
Open

ROCm test artifacts not being published on HUD #5298

jithunnair-amd opened this issue Jun 4, 2024 · 2 comments

Comments

@jithunnair-amd
Copy link
Contributor

If a commit has mem_leak_check and rerun_disabled_tests jobs running along with the regular trunk workflows jobs, we find that the HUD page for the commit doesn't list the artifacts for the regular trunk workflow. E.g. https://hud.pytorch.org/pytorch/pytorch/commit/cf77e7dd9770caf65e898ac2ee82045aa0408e30#rocm
The linux-focal-rocm6.1-py3.8 / test (default, 1, 6, linux.rocm.gpu.2 jobs have a link for artifacts for the mem_leak_check and rerun_disabled_tests variants, but not for the regular one.

image

However, from the corresponding Github Actions page for the regular job, https://github.com/pytorch/pytorch/actions/runs/9363246389/job/25774767756#step:19:51, it looks like the artifact was succesfully uploaded to https://gha-artifacts.s3.amazonaws.com/pytorch/pytorch/9363246389/1/artifact/test-jsons-test-default-1-6-linux.rocm.gpu.2_25774767756.zip

@clee2000
Copy link
Contributor

For whomever ends up taking it up, its because we show multiple workflow runs in a box but only query for the artifacts of one workflow id

@huydhn
Copy link
Contributor

huydhn commented Jun 11, 2024

AI: Need to double check if this is a ROCM-thing or if this is a wide spread issue with other runners too

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Cold Storage
Development

No branches or pull requests

3 participants