Difference between "i2v_subject" and "subject_consistency" dimensions #34

Open
AshkanTaghipour opened this issue Jun 3, 2024 · 6 comments

@AshkanTaghipour

No description provided.

@ziqihuangg
Contributor

Hi, "i2v_subject" under "I2V" refers to subject consistency between the input image and the generated videos, whereas "subject_consistency" under "Video Quality" refers to subject consistency among the generated video frames.
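As a rough illustration of that distinction (a simplified sketch, not the VBench implementation), assume each image or frame has already been turned into a feature vector by some image encoder. The function names and the toy feature vectors below are hypothetical, and VBench's own feature extractor and aggregation may differ:

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two 1-D feature vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def image_to_video_consistency(image_feat, frame_feats):
    """'i2v_subject'-style idea: compare the INPUT IMAGE to each generated frame."""
    return float(np.mean([cosine(image_feat, f) for f in frame_feats]))

def within_video_consistency(frame_feats):
    """'subject_consistency'-style idea: compare generated frames to EACH OTHER
    (here, each frame to the one before it)."""
    pairs = zip(frame_feats[:-1], frame_feats[1:])
    return float(np.mean([cosine(a, b) for a, b in pairs]))

# Toy features (assumed, for illustration only): the video is perfectly
# self-consistent, but its subject does not match the input image at all.
image_feat = np.array([1.0, 0.0])
frame_feats = [np.array([0.0, 1.0]) for _ in range(3)]

i2v_score = image_to_video_consistency(image_feat, frame_feats)
video_score = within_video_consistency(frame_feats)
```

With these toy vectors the within-video score is high while the image-to-video score is low, which is the case the two dimensions are meant to tell apart.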

@AshkanTaghipour
Author

AshkanTaghipour commented Jun 3, 2024

Thank you very much.
The "i2v_subject" dimension has not been reported for the base I2V models on the leaderboard, correct?

@ziqihuangg
Contributor

The preliminary results have been posted on our leaderboard under "VBench-I2V", under the "Video-Image Subject Consistency" dimension. We want to clarify that these results can be replicated using our current open-source VBench-I2V code. However, we are considering optimizations to the evaluation pipeline to align it even more closely with human perception. Therefore, there may be a slight update to the leaderboard results later this month.

@ziqihuangg self-assigned this on Jun 4, 2024
@ziqihuangg
Contributor

[Screenshot attached: 2024-06-04, 10:36:08 AM]

@ziqihuangg changed the title from 'is there any difference between "i2v_subject" and "subject_consistency" ?' to 'Difference between "i2v_subject" and "subject_consistency" dimensions' on Jun 4, 2024
@AshkanTaghipour
Author

Thank you very much for your response.

@AshkanTaghipour
Author

Could you please clarify the temporal flickering dimension for the I2V suite? I was unable to reproduce the leaderboard results for SVD; I obtained a score of 0.9476 for SVD-XT. Additionally, I could not find any usage of 'temporal_flickering' in the I2V part of the VBench repository.
Is temporal flickering as effective an evaluation dimension for I2V as it is for T2V, or would the other dimensions listed for I2V be enough for I2V evaluation?
