Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Idea: hyperparameter searching #181

Open
pfeatherstone opened this issue Aug 31, 2023 · 2 comments
Open

Idea: hyperparameter searching #181

pfeatherstone opened this issue Aug 31, 2023 · 2 comments

Comments

@pfeatherstone
Copy link
Contributor

This library offers so many useful parameters to tweak your architecture.
However, though @lucidrains offers insights from the papers and from experience, what works and what doesn't work ultimately depends on your data and your compute.
It's a bit daunting trying to figure out how to tune your model.
Rather than doing a blind manual search, maybe a hyperparameter searching algorithm would be a good idea. Maybe something like a genetic algorithm or similar.

@pfeatherstone
Copy link
Contributor Author

Or maybe use something like https://github.com/optuna/optuna

@pfeatherstone
Copy link
Contributor Author

This probably doesn't need to be added to the library. But maybe an example snippet or jupyter notebook would be cool. I might try it at some point. If i have some success, I'll submit a PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant