[Feature]: Rate limit per model per key #4144

Open
krrishdholakia opened this issue Jun 12, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@krrishdholakia
Contributor

The Feature

Allow admin to set tpm/rpm limits per model per key

Motivation, pitch

Hi, I want to distribute one key to each of my customer projects. Each key gives access to a predefined list of models (associated with the key; this is already possible), but with a different tpm/rpm for each model. I don't want to rate limit them globally across multiple models; I need a finer-grained rate limit, per key and per model.
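To make the request concrete, here is a minimal sketch of the behavior being asked for. It is not an existing API: `ModelLimit`, `PerKeyModelLimiter`, the key name, and the model names are all hypothetical, and the in-memory fixed one-minute window is an assumption chosen for brevity.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class ModelLimit:
    """Hypothetical per-(key, model) limits; not an existing API."""
    rpm: int  # max requests per minute
    tpm: int  # max tokens per minute


@dataclass
class _Window:
    start: float       # clock value when the current window opened
    requests: int = 0  # requests counted in this window
    tokens: int = 0    # tokens counted in this window


class PerKeyModelLimiter:
    """Fixed-window limiter tracked separately for each (key, model) pair."""

    def __init__(
        self,
        limits: Dict[str, Dict[str, ModelLimit]],
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._limits = limits
        self._clock = clock
        self._windows: Dict[Tuple[str, str], _Window] = {}

    def allow(self, key: str, model: str, tokens: int) -> bool:
        limit = self._limits.get(key, {}).get(model)
        if limit is None:
            # The model is not in the key's allowed list.
            return False

        now = self._clock()
        window = self._windows.get((key, model))
        if window is None or now - window.start >= 60.0:
            # Open a fresh one-minute window for this pair only.
            window = _Window(start=now)
            self._windows[(key, model)] = window

        if window.requests + 1 > limit.rpm or window.tokens + tokens > limit.tpm:
            return False

        window.requests += 1
        window.tokens += tokens
        return True


# Assumed example data: one customer key with different limits per model.
limits = {
    "customer-project-1": {
        "model-a": ModelLimit(rpm=2, tpm=100),
        "model-b": ModelLimit(rpm=5, tpm=1000),
    }
}
limiter = PerKeyModelLimiter(limits)
```

With this shape, exhausting `model-a` on a key does not affect `model-b` on the same key, which is the finer-grained behavior described above. A real implementation would likely need shared storage across proxy instances rather than a local dict.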

Twitter / LinkedIn details

cc: @olad32

@krrishdholakia krrishdholakia added the enhancement New feature or request label Jun 12, 2024