openai.RateLimitError - Unable to use Azure Open AI for multi modal input

Question

I am using Azure Open AI (gpt-4o) model. With simple text input, the result is returning fine. When provided with image input (multi modal), to extract the data from image it throws error of ratelimiterror. I am currently using Free Trail version. I hope this will not impact the usage of multi modal input.

Error below

"openai.RateLimitError: Error code: 429 - {'error': {'code': '429', 'message': 'Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2024-08-01-preview have exceeded token rate limit of your current AIServices S0 pricing tier. Please retry after 86400 seconds. Please contact Azure support service if you would like to further increase the default rate limit.'}}"

Answer

Hello Harish,

Welcome to the Microsoft Q&A and thank you for posting your questions here.

I understand that you are having openai.RateLimitError - Unable to use Azure Open AI for multi modal input.

This is one of the common errors, and it has been resolved on this platform. Kindly follow the links below to resolve it:

I hope this is helpful! Do not hesitate to let me know if you have any other questions.

Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.

Answer

Harish Greetings!

"openai.RateLimitError: Error code: 429 - {'error': {'code': '429', 'message': 'Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2024-08-01-preview have exceeded token rate limit of your current AIServices S0 pricing tier. Please retry after 86400 seconds. Please contact Azure support service if you would like to further increase the default rate limit.'}}"

The error message is related to rate limits, which is a common practice in APIs to prevent abuse and ensure fair usage.

Did you check if you have exceeded the quota limit for your Azure OpenAI resources?

To give more context, As requests come into the deployment endpoint, the estimated max-processed-token count is added to a running token count of all requests that is reset each minute. If at any time during that minute, the TPM rate limit value is reached, then further requests will receive a 429 response code until the counter resets. For more details, see Understanding rate limits.

Please see Manage Azure OpenAI Service quota for more details.

You could also try increasing the limit on your deployment. User's image

To minimize issues related to rate limits, it's a good idea to use the following techniques:

Set max_tokens and best_of to the minimum values that serve the needs of your scenario. For example, don’t set a large max-tokens value if you expect your responses to be small.
Use quota management to increase TPM on deployments with high traffic, and to reduce TPM on deployments with limited needs.
Implement retry logic in your application.
Avoid sharp changes in the workload. Increase the workload gradually.
Test different load increase patterns.

Also, see A Guide to Limits, Quotas, and Best Practices for more details.

Hope this helps. Do let me know if you have any further queries.

If the response helped, please do click Accept Answer and Yes for was this answer helpful.

Doing so would help other community members with similar issue identify the solution. I highly appreciate your contribution to the community.

Share via

openai.RateLimitError - Unable to use Azure Open AI for multi modal input

2 answers

Your answer