Commit 9c7e1d7

lcawl authored and github-actions[bot] committed
[DOCS] Clarify default rate limit settings (#4472)
(cherry picked from commit e4094f4)
1 parent da30074 commit 9c7e1d7

4 files changed: 26 additions, 4 deletions

output/openapi/elasticsearch-openapi.json

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

output/openapi/elasticsearch-serverless-openapi.json

Lines changed: 1 addition & 1 deletion

output/schema/schema.json

Lines changed: 3 additions & 2 deletions

specification/inference/_types/Services.ts

Lines changed: 21 additions & 0 deletions
@@ -286,9 +286,30 @@ export type ServiceSettings = UserDefinedValue
 
 export type TaskSettings = UserDefinedValue
 
+/**
+ * This setting helps to minimize the number of rate limit errors returned from the service.
+ */
 export class RateLimitSetting {
   /**
    * The number of requests allowed per minute.
+   * By default, the number of requests allowed per minute is set by each service as follows:
+   *
+   * * `alibabacloud-ai-search` service: `1000`
+   * * `anthropic` service: `50`
+   * * `azureaistudio` service: `240`
+   * * `azureopenai` service and task type `text_embedding`: `1440`
+   * * `azureopenai` service and task type `completion`: `120`
+   * * `cohere` service: `10000`
+   * * `elastic` service and task type `chat_completion`: `240`
+   * * `googleaistudio` service: `360`
+   * * `googlevertexai` service: `30000`
+   * * `hugging_face` service: `3000`
+   * * `jinaai` service: `2000`
+   * * `mistral` service: `240`
+   * * `openai` service and task type `text_embedding`: `3000`
+   * * `openai` service and task type `completion`: `500`
+   * * `voyageai` service: `2000`
+   * * `watsonxai` service: `120`
    */
   requests_per_minute?: integer
 }
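
The defaults listed in the doc comment can be read as a lookup keyed by service and, for some services, by task type. The following is a minimal sketch of that reading, not code from this commit or from Elasticsearch: the names `DEFAULT_REQUESTS_PER_MINUTE` and `resolveRequestsPerMinute` are hypothetical, and the assumption that an explicit `requests_per_minute` takes precedence over the service default is inferred from the field being optional.

```typescript
// Hypothetical illustration only; none of these names exist in the specification.
interface RateLimitSettingLike {
  requests_per_minute?: number
}

// Defaults copied from the doc comment in the diff above.
// Keys are either `service` or `service:task_type`.
const DEFAULT_REQUESTS_PER_MINUTE = new Map<string, number>([
  ['alibabacloud-ai-search', 1000],
  ['anthropic', 50],
  ['azureaistudio', 240],
  ['azureopenai:text_embedding', 1440],
  ['azureopenai:completion', 120],
  ['cohere', 10000],
  ['elastic:chat_completion', 240],
  ['googleaistudio', 360],
  ['googlevertexai', 30000],
  ['hugging_face', 3000],
  ['jinaai', 2000],
  ['mistral', 240],
  ['openai:text_embedding', 3000],
  ['openai:completion', 500],
  ['voyageai', 2000],
  ['watsonxai', 120]
])

// Assumed precedence: explicit setting, then task-specific default,
// then service-wide default. Returns undefined for unlisted combinations.
function resolveRequestsPerMinute(
  service: string,
  taskType: string,
  setting?: RateLimitSettingLike
): number | undefined {
  if (setting?.requests_per_minute !== undefined) {
    return setting.requests_per_minute
  }
  return (
    DEFAULT_REQUESTS_PER_MINUTE.get(`${service}:${taskType}`) ??
    DEFAULT_REQUESTS_PER_MINUTE.get(service)
  )
}
```

Under these assumptions, `resolveRequestsPerMinute('openai', 'completion')` yields the listed default of `500`, while passing `{ requests_per_minute: 100 }` as the third argument yields `100`. A combination the doc comment does not list, such as `openai` with `chat_completion`, yields `undefined` rather than a guessed value.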
