Commit a3287b5

add docs
1 parent e6acb25 commit a3287b5

File tree: 1 file changed, +4 -3 lines


docs/en/latest/plugins/ai-rate-limiting.md

Lines changed: 4 additions & 3 deletions
```diff
@@ -35,17 +35,18 @@ The `ai-rate-limiting` plugin enforces token-based rate limiting for requests sent
 
 | Name | Type | Required | Description |
 | ---- | ---- | -------- | ----------- |
-| `limit` | integer | false | The maximum number of tokens allowed to consume within a given time interval. At least one of `limit` and `instances.limit` should be configured. |
-| `time_window` | integer | false | The time interval corresponding to the rate limiting `limit` in seconds. At least one of `time_window` and `instances.time_window` should be configured. |
+| `limit` | integer | conditionally | The maximum number of tokens allowed to consume within a given time interval. At least one of `limit` and `instances.limit` should be configured. |
+| `time_window` | integer | conditionally | The time interval corresponding to the rate limiting `limit` in seconds. At least one of `time_window` and `instances.time_window` should be configured. |
 | `show_limit_quota_header` | boolean | false | If true, include `X-AI-RateLimit-Limit-*` to show the total quota, `X-AI-RateLimit-Remaining-*` to show the remaining quota in the response header, and `X-AI-RateLimit-Reset-*` to show the number of seconds left for the counter to reset, where `*` is the instance name. Default: `true` |
 | `limit_strategy` | string | false | Type of token to apply rate limiting. `total_tokens`, `prompt_tokens`, and `completion_tokens` values are returned in each model response, where `total_tokens` is the sum of `prompt_tokens` and `completion_tokens`. Default: `total_tokens` |
-| `instances` | array[object] | false | LLM instance rate limiting configurations. |
+| `instances` | array[object] | conditionally | LLM instance rate limiting configurations. |
 | `instances.name` | string | true | Name of the LLM service instance. |
 | `instances.limit` | integer | true | The maximum number of tokens allowed to consume within a given time interval. |
 | `instances.time_window` | integer | true | The time interval corresponding to the rate limiting `limit` in seconds. |
 | `rejected_code` | integer | false | The HTTP status code returned when a request exceeding the quota is rejected. Default: `503` |
 | `rejected_msg` | string | false | The response body returned when a request exceeding the quota is rejected. |
 
+If `limit` is configured, `time_window` must also be configured. Otherwise, specifying only `instances` suffices.
 ## Example
 
 Create a route as such and update with your LLM providers, models, API keys, and endpoints:
```
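As a hedged illustration of the conditional requirement this commit documents (the instance name and all numeric values below are invented for this sketch, not taken from the commit), a plugin configuration may omit the top-level `limit`/`time_window` pair as long as per-instance limits are supplied:

```json
{
  "ai-rate-limiting": {
    "instances": [
      {
        "name": "openai-instance",
        "limit": 100,
        "time_window": 60
      }
    ],
    "limit_strategy": "total_tokens",
    "rejected_code": 429,
    "rejected_msg": "Token quota exhausted, please retry later"
  }
}
```

Conversely, per the added note, a configuration that sets only the top-level `limit` without `time_window` would be incomplete.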

0 commit comments
