gravitee-io/gravitee-policy-ai-prompt-token-tracking
AI - Prompt Token Tracking


Overview

This policy lets you track the number of tokens sent to and received from an AI API.

Usage

Here are some examples of how to use the AI - Prompt Token Tracking policy.

Built-in support for OpenAI, Gemini, Claude, and Mistral

The plugin has built-in support for the following AI providers:

  • OpenAI (ChatGPT)
  • Google (Gemini)
  • Anthropic (Claude)
  • Mistral

Select the appropriate type in the configuration, and the plugin handles the token tracking automatically.

Custom Provider

When the API provider is not one of the built-in providers, use the CUSTOM type. With the CUSTOM type, you must provide a response body parsing configuration that matches the structure of your provider's API responses.

For example, the following configuration can be used to extract the token usage and model name from a custom AI API response:

{
  "id": "a6775254-dc2f-4411-9b1c-415f3ba8ee8d",
  "my_model": "LLAAMA",
  "result": "a result",
  "my_usage": {
    "promptUsage": 100,
    "responseUsage": 8
  }
}
  • Sent tokens count pointer: my_usage.promptUsage
  • Received tokens count pointer: my_usage.responseUsage
  • Model pointer: my_model
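For the sample response above, a matching CUSTOM extraction block might look like the following sketch. The JSON Pointer paths (`/my_usage/promptUsage`, etc.) are an assumption based on the pointer syntax used in the CUSTOM example later in this document; adapt them to your provider's actual response structure.

```json
{
  "extraction": {
    "type": "CUSTOM",
    "inputTokenPointer": "/my_usage/promptUsage",
    "outputTokenPointer": "/my_usage/responseUsage",
    "modelPointer": "/my_model"
  }
}
```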

Phases

The ai-prompt-token-tracking policy can be applied to the following API types and flow phases.

Compatible API types

  • PROXY

Supported flow phases:

  • Response

Compatibility matrix

Strikethrough text indicates that a version is deprecated.

Plugin version   APIM version        Java version
1.x              4.8.x and 4.9.x     21
2.x and later    4.10.x and later    21

Configuration options

Name                    json name    Type     Description
Response body parsing   extraction   object   See "Response body parsing" section.
Cost                    pricing      object   See "Cost" section.

Response body parsing (Object)

Name   json name   Type     Description
Type   type        object   Type of Response body parsing. Values: GPT, GEMINI, CLAUDE, MISTRAL, CUSTOM

Response body parsing: ChatGPT by OpenAI type = "GPT"

No properties.

Response body parsing: Gemini by Google type = "GEMINI"

No properties.

Response body parsing: Claude by Anthropic type = "CLAUDE"

No properties.

Response body parsing: Mistral type = "MISTRAL"

No properties.

Response body parsing: Custom provider type = "CUSTOM"

Name                     json name            Type     Description
Sent token count EL      inputTokenPointer    string   A Gravitee Expression Language expression that returns the number of tokens sent to the LLM
Model pointer            modelPointer         string   A Gravitee Expression Language expression that returns the model of the LLM
Received token count EL  outputTokenPointer   string   A Gravitee Expression Language expression that returns the number of tokens received from the LLM

Cost (Object)

Name   json name   Type     Description
Type   type        object   Type of Cost. Values: none, pricing

Cost: No cost calculation type = "none"

No properties.

Cost: Cost calculation type = "pricing"

Name                      json name          Type     Constraint    Description
Input Token Price Unit    inputPriceUnit     number   (0, +Inf]     Input Token Price Unit
Input Token Price Value   inputPriceValue    number   (0, +Inf]     Input Token Price Value
Output Token Price Unit   outputPriceUnit    number   (0, +Inf]     Output Token Price Unit
Output Token Price Value  outputPriceValue   number   (0, +Inf]     Output Token Price Value
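The document does not spell out the cost formula, but the example configuration below (0.4 with a unit of 1000000 for input tokens) suggests that the price value is charged per price unit of tokens, i.e. cost = tokens / priceUnit * priceValue for each direction. A minimal sketch under that assumption (the function name and signature are illustrative, not part of the policy's API):

```python
def prompt_cost(input_tokens: int, output_tokens: int,
                input_price_value: float, input_price_unit: float,
                output_price_value: float, output_price_unit: float) -> float:
    """Estimate a request's cost, assuming price_value is charged per price_unit tokens."""
    input_cost = input_tokens / input_price_unit * input_price_value
    output_cost = output_tokens / output_price_unit * output_price_value
    return input_cost + output_cost

# With the example pricing (0.4 per 1M input tokens, 0.8 per 1M output tokens)
# and the sample usage above (100 prompt tokens, 8 response tokens):
cost = prompt_cost(100, 8, 0.4, 1_000_000, 0.8, 1_000_000)  # ≈ 4.64e-05
```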

Examples

Calculate usage cost for OpenAI ChatGPT API

{
  "api": {
    "definitionVersion": "V4",
    "type": "PROXY",
    "name": "AI - Prompt Token Tracking example API",
    "flows": [
      {
        "name": "Common Flow",
        "enabled": true,
        "selectors": [
          {
            "type": "HTTP",
            "path": "/",
            "pathOperator": "STARTS_WITH"
          }
        ],
        "response": [
          {
            "name": "AI - Prompt Token Tracking",
            "enabled": true,
            "policy": "ai-prompt-token-tracking",
            "configuration":
              {
                  "extraction": {
                      "type": "GPT"
                  },
                  "pricing": {
                      "inputPriceValue": 0.4,
                      "inputPriceUnit": 1000000,
                      "outputPriceValue": 0.8,
                      "outputPriceUnit": 1000000
                  }
              }
          }
        ]
      }
    ]
  }
}

Track token usage only on a custom API response

{
  "api": {
    "definitionVersion": "V4",
    "type": "PROXY",
    "name": "AI - Prompt Token Tracking example API",
    "flows": [
      {
        "name": "Common Flow",
        "enabled": true,
        "selectors": [
          {
            "type": "HTTP",
            "path": "/",
            "pathOperator": "STARTS_WITH"
          }
        ],
        "response": [
          {
            "name": "AI - Prompt Token Tracking",
            "enabled": true,
            "policy": "ai-prompt-token-tracking",
            "configuration":
              {
                  "extraction": {
                      "type": "CUSTOM",
                      "inputTokenPointer": "/usage/custom_prompt_tokens",
                      "outputTokenPointer": "/usage/custom_completion_tokens",
                      "modelPointer": "/custom_model"
                  },
                  "pricing": {
                      "type": "none"
                  }
              }
          }
        ]
      }
    ]
  }
}

Changelog

2.0.0 (2025-12-15)

Features
  • make the policy compatible with 4.10 (5cfa0cf)
BREAKING CHANGES
  • requires 4.10

1.2.0 (2025-11-13)

Bug Fixes
  • relax the Content-Type check to handle more cases (c8c57f0)
Features
  • align metrics naming on llm proxy (a53be09)

1.1.0 (2025-08-27)

Features
  • update form to provide el metadata (6c197d7)

1.0.1 (2025-06-19)

Bug Fixes
  • ignore error when token usage not found in response (2718c28)

1.0.0 (2025-06-17)

Features
  • extract token sent, received and model of LLM queries (c95d63e)
