
Implement OpenAI Responses API instrumentation and examples#4166

Closed
vasantteja wants to merge 29 commits into open-telemetry:main from vasantteja:feat/instrument-openai-responses

Conversation

@vasantteja
Contributor

@vasantteja vasantteja commented Feb 5, 2026

Description

This PR adds OpenAI Responses API instrumentation (sync Responses.create) to opentelemetry-instrumentation-openai-v2, using TelemetryHandler. It also adds tests for Responses API create behavior.

Fixes #3436 partly
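For context, the wrapt-style patching applied to the sync `Responses.create` call follows roughly this shape. This is a simplified stand-in, not the PR's actual code: the real instrumentation records onto an OpenTelemetry span via `TelemetryHandler`, while here attributes are collected into a plain dict so the pattern is visible on its own.

```python
# Simplified sketch of a Responses.create wrapper (illustrative names);
# attributes go into a dict instead of an OpenTelemetry span.
def responses_create_wrapper(wrapped, instance, args, kwargs, record):
    model = kwargs.get("model", "unknown")
    record["gen_ai.operation.name"] = "responses"
    record["gen_ai.request.model"] = model
    try:
        result = wrapped(*args, **kwargs)
    except Exception as exc:
        # Record the error type before re-raising so the span can be
        # ended with an error status.
        record["error.type"] = type(exc).__qualname__
        raise
    response_model = getattr(result, "model", None)
    if response_model:
        record["gen_ai.response.model"] = response_model
    return result
```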

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

  • Responses API tests with VCR recordings
source .tox/py312-test-instrumentation-openai-v2-latest/bin/activate
pytest instrumentation-genai/opentelemetry-instrumentation-openai-v2/tests/test_responses.py --vcr-record=all -v

Does This PR Require a Core Repo Change?

  • Yes. - Link to PR:
  • No.

Checklist:

See contributing.md for styleguide, changelog guidelines, and more.

  • Followed the style guidelines of this project
  • Changelogs have been updated
  • Unit tests have been added
  • Documentation has been updated

- Added instrumentation for the OpenAI Responses API, including tracing for `Responses.create` and `Responses.stream` methods.
- Introduced example scripts demonstrating the usage of the Responses API with OpenTelemetry.
- Created a `.env` file for configuration, including API keys and OpenTelemetry settings.
- Updated README files to include instructions for running examples and configuring the environment.
- Added unit tests for the new Responses API functionality, ensuring proper tracing and metrics collection.

This update enhances the observability of OpenAI API interactions within the OpenTelemetry framework.
- Updated the OpenAIInstrumentor to conditionally wrap and unwrap the Responses API methods based on the installed OpenAI package version (>=1.66.0).
- Added version checks in the test suite to skip tests if the Responses API is not available, ensuring compatibility with earlier versions of the OpenAI library.
- Improved error handling for missing API methods to prevent runtime exceptions.
- Added pylint disable comments to suppress warnings for specific lines in the Responses API example and patch files.
- Updated the `responses_create` and `responses_stream` methods with links to relevant OpenAI documentation for better reference.
- Improved code formatting for readability by adjusting line breaks and indentation in the patch file.
- Reformatted code in the patch file to enhance readability by adjusting line breaks and indentation.
- Ensured consistent style for model retrieval and span name updates in the ResponseStreamWrapper class.
- Minor adjustments to import statements for clarity and organization.
- Updated the OpenAI package version in requirements.txt to 1.66.0 for compatibility.
- Refactored span name retrieval in the patch.py file to directly format the span name using operation and model attributes, removing the redundant _get_span_name function.
- Improved code clarity and consistency in the responses_create and responses_stream methods.
…omments

- Added a comment in the `responses_stream` method to clarify the purpose of avoiding duplicate span creation.
- Updated span name retrieval to use a default value of 'unknown' for the model attribute if not present, improving robustness.
- Refactored the `_record_metrics` function to directly access the request model from attributes, enhancing clarity and consistency.
- Added a new wrapper function `responses_retrieve` to trace the `retrieve` method of the `Responses` class, enhancing observability.
- Updated the `OpenAIInstrumentor` to include the new tracing functionality for the `retrieve` method.
- Enhanced test coverage for the `retrieve` method, including new test cases for both standard and streaming responses.
- Added new YAML cassettes to support the updated tests for the `retrieve` functionality.
- Added a TODO comment in the patch.py file to consider migrating Responses instrumentation to TelemetryHandler once content capture and streaming hooks are available.
- Included a reference link to the OpenAI responses.py file for context on the `retrieve` method.
- Introduced a new module `patch_responses.py` to handle tracing for the `Responses` class methods, including `create`, `stream`, and `retrieve`.
- Updated the `__init__.py` file to import the new responses patching functions.
- Enhanced test coverage with new YAML cassettes for various response scenarios, including standard and streaming responses.
- Removed outdated response tracing logic from `patch.py` to streamline the instrumentation process.
…strumentation

- Moved the `_record_metrics` function to the `utils.py` file for better organization and accessibility.
- Updated the `patch.py` file to import the `_record_metrics` function from `utils.py`, streamlining the code structure.
- Enhanced the `responses_retrieve` method to simplify span attribute checks and improve readability.
- Added new test cases and YAML cassettes to cover various response scenarios, including streaming and standard responses.
@JWinermaSplunk

This is a relatively large PR, is there any way it could be split up?

@vasantteja
Contributor Author

@JWinermaSplunk Sure, I removed the async stuff to make it smaller. I will remove stream and retrieve to shrink it further. FYI, I am rewriting this with TelemetryHandler so that we can start using shared utils. I am assuming that might make the PR a little bulky, so can I remove the examples?

…ry support

- Added `opentelemetry-util-genai` as a dependency for improved telemetry handling.
- Refactored response handling in `patch_responses.py` to utilize `TelemetryHandler` for tracing.
- Updated `responses_create` and `responses_retrieve` methods to integrate new telemetry features.
- Simplified imports and removed unused code in `__init__.py`.
- Added extensive test cases and YAML cassettes for various response scenarios, including streaming and standard responses.
- Adjusted requirements files to include the new utility package for testing.
…try features

- Updated `responses_create` and `responses_retrieve` methods to streamline content capture logic.
- Introduced helper functions for extracting input and output messages, and system instructions.
- Enhanced telemetry support by integrating content capture based on experimental mode settings.
- Added new test cases to validate content capture functionality in various scenarios, including streaming and standard responses.
- Created YAML cassettes for testing response handling and content capture behavior.
- Simplified imports and removed unused code in `__init__.py` and `patch_responses.py`.
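The opt-in content-capture gate described above can be sketched as an environment-variable check. The variable name below follows the genai instrumentation's experimental content-capture setting, but treat both the helper name and the variable as assumptions for illustration:

```python
import os

def should_capture_content():
    # Capture prompt/response content only when the operator explicitly
    # opts in; default is to record no message content.
    value = os.environ.get(
        "OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT", "false"
    )
    return value.strip().lower() == "true"
```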

@lmolkova
Member

@vasantteja it seems we have some intersection with this PR - #3715 (switching to new semconv for chat completion), would you mind reviewing it?

We'll probably have some merge conflicts, but not terrible ones

Member

@MikeGoldsmith MikeGoldsmith left a comment


Generally looks pretty good - left some suggestions and would like to reduce size of this PR if possible.


This is another 3000+ line PR across many files, and PRs of this size are hard to review with confidence. I know this is experimental and we want to move fast, but huge PRs make that challenging.

In this case, it's mostly VCR cassettes but I think we could break up into the following parts:

  1. util-genai only — eg should_capture_content() plus tests
  2. Responses API core (non-streaming)
  3. Responses API streaming

As pointed out by @lmolkova, this PR touches a lot of the same files as #3715, eg both modify utils.py, __init__.py, and patch.py.

Maybe we should focus on one first, then rebase the other to minimise conflicts?


Other things I noticed:

  • pyproject.toml has openai >= 1.26.0 but Responses API requires openai >= 1.66.0
  • can we separate some of the changes into other files to avoid having to disable pylint's too many lines? eg something like patch_chat.py

@tammy-baylis-swi tammy-baylis-swi moved this to Reviewed PRs that need fixes in Python PR digest Feb 26, 2026
@vasantteja vasantteja closed this Mar 6, 2026
@github-project-automation github-project-automation Bot moved this from Reviewed PRs that need fixes to Done in Python PR digest Mar 6, 2026
@vasantteja vasantteja reopened this Mar 6, 2026
@MikeGoldsmith
Member

@vasantteja looks like you may have some lint errors. You can run the precommit to clean them up.

@tammy-baylis-swi tammy-baylis-swi moved this from Done to Approved PRs that need fixes in Python PR digest Mar 6, 2026
@vasantteja vasantteja closed this Mar 16, 2026
@github-project-automation github-project-automation Bot moved this from Approved PRs that need fixes to Done in Python PR digest Mar 16, 2026
@vasantteja vasantteja reopened this Mar 16, 2026
@vasantteja vasantteja closed this Mar 16, 2026
@vasantteja vasantteja reopened this Mar 16, 2026
@vasantteja vasantteja marked this pull request as draft March 16, 2026 21:08
@MikeGoldsmith
Member

@vasantteja please can you merge main into this branch and resolve conflicts, then run precommit as there are lint errors.

