deepgram
diff --git a/‎.agents/skills/deepgram-java-audio-intelligence/SKILL.md‎
Lines changed: 135 additions & 0 deletions b/‎.agents/skills/deepgram-java-audio-intelligence/SKILL.md‎
Lines changed: 135 additions & 0 deletions
diff --git a/‎.agents/skills/deepgram-java-conversational-stt/SKILL.md‎
Lines changed: 102 additions & 0 deletions b/‎.agents/skills/deepgram-java-conversational-stt/SKILL.md‎
Lines changed: 102 additions & 0 deletions
diff --git a/‎.agents/skills/deepgram-java-maintaining-sdk/SKILL.md‎
Lines changed: 113 additions & 0 deletions b/‎.agents/skills/deepgram-java-maintaining-sdk/SKILL.md‎
Lines changed: 113 additions & 0 deletions
@@ -0,0 +1,135 @@
+---
+name: deepgram-java-audio-intelligence
+description: Use when writing or reviewing Java code in this repo that enables Deepgram intelligence overlays on `/v1/listen` audio transcription - diarization, entity detection, sentiment, summarize, topics, intents, language detection, and redaction. Same endpoint as plain STT, but with extra request fields on `ListenV1RequestUrl` or `MediaTranscribeRequestOctetStream`. Use `deepgram-java-speech-to-text` for plain transcripts and `deepgram-java-text-intelligence` for analysis on existing text. Triggers include "audio intelligence", "diarize", "summarize audio", "sentiment from audio", "topic detection", and "redact".
+---
+
+# Using Deepgram Audio Intelligence (Java SDK)
+
+Audio intelligence is not a separate client in this SDK. It is the **Listen V1 REST request surface** with additional analysis fields enabled.
+
+## When to use this product
+
+- You have **audio** and want transcript + analysis together.
+- REST is the main path; the Java WebSocket client only exposes the real-time subset.
+
+**Use a different skill when:**
+- You want plain transcription only → `deepgram-java-speech-to-text`.
+- You already have text and only need text analysis → `deepgram-java-text-intelligence`.
+- You need turn-aware conversational streaming → `deepgram-java-conversational-stt`.
+
+## Authentication
+
+```java
+import com.deepgram.DeepgramClient;
+
+DeepgramClient client = DeepgramClient.builder()
+        .apiKey(System.getenv("DEEPGRAM_API_KEY"))
+        .build();
+```
+
+## Quick start — REST with repo-backed example pattern
+
+```java
+import com.deepgram.resources.listen.v1.media.requests.ListenV1RequestUrl;
+import com.deepgram.resources.listen.v1.media.types.MediaTranscribeRequestModel;
+import com.deepgram.resources.listen.v1.media.types.MediaTranscribeResponse;
+
+ListenV1RequestUrl request = ListenV1RequestUrl.builder()
+        .url("https://dpgr.am/spacewalk.wav")
+        .model(MediaTranscribeRequestModel.NOVA3)
+        .smartFormat(true)
+        .punctuate(true)
+        .diarize(true)
+        .language("en-US")
+        .build();
+
+MediaTranscribeResponse result = client.listen().v1().media().transcribeUrl(request);
+```
+
+The concrete repo example (`examples/listen/AdvancedOptions.java`) demonstrates the same pattern for enabling higher-value Listen options via the builder.
+
+## What else the REST request surface supports
+
+The generated `ListenV1RequestUrl` and `MediaTranscribeRequestOctetStream` classes also expose these verified analysis fields in this checkout:
+
+- `sentiment`
+- `summarize`
+- `topics`
+- `customTopic`
+- `customTopicMode`
+- `intents`
+- `customIntent`
+- `customIntentMode`
+- `detectEntities`
+- `detectLanguage`
+- `diarize`
+- `redact`
+
+## Quick start — WebSocket subset
+
+```java
+import com.deepgram.resources.listen.v1.websocket.V1ConnectOptions;
+import com.deepgram.resources.listen.v1.websocket.V1WebSocketClient;
+import com.deepgram.types.ListenV1Model;
+import java.util.concurrent.TimeUnit;
+
+V1WebSocketClient wsClient = client.listen().v1().v1WebSocket();
+wsClient.onResults(result -> System.out.println(result));
+
+wsClient.connect(V1ConnectOptions.builder()
+        .model(ListenV1Model.NOVA3)
+        .diarize(true)
+        .build())
+        .get(10, TimeUnit.SECONDS);
+```
+
+In this Java checkout, the WebSocket connect options include `diarize`, `detectEntities`, `redact`, and the normal streaming transcription controls, but **not** `summarize`, `topics`, `intents`, or `detectLanguage`.
+
+## Key parameters / API surface
+
+- REST builders: `ListenV1RequestUrl` and `MediaTranscribeRequestOctetStream`
+- REST analysis fields verified in source: `sentiment`, `summarize`, `topics`, `customTopic`, `customTopicMode`, `intents`, `customIntent`, `customIntentMode`, `detectEntities`, `detectLanguage`, `diarize`, `redact`
+- Helpful transcription companions: `smartFormat`, `punctuate`, `paragraphs`, `utterances`, `numerals`, `keywords`, `keyterm`, `replace`, `search`
+- WebSocket subset: `diarize`, `detectEntities`, `redact`, plus standard live transcription options
+
+## API reference (layered)
+
+1. **In-repo source of truth**: `src/main/java/com/deepgram/resources/listen/v1/media/requests/` and `src/main/java/com/deepgram/resources/listen/v1/websocket/` plus `examples/listen/AdvancedOptions.java`. `reference.md` is absent here.
+2. **Canonical OpenAPI (REST)**: https://developers.deepgram.com/openapi.yaml
+3. **Canonical AsyncAPI (WSS subset)**: https://developers.deepgram.com/asyncapi.yaml
+4. **Context7**: `/llmstxt/developers_deepgram_llms_txt`
+5. **Product docs**:
+   - https://developers.deepgram.com/docs/stt-intelligence-feature-overview
+   - https://developers.deepgram.com/docs/summarization
+   - https://developers.deepgram.com/docs/topic-detection
+   - https://developers.deepgram.com/docs/intent-recognition
+   - https://developers.deepgram.com/docs/sentiment-analysis
+   - https://developers.deepgram.com/docs/language-detection
+   - https://developers.deepgram.com/docs/redaction
+   - https://developers.deepgram.com/docs/diarization
+
+## Gotchas
+
+1. **There is no separate “audio intelligence client”.** Everything hangs off Listen V1.
+2. **Most intelligence fields are REST-only in this SDK surface.** The WebSocket connect options do not expose `summarize`, `topics`, `intents`, or `detectLanguage`.
+3. **`summarize` on Listen V1 is its own generated type.** Do not assume the Read API shape is identical.
+4. **The repo example only demonstrates diarization-level options.** There is no dedicated example file for sentiment/topics/intents in this checkout.
+5. **`redact` is currently a single `String` field on the REST builders.** Do not assume Python-style string-or-list support here.
+6. **Model support matters.** The examples consistently use `NOVA3`; follow that unless you have verified another model supports the overlays you need.
+7. **These fields live on both URL and byte-upload request builders.** Pick the builder that matches your input source.
+
+## Example files in this repo
+
+- `examples/listen/AdvancedOptions.java`
+- `examples/listen/TranscribeUrl.java`
+- `examples/listen/FileUploadTypes.java`
+
+## Central product skills
+
+For cross-language Deepgram product knowledge — the consolidated API reference, documentation finder, focused runnable recipes, third-party integration examples, and MCP setup — install the central skills:
+
+```bash
+npx skills add deepgram/skills
+```
+
+This SDK ships language-idiomatic code skills; `deepgram/skills` ships cross-language product knowledge (see `api`, `docs`, `recipes`, `examples`, `starters`, `setup-mcp`).
@@ -0,0 +1,102 @@
+---
+name: deepgram-java-conversational-stt
+description: Use when writing or reviewing Java code in this repo that calls Deepgram Conversational STT v2 / Flux over `/v2/listen`. Covers `client.listen().v2().v2WebSocket()`, `V2ConnectOptions`, `onTurnInfo`, and turn-aware close handling. Use `deepgram-java-speech-to-text` for standard v1 transcription and `deepgram-java-voice-agent` for fully interactive assistants. Triggers include "flux", "conversational stt", "listen v2", "turn detection", "end of turn", and "eot".
+---
+
+# Using Deepgram Conversational STT / Flux (Java SDK)
+
+Turn-aware streaming transcription over `/v2/listen` for conversational audio.
+
+## When to use this product
+
+- You want explicit turn events, not just regular interim/final transcript chunks.
+- You are building conversational UX where end-of-turn timing matters.
+
+**Use a different skill when:**
+- You need general-purpose STT over REST or classic streaming → `deepgram-java-speech-to-text`.
+- You need a hosted interactive assistant → `deepgram-java-voice-agent`.
+
+## Authentication
+
+```java
+import com.deepgram.DeepgramClient;
+
+DeepgramClient client = DeepgramClient.builder()
+        .apiKey(System.getenv("DEEPGRAM_API_KEY"))
+        .build();
+```
+
+## Quick start
+
+```java
+import com.deepgram.resources.listen.v2.types.ListenV2CloseStream;
+import com.deepgram.resources.listen.v2.types.ListenV2CloseStreamType;
+import com.deepgram.resources.listen.v2.websocket.V2ConnectOptions;
+import com.deepgram.resources.listen.v2.websocket.V2WebSocketClient;
+import java.util.concurrent.TimeUnit;
+
+V2WebSocketClient wsClient = client.listen().v2().v2WebSocket();
+
+wsClient.onConnected(connected ->
+        System.out.println("request_id=" + connected.getRequestId()));
+
+wsClient.onTurnInfo(turnInfo -> {
+    System.out.printf("[%s] turn=%.0f transcript=\"%s\"%n",
+            turnInfo.getEvent(),
+            turnInfo.getTurnIndex(),
+            turnInfo.getTranscript());
+});
+
+wsClient.connect(V2ConnectOptions.builder()
+        .model("flux-general-en")
+        .build())
+        .get(10, TimeUnit.SECONDS);
+
+// wsClient.sendMedia(okio.ByteString.of(audioChunk));
+
+wsClient.sendCloseStream(ListenV2CloseStream.builder()
+        .type(ListenV2CloseStreamType.CLOSE_STREAM)
+        .build());
+```
+
+## Key parameters / API surface
+
+- Entry point: `client.listen().v2().v2WebSocket()`
+- Required connect field: `model(String)`
+- Verified connect options in source: `encoding`, `sampleRate`, `eagerEotThreshold`, `eotThreshold`, `eotTimeoutMs`, `keyterm`, `mipOptOut`, `tag`
+- Send methods: `sendMedia(...)`, `sendCloseStream(...)`
+- Event handlers: `onConnected(Consumer<ListenV2Connected>)`, `onTurnInfo(...)`, `onErrorMessage(...)`, plus generic connection/error hooks
+
+## API reference (layered)
+
+1. **In-repo source of truth**: `src/main/java/com/deepgram/resources/listen/v2/` and `examples/listen/LiveStreamingV2.java`. No `reference.md` exists in this checkout.
+2. **Canonical AsyncAPI**: https://developers.deepgram.com/asyncapi.yaml
+3. **Context7**: `/llmstxt/developers_deepgram_llms_txt`
+4. **Product docs**:
+   - https://developers.deepgram.com/reference/speech-to-text/listen-flux
+   - https://developers.deepgram.com/docs/flux/quickstart
+   - https://developers.deepgram.com/docs/flux/language-prompting
+
+## Gotchas
+
+1. **This is WebSocket-only in the Java SDK.** There is no REST helper for `/v2/listen` here.
+2. **`model` is a plain `String`, not an enum.** Use Flux model IDs such as `flux-general-en` exactly.
+3. **Close with `sendCloseStream(...)`, not Listen V1 finalize.** The message type is different from v1.
+4. **The current Java connect options do not expose `language_hint`.** Do not assume the Python surface exists here.
+5. **Turn events are the main payload.** Handle `onTurnInfo(...)`, not Listen V1 `onResults(...)`.
+6. **You still need to stream binary audio manually.** The example only wires handlers and close flow.
+7. **Wait for `connect(...).get(...)` before sending media.** The client is async but not fire-and-forget.
+
+## Example files in this repo
+
+- `examples/listen/LiveStreamingV2.java`
+
+## Central product skills
+
+For cross-language Deepgram product knowledge — the consolidated API reference, documentation finder, focused runnable recipes, third-party integration examples, and MCP setup — install the central skills:
+
+```bash
+npx skills add deepgram/skills
+```
+
+This SDK ships language-idiomatic code skills; `deepgram/skills` ships cross-language product knowledge (see `api`, `docs`, `recipes`, `examples`, `starters`, `setup-mcp`).
@@ -0,0 +1,113 @@
+---
+name: deepgram-java-maintaining-sdk
+description: Use when regenerating this Java SDK with Fern, editing `.fernignore`, preparing the repo for a generator release, reconciling manual patches after regen, or deciding whether a file is permanently frozen vs temporarily frozen. This SDK is Fern-generated - most files under `src/main/java/com/deepgram/` should not be edited directly. Triggers include "fern regen", "regenerate sdk", ".fernignore", "unfreeze", "re-apply patches", and "sdk regeneration".
+---
+
+# Maintaining the Deepgram Java SDK
+
+This SDK is generated by [Fern](https://buildwithfern.com/). Most files under `src/main/java/com/deepgram/` are auto-generated and should not be edited directly. Some files are hand-written or manually patched and are listed in `.fernignore` so Fern does not overwrite them.
+
+## Freeze classification rules
+
+Every entry in `.fernignore` falls into one of two categories. The comment above each entry is authoritative, but when in doubt use these rules.
+
+### Never unfreeze (permanently frozen)
+
+These files are hand-written or maintained independently from Fern. They must stay in `.fernignore`.
+
+How to identify:
+
+- Custom wrapper/client code written by the repo maintainers
+- Transport abstractions or other hand-built infrastructure
+- Build files, docs, tests, examples, CI/config artifacts
+- Anything outside the generated Java package tree that Fern should not own
+
+Current permanently frozen files and directories:
+
+- `src/main/java/com/deepgram/DeepgramClient.java`
+- `src/main/java/com/deepgram/AsyncDeepgramClient.java`
+- `src/main/java/com/deepgram/DeepgramClientBuilder.java`
+- `src/main/java/com/deepgram/AsyncDeepgramClientBuilder.java`
+- `src/main/java/com/deepgram/core/transport/`
+- `build.gradle`, `settings.gradle`, `gradle/`, `gradlew`, `gradlew.bat`, `pom.xml`, `Makefile`
+- `README.md`, `CHANGELOG.md`, `CONTRIBUTING.md`, `LICENSE`
+- `src/test/`
+- `examples/`
+- `.editorconfig`, `.githooks/`, `.github/`, `.gitignore`
+- `target/`
+- `CLAUDE.md`, `AGENTS.md`, `.claude/`, `.agents/`
+
+Also note the defensive flat-path `.fernignore` entries:
+
+- `src/main/java/DeepgramClient.java`
+- `src/main/java/AsyncDeepgramClient.java`
+- `src/main/java/DeepgramClientBuilder.java`
+- `src/main/java/AsyncDeepgramClientBuilder.java`
+
+Those flat-path files do **not** exist in this checkout. They are layout guards for alternate local generation layouts, not active source files.
+
+### Unfreeze for regen (temporarily frozen)
+
+These files are Fern-generated but still carry local fixes. Unfreeze them before a regen so the generator can rewrite the original path and you can diff the new output against your patched copy.
+
+How to identify:
+
+- Fern would regenerate the file if it were removed from `.fernignore`
+- The checked-in version is a patched copy of generator output
+
+Current temporarily frozen files:
+
+- `src/main/java/com/deepgram/core/ClientOptions.java` — preserves release-please version markers plus correct `User-Agent`, `X-Fern-SDK-Name`, and `X-Fern-SDK-Version` constants that Fern currently overwrites
+
+## Prepare repo for regeneration
+
+1. Create a branch from `main` named `lo/sdk-gen-<YYYY-MM-DD>`.
+2. Push it and open a PR titled `chore: SDK regeneration <YYYY-MM-DD>`.
+3. Read `.fernignore` and classify every entry.
+4. For each **temporarily frozen** file only:
+   - Copy it to `<filename>.bak` beside the original.
+   - In `.fernignore`, replace the original path with the `.bak` path.
+5. Leave **permanently frozen** entries untouched.
+6. Commit as `chore: unfreeze files pending regen` and push.
+7. Fern can now regenerate the original paths.
+
+## After regeneration
+
+The `.bak` files preserve the old patched versions. The original paths now contain fresh generator output.
+
+1. Diff each `.bak` file against the regenerated original.
+2. Re-apply only the patches that are still needed.
+3. In `.fernignore`, replace each `.bak` path back to the original path for files that still need local patches.
+4. Remove `.fernignore` entries entirely for files where Fern now generates the correct output.
+5. Delete all `.bak` files.
+6. Run verification:
+   ```bash
+   ./gradlew test
+   ./gradlew compileExamples
+   mvn test
+   ```
+   Use `mvn verify` only when you also want the Maven Failsafe integration-test phase (`**/IntegrationTest*`) to run.
+7. Commit as `chore: re-apply manual patches after regen` and push.
+
+## Java-specific notes
+
+- The custom builders add **Bearer-token support**, **auto session ID headers**, and **transportFactory(...)** hooks on top of Fern's generated API client.
+- `ClientOptions.java` is the only currently documented temporary patch point.
+- The transport abstraction under `src/main/java/com/deepgram/core/transport/` is permanently hand-maintained.
+- `examples/` is permanently frozen and is also used as the main source of truth for skill authoring because this checkout does not include `reference.md`.
+- `sample-app/` is **not** listed in `.fernignore`, so it does not currently appear frozen.
+- `build.gradle` intentionally excludes three manage examples from `compileExamples`: `manage/ListModels.java`, `manage/MemberPermissions.java`, and `manage/UsageBreakdown.java`.
+
+## Source-of-truth note
+
+`AGENTS.md` in the repo root and this skill should stay synchronized. If the regeneration workflow changes, update both.
+
+## Example files in this repo
+
+- `AGENTS.md`
+- `.fernignore`
+- `build.gradle`
+- `pom.xml`
+- `src/main/java/com/deepgram/DeepgramClientBuilder.java`
+- `src/main/java/com/deepgram/core/ClientOptions.java`
+- `src/main/java/com/deepgram/core/transport/`