Kevin m kent/v2 domain intent taxonomy by kevin-m-kent · Pull Request #4071 · microsoft/vscode-copilot-chat

kevin-m-kent · 2026-02-27T18:33:51Z

PR that builds up domain and intent from prompt clustering and labeling in our 1p data. Also adds some prompting that improves the classifier's error rate (in particular swapping domain and intent). This approach also reduces the number of categories by about 50%.

FYI @digitarald , pending our discussion later.

Replace the original domain (20 categories) and intent (38 categories) definitions with v2 taxonomy derived from clustering analysis: - 16 domain categories (cicd_cloud_infra, cli_scripting, automated_testing, etc.) - 13 intent categories (explain, find_content, research, review, etc.) - Updated classification guidance with domain vs intent independence framing - Updated prompt generation to match new category format Scope and time estimate dimensions are unchanged. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Updates the prompt-categorization taxonomy and prompting used by the panel prompt classifier to a new “v2” domain/intent scheme derived from clustering, with additional guidance intended to reduce domain/intent swaps.

Changes:

Replaces the intent and domain category definitions with a reduced v2 taxonomy.
Updates taxonomy prompt formatting and adds explicit “domain vs intent” separation guidance (and changes ordering to domain-first).
Tweaks the system prompt wording for the categorization prompt.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/extension/prompts/node/panel/promptCategorization.tsx	Updates the system prompt text used to instruct the classifier and includes the taxonomy prompt.
src/extension/prompt/common/promptCategorizationTaxonomy.ts	Reworks domain/intent taxonomy definitions and regenerates the taxonomy prompt/guidance formatting.

Comments suppressed due to low confidence (1)

src/extension/prompt/common/promptCategorizationTaxonomy.ts:327

The guidance explicitly instructs using unknown for intent/domain, but it doesn't mention what to do when scope is unclear. Since the scope taxonomy uses unknown_scope (not unknown), the current guidance can increase invalid tool outputs (rejected by isValidScope). Consider adding an explicit rule like "If scope is unclear, use unknown_scope" to keep tool calls valid.

**Domain** is the technical subject area or problem space the user is operating in.
- It describes a system, architecture, technology area, or problem space — never an activity.
- Think of it as answering: "What area of technology is this about?"
- If the prompt does not clearly indicate a technical domain, use \`unknown\`.

**Intent** is the developer action or goal being performed within that domain.
- It describes what the user is trying to accomplish — the verb, not the noun.
- Think of it as answering: "What is the user trying to do?"
- If the prompt does not clearly indicate an intent, use \`unknown\`.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…cking Remove configuration files and documentation references from description and keywords. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…header Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

digitarald

Taxonomy changes look good — data-driven consolidation is a solid approach. Left a few non-blocking nits for follow-up (telemetry versioning, prompt boundary, dead code).

digitarald · 2026-03-02T18:05:36Z


 // ============================================================================
 // INTENTS - What action the user wants
 // ============================================================================


nit (non-blocking, follow-up): The promptCategorization telemetry event in promptCategorizer.ts emits intent and domain as raw strings. Since v2 keys are completely different from v1 (e.g., code_fixing → troubleshoot_debug, frontend → web_ui), queries on this data will get mixed v1/v2 values with no way to distinguish them once this ships.

Suggestion: Add a taxonomyVersion: 'v2' property to the telemetry event and its GDPR annotation so dashboards can filter correctly.

I like this idea. Let me update.

digitarald · 2026-03-02T18:05:37Z

+			'You are an expert classifier for AI coding assistant prompts. Classify developer requests in context of their workspace and active file across domain, intent, time estimate, and scope.',
 			'You MUST use the categorize_prompt tool to provide your classification.',
 			generateTaxonomyPrompt(),
 		].join('\n\n');


nit (non-blocking): systemPrompt is joined with \n\n but then rendered immediately before <SafetyRules /> with no separator, so the last line of the taxonomy can run together with the safety rules text.

Consider adding a trailing \n\n:

].join('\n\n') + '\n\n';

Or a <br /> before <SafetyRules /> in the JSX.

digitarald · 2026-03-02T18:05:37Z

+		parts.push(`- Keywords: ${def.keywords.join(', ')}`);
 	}
 	if (def.signals?.length) {
 		parts.push(`Signals: ${def.signals.join(', ')}`);


nit (non-blocking): def.signals rendering is now dead code — no v2 definitions use signals anymore (all switched to keywords). Could be cleaned up in a follow-up.

- Add taxonomyVersion field to telemetry events to distinguish v1/v2 data - Fix prompt/SafetyRules boundary by adding trailing newlines - Consistent bullet formatting for signals in scope definitions

- Add taxonomyVersion field to telemetry events to distinguish v1/v2 data - Fix prompt/SafetyRules boundary by adding trailing newlines - Consistent bullet formatting for signals in scope definitions Co-authored-by: Harald Kirschner <digitarald@gmail.com>

Kevin Kent and others added 3 commits February 27, 2026 11:34

Add new_feature and refactor intents, remove design_review

8309164

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Clean up section header comments

3c5adf2

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings February 27, 2026 18:33

Copilot started reviewing on behalf of kevin-m-kent February 27, 2026 18:34 View session

vs-code-engineering Bot assigned dileepyavan Feb 27, 2026

vs-code-engineering Bot added the triage-needed label Feb 27, 2026

Copilot AI reviewed Feb 27, 2026

View reviewed changes

Comment thread src/extension/prompts/node/panel/promptCategorization.tsx

Comment thread src/extension/prompt/common/promptCategorizationTaxonomy.ts Outdated

Kevin Kent and others added 7 commits February 27, 2026 14:32

Add backend_dev domain category

fcec146

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Update src/extension/prompt/common/promptCategorizationTaxonomy.ts

abf1d2a

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Rename unknown to need_info for domain and intent categories

41c0a8e

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Narrow project_config_docs domain to project management and issue tra…

04f6671

…cking Remove configuration files and documentation references from description and keywords. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Rename project_config_docs to project_management

b205b2b

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Rename project_management to project_mgmt, remove (v2) from taxonomy …

00031c0

…header Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Add ml_statistics domain category

5d6de61

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

digitarald removed the triage-needed label Mar 2, 2026

digitarald assigned digitarald and unassigned dileepyavan Mar 2, 2026

Merge branch 'main' into kevin-m-kent/v2-domain-intent-taxonomy

84b3ae7

digitarald self-requested a review March 2, 2026 17:59

digitarald enabled auto-merge March 2, 2026 18:04

digitarald reviewed Mar 2, 2026

View reviewed changes

digitarald approved these changes Mar 2, 2026

View reviewed changes

vs-code-engineering Bot added this to the March 2026 milestone Mar 2, 2026

Yoyokrazy approved these changes Mar 2, 2026

View reviewed changes

digitarald added this pull request to the merge queue Mar 2, 2026

Merged via the queue into microsoft:main with commit 177a295 Mar 2, 2026
9 checks passed

digitarald mentioned this pull request Mar 2, 2026

Follow-up fixes for v2 taxonomy (#4071) #4120

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kevin m kent/v2 domain intent taxonomy#4071

Kevin m kent/v2 domain intent taxonomy#4071
digitarald merged 11 commits into
microsoft:mainfrom
kevin-m-kent:kevin-m-kent/v2-domain-intent-taxonomy

kevin-m-kent commented Feb 27, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

digitarald left a comment

Uh oh!

digitarald Mar 2, 2026

Uh oh!

kevin-m-kent Mar 2, 2026

Uh oh!

digitarald Mar 2, 2026

Uh oh!

digitarald Mar 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

kevin-m-kent commented Feb 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

digitarald left a comment

Choose a reason for hiding this comment

Uh oh!

digitarald Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

kevin-m-kent Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

digitarald Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

digitarald Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

kevin-m-kent commented Feb 27, 2026 •

edited

Loading