Skip to content

Commit b33a64f

Browse files
bexxxmarkcowl
authored andcommitted
Add v3.2 for Speech to text (3.2p2 as base) (#28664)
* Add v3.2 for Speech to text (3.2p2 as base) * Update swagger with new changes and updated version number * Update readme.md with new version * Added evaluation token errors * Revert ttl changes for transcriptions * Finalized api changes for GA * Try fix avocado error
1 parent 26820f9 commit b33a64f

93 files changed

Lines changed: 10989 additions & 8 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

specification/cognitiveservices/data-plane/Speech/SpeechToText/readme.md

Lines changed: 31 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -4,17 +4,17 @@
44
55
Configuration for generating SpeechToText SDK.
66

7-
The current release for the SpeechToText is `release_3_1`.
7+
The current release for the SpeechToText is `release_3_2`.
88

99
``` yaml
10-
tag: release_3_1
10+
tag: release_3_2
1111
add-credentials: true
1212
openapi-type: data-plane
1313
```
1414
1515
# Releases
1616
17-
## SpeechToText 2.0
17+
## Tag: release_2_0
1818
These settings apply only when `--tag=release_2_0` is specified on the command line.
1919

2020
``` yaml $(tag) == 'release_2_0'
@@ -34,7 +34,7 @@ directive:
3434

3535
---
3636

37-
## SpeechToText 3.0
37+
## Tag: release_3_0
3838
These settings apply only when `--tag=release_3_0` is specified on the command line.
3939

4040
``` yaml $(tag) == 'release_3_0'
@@ -52,7 +52,7 @@ directive:
5252

5353
---
5454

55-
## SpeechToText 3.1-preview.1
55+
## Tag: release_3_1_preview_1
5656
These settings apply only when `--tag=release_3_1_preview_1` is specified on the command line.
5757

5858
``` yaml $(tag) == 'release_3_1_preview_1'
@@ -70,7 +70,7 @@ directive:
7070

7171
---
7272

73-
## SpeechToText 3.1
73+
## Tag: release_3_1
7474

7575
These settings apply only when `--tag=release_3_1` is specified on the command line.
7676

@@ -95,7 +95,7 @@ directive:
9595

9696
---
9797

98-
## SpeechToText 3.2-preview.1
98+
## Tag: release_3_2_preview_1
9999

100100
These settings apply only when `--tag=release_3_2_preview_1` is specified on the command line.
101101

@@ -118,7 +118,7 @@ directive:
118118
reason: Existing API, change would potentially be breaking.
119119
```
120120

121-
## SpeechToText 3.2-preview.2
121+
## Tag: release_3_2_preview_2
122122

123123
These settings apply only when `--tag=release_3_2_preview_2` is specified on the command line.
124124

@@ -129,6 +129,29 @@ input-file:
129129

130130
AutoRest-Linter Suppressions
131131

132+
``` yaml
133+
# Ignore autorest-linter issues that cannot be resolve without updates to the API implementation
134+
directive:
135+
- suppress: LongRunningOperationsWithLongRunningExtension
136+
reason: Does not apply in those two places. The method is a DELETE which lazily deletes blobs, so it's Accepted, not NoContent.
137+
- suppress: OperationIdNounVerb
138+
where: $..paths[($..operationId["Models_*"])]
139+
reason: There is a sub-route called /models/base/ that refers to the base models. Therefore, the correct operation ID seems to be "Models_GetBaseModel", for example.
140+
- suppress: HostParametersValidation
141+
reason: Existing API, change would potentially be breaking.
142+
```
143+
144+
## Tag: release_3_2
145+
146+
These settings apply only when `--tag=release_3_2` is specified on the command line.
147+
148+
```yaml $(tag) == 'release_3_2'
149+
input-file:
150+
- stable/v3.2/speechtotext.json
151+
```
152+
153+
AutoRest-Linter Suppressions
154+
132155
``` yaml
133156
# Ignore autorest-linter issues that cannot be resolve without updates to the API implementation
134157
directive:
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"id": "9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1",
6+
"blockList": [
7+
{
8+
"kind": "Uncommitted",
9+
"id": "AAA="
10+
},
11+
{
12+
"kind": "Uncommitted",
13+
"id": "AAE="
14+
},
15+
{
16+
"kind": "Uncommitted",
17+
"id": "AAI="
18+
}
19+
]
20+
},
21+
"responses": {
22+
"200": {
23+
"headers": {}
24+
}
25+
}
26+
}
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"id": "827712a5-f942-4997-91c3-7c6cde35600b",
6+
"modelCopyAuthorization": {
7+
"targetResourceRegion": "westus2",
8+
"targetResourceId": "/subscriptions/targetSubscriptionId/resourceGroups/targetResourceGroupName/providers/Microsoft.CognitiveServices/accounts/targetSpeechResourceName",
9+
"targetResourceEndpoint": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/models",
10+
"sourceResourceId": "/subscriptions/sourceSubscriptionId/resourceGroups/sourceResourceGroupName/providers/Microsoft.CognitiveServices/accounts/sourceSpeechResourceName",
11+
"expirationDateTime": "2019-01-07T11:34:12Z",
12+
"id": "d61573c6-788b-4eff-b3f5-38a1c7a9585b"
13+
},
14+
"Content-Type": "application/json"
15+
},
16+
"responses": {
17+
"202": {
18+
"headers": {
19+
"Operation-Location": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/operations/models/copy/E30F6A27-82BE-4CCA-9258-0399C70489FF"
20+
},
21+
"body": {
22+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/operations/models/copy/E30F6A27-82BE-4CCA-9258-0399C70489FF",
23+
"createdDateTime": "2019-01-07T11:34:12Z",
24+
"lastActionDateTime": "2019-01-07T11:34:12Z",
25+
"status": "NotStarted",
26+
"id": "e30f6a27-82be-4cca-9258-0399c70489ff"
27+
}
28+
}
29+
}
30+
}
Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,55 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"transcription": {
6+
"contentContainerUrl": "https://customspeech-usw.blob.core.windows.net/artifacts/audiofiles/",
7+
"properties": {
8+
"diarizationEnabled": false,
9+
"wordLevelTimestampsEnabled": false,
10+
"displayFormWordLevelTimestampsEnabled": false,
11+
"punctuationMode": "DictatedAndAutomatic",
12+
"profanityFilterMode": "Masked"
13+
},
14+
"locale": "en-US",
15+
"displayName": "Transcription of storage container using default model for en-US"
16+
},
17+
"Content-Type": "application/json"
18+
},
19+
"responses": {
20+
"201": {
21+
"headers": {
22+
"Location": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions/ba7ea6f5-3065-40b7-b49a-a90f48584683"
23+
},
24+
"body": {
25+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions/ba7ea6f5-3065-40b7-b49a-a90f48584683",
26+
"model": {
27+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/models/827712a5-f942-4997-91c3-7c6cde35600b"
28+
},
29+
"links": {
30+
"files": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/transcriptions/ba7ea6f5-3065-40b7-b49a-a90f48584683/files"
31+
},
32+
"properties": {
33+
"diarizationEnabled": false,
34+
"wordLevelTimestampsEnabled": false,
35+
"displayFormWordLevelTimestampsEnabled": false,
36+
"channels": [
37+
0,
38+
1
39+
],
40+
"punctuationMode": "DictatedAndAutomatic",
41+
"profanityFilterMode": "Masked",
42+
"duration": "PT42S"
43+
},
44+
"lastActionDateTime": "2019-01-07T11:36:07Z",
45+
"status": "Succeeded",
46+
"createdDateTime": "2019-01-07T11:34:12Z",
47+
"locale": "en-US",
48+
"displayName": "Transcription using adapted model en-US",
49+
"customProperties": {
50+
"key": "value"
51+
}
52+
}
53+
}
54+
}
55+
}
Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,23 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"modelCopyAuthorizationDefinition": {
6+
"sourceResourceId": "/subscriptions/sourceSubscriptionId/resourceGroups/sourceResourceGroupName/providers/Microsoft.CognitiveServices/accounts/sourceSpeechResourceName"
7+
},
8+
"Content-Type": "application/json"
9+
},
10+
"responses": {
11+
"200": {
12+
"headers": {},
13+
"body": {
14+
"targetResourceRegion": "westus2",
15+
"targetResourceId": "/subscriptions/targetSubscriptionId/resourceGroups/targetResourceGroupName/providers/Microsoft.CognitiveServices/accounts/targetSpeechResourceName",
16+
"targetResourceEndpoint": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/models",
17+
"sourceResourceId": "/subscriptions/sourceSubscriptionId/resourceGroups/sourceResourceGroupName/providers/Microsoft.CognitiveServices/accounts/sourceSpeechResourceName",
18+
"expirationDateTime": "2019-01-07T11:34:12Z",
19+
"id": "d61573c6-788b-4eff-b3f5-38a1c7a9585b"
20+
}
21+
}
22+
}
23+
}
Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"dataset": {
6+
"kind": "Acoustic",
7+
"locale": "en-US",
8+
"displayName": "My speech dataset name",
9+
"description": "My speech dataset description"
10+
},
11+
"Content-Type": "application/json"
12+
},
13+
"responses": {
14+
"201": {
15+
"headers": {},
16+
"body": {
17+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1",
18+
"kind": "Acoustic",
19+
"links": {
20+
"files": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/files",
21+
"commitBlocks": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/blocks:commit",
22+
"listBlocks": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/blocks",
23+
"uploadBlocks": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/blocks"
24+
},
25+
"lastActionDateTime": "2019-01-07T11:36:07Z",
26+
"status": "NotStarted",
27+
"createdDateTime": "2019-01-07T11:34:12Z",
28+
"locale": "en-US",
29+
"displayName": "Acoustic dataset"
30+
}
31+
}
32+
}
33+
}
Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"dataset": {
6+
"kind": "Acoustic",
7+
"contentUrl": "https://contoso.com/location",
8+
"locale": "en-US",
9+
"displayName": "My speech dataset name",
10+
"description": "My speech dataset description"
11+
},
12+
"Content-Type": "application/json"
13+
},
14+
"responses": {
15+
"201": {
16+
"headers": {
17+
"Location": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1"
18+
},
19+
"body": {
20+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1",
21+
"kind": "Acoustic",
22+
"contentUrl": "https://www.contoso.com/acousticdata/sourcelocation",
23+
"links": {
24+
"files": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/datasets/9d5f4100-5f8e-4dd6-bd83-9bbbf50d57f1/files"
25+
},
26+
"properties": {
27+
"textNormalizationKind": "Default",
28+
"acceptedLineCount": 11,
29+
"rejectedLineCount": 2,
30+
"duration": "PT4M12S"
31+
},
32+
"lastActionDateTime": "2019-01-07T11:36:07Z",
33+
"status": "Succeeded",
34+
"createdDateTime": "2019-01-07T11:34:12Z",
35+
"locale": "en-US",
36+
"displayName": "Acoustic dataset"
37+
}
38+
}
39+
}
40+
}
Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,50 @@
1+
{
2+
"parameters": {
3+
"Endpoint": "https://westus.api.cognitive.microsoft.com/",
4+
"Ocp-Apim-Subscription-Key": "{API Key}",
5+
"endpoint": {
6+
"model": {
7+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/models/827712a5-f942-4997-91c3-7c6cde35600b"
8+
},
9+
"properties": {
10+
"loggingEnabled": true
11+
},
12+
"locale": "en-US",
13+
"displayName": "Speech endpoint definition",
14+
"description": "This is a speech endpoint"
15+
},
16+
"Content-Type": "application/json"
17+
},
18+
"responses": {
19+
"201": {
20+
"headers": {
21+
"Location": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/afa0669c-a01e-4693-ae3a-93baf40f26d6"
22+
},
23+
"body": {
24+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/afa0669c-a01e-4693-ae3a-93baf40f26d6",
25+
"model": {
26+
"self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/models/827712a5-f942-4997-91c3-7c6cde35600b"
27+
},
28+
"links": {
29+
"logs": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/afa0669c-a01e-4693-ae3a-93baf40f26d6/files/logs",
30+
"restInteractive": "https://westus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6",
31+
"restConversation": "https://westus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6",
32+
"restDictation": "https://westus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6",
33+
"webSocketInteractive": "wss://westus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6",
34+
"webSocketConversation": "wss://westus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6",
35+
"webSocketDictation": "wss://westus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=afa0669c-a01e-4693-ae3a-93baf40f26d6"
36+
},
37+
"properties": {
38+
"timeToLive": "PT30M",
39+
"loggingEnabled": true
40+
},
41+
"lastActionDateTime": "2019-01-07T11:36:07Z",
42+
"status": "Succeeded",
43+
"createdDateTime": "2019-01-07T11:34:12Z",
44+
"locale": "en-US",
45+
"displayName": "Speech endpoint",
46+
"description": "Example for speech endpoint"
47+
}
48+
}
49+
}
50+
}

0 commit comments

Comments
 (0)