Commit 00e5549
Use tiktoken (openai#1044)
* use tiktoken==0.3.0
* formatting
* tuple should be safer
* Update whisper/tokenizer.py
Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>
* use tiktoken 0.3.1
* reflecting suggestions
* cleanup
* bypassing load_tiktoken_bpe to avoid blobfile dep
---------
Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>1 parent f05f320 commit 00e5549
File tree
15 files changed
+100601
-100096
lines changed- tests
- whisper
- assets
- gpt2
- multilingual
15 files changed
+100601
-100096
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
5 | | - | |
6 | | - | |
7 | 5 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
6 | | - | |
| 6 | + | |
7 | 7 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4 | 4 | | |
5 | 5 | | |
6 | 6 | | |
| 7 | + | |
7 | 8 | | |
8 | 9 | | |
9 | 10 | | |
| |||
24 | 25 | | |
25 | 26 | | |
26 | 27 | | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
27 | 33 | | |
28 | 34 | | |
29 | 35 | | |
30 | 36 | | |
31 | 37 | | |
32 | 38 | | |
33 | 39 | | |
34 | | - | |
35 | 40 | | |
36 | 41 | | |
37 | 42 | | |
0 commit comments