Commit c26953f
Use tiktoken (openai#1044)
* use tiktoken==0.3.0
* formatting
* tuple should be safer
* Update whisper/tokenizer.py
Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>
* use tiktoken 0.3.1
* reflecting suggestions
* cleanup
* bypassing load_tiktoken_bpe to avoid blobfile dep
---------
Co-authored-by: Ruhollah Majdoddin <r.majdodin@gmail.com>1 parent 6d09ca5 commit c26953f
File tree
15 files changed
+100601
-100096
lines changed- tests
- whisper
- assets
- gpt2
- multilingual
15 files changed
+100601
-100096
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
5 | | - | |
6 | | - | |
7 | 5 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
6 | | - | |
| 6 | + | |
7 | 7 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6 | 6 | | |
7 | 7 | | |
8 | 8 | | |
| 9 | + | |
9 | 10 | | |
10 | 11 | | |
11 | 12 | | |
| |||
46 | 47 | | |
47 | 48 | | |
48 | 49 | | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
49 | 55 | | |
50 | 56 | | |
51 | 57 | | |
52 | 58 | | |
53 | 59 | | |
54 | 60 | | |
55 | 61 | | |
56 | | - | |
57 | 62 | | |
58 | 63 | | |
59 | 64 | | |
0 commit comments