-
|
I am comparing the results using v20230306 transcribing the same audio twice, first with The refinement of timing is excellent with Since only |
-
|
Is anyone else seeing this if you do an A/B comparison with this new version? |
-
|
I can confirm the repetitions occur, and that they're not model hallucinations as they happen even with very small pauses. Putting the repetitions aside, the quality of the transcription looks even better. |
-
|
Thanks for reporting this. I wanted to merge the implementation before the branch got too divergent, but I'm sure it still has many rough edges. I'm trying to reproduce the repetition issue myself so I can push a fix. |
-
|
Additional tests: the duplicates are not always identical; some look like two different transcriptions of the same chunk. |
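For anyone who wants to check their own output for this, here is a minimal sketch that flags adjacent segments with near-identical text. It assumes the segments are a list of dicts with a `"text"` key, as in the `"segments"` list of a transcription result; the function name `find_near_duplicates` and the `min_ratio` threshold are made up for illustration and are not part of Whisper.

```python
from difflib import SequenceMatcher


def find_near_duplicates(segments, min_ratio=0.8):
    """Return (i, i + 1) index pairs of adjacent segments with similar text.

    `segments` is assumed to be a list of dicts that each have a "text" key.
    `min_ratio` is an arbitrary similarity threshold, not a tuned value.
    """
    pairs = []
    for i in range(len(segments) - 1):
        a = segments[i]["text"].strip().lower()
        b = segments[i + 1]["text"].strip().lower()
        # Skip empty segments so silence does not count as a repetition.
        if a and b and SequenceMatcher(None, a, b).ratio() >= min_ratio:
            pairs.append((i, i + 1))
    return pairs
```

Because it compares similarity rather than equality, it should also catch the case where the repeat is a slightly different transcription of the same chunk.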
-
|
@jongwook Not sure exactly how this shift code below should function, is it a problem that

    if len(consecutive) > 0 and len(word_end_timestamps) > 0:
        seek_shift = round(
            (word_end_timestamps[-1] - time_offset) * FRAMES_PER_SECOND
        )
        if seek_shift > 0:
            seek = previous_seek + seek_shift  ####################
|
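To make the arithmetic in that snippet easier to follow, here is a standalone sketch of the same calculation. It is not the library's code: the function `shifted_seek` and its parameters are hypothetical, and `FRAMES_PER_SECOND = 100` assumes 16000 Hz audio with a hop length of 160 samples (10 ms per frame).

```python
FRAMES_PER_SECOND = 100  # assumption: 16000 Hz sample rate / hop length of 160


def shifted_seek(seek, previous_seek, time_offset, word_end_timestamps, has_consecutive):
    """Return the seek position (in frames) after applying the word-end shift.

    seek                -- seek position already chosen from the segment timestamps
    previous_seek       -- seek position at the start of the current window
    time_offset         -- start of the current window, in seconds
    word_end_timestamps -- end times (seconds) of the words found in this window
    has_consecutive     -- stands in for `len(consecutive) > 0` in the snippet
    """
    if has_consecutive and len(word_end_timestamps) > 0:
        # Distance from the window start to the end of the last word, in frames.
        seek_shift = round(
            (word_end_timestamps[-1] - time_offset) * FRAMES_PER_SECOND
        )
        if seek_shift > 0:
            return previous_seek + seek_shift
    return seek
```

With a window starting at 30.0 s (`previous_seek = 3000`) and a last word ending at 42.5 s, the shift is 1250 frames, so the next window would start at frame 4250 regardless of the value `seek` had before. If the last word ends at or before the window start, `seek` is left unchanged.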
-
|
The hallucination problem with the Whisper model can be solved by using the code provided in the official Whisper GitHub repository, but before running it, remember to split the audio into 30-second segments. The audio splitting code is below:

    import glob
    from moviepy.editor import AudioFileClip

    audio_path = glob.glob("./wav_files/*.wav")[0]

Once the audio files have been split with the snippet above, call the Whisper code below:

    import whisper

This was worked out primarily with help from my teammate Sudipa Dutta (GitHub: sudipa23). Hope this solves the problem. |
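Since the splitting snippet above is incomplete, here is a minimal sketch of the 30-second splitting step. Note that it swaps in the standard-library `wave` module instead of moviepy, so it only handles uncompressed WAV files; the function name `split_wav` and the output naming scheme are assumptions for illustration.

```python
import wave
from pathlib import Path


def split_wav(path, out_dir, chunk_seconds=30):
    """Split a WAV file into consecutive chunks of at most `chunk_seconds`.

    Returns the list of chunk paths in order. The last chunk may be shorter.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(path), "rb") as src:
        params = src.getparams()
        frames_per_chunk = src.getframerate() * chunk_seconds
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # reached the end of the source file
            out_path = out_dir / f"{Path(path).stem}_{index:03d}.wav"
            with wave.open(str(out_path), "wb") as dst:
                # Copy channel count, sample width, and rate from the source;
                # the frame count in the header is corrected on close.
                dst.setparams(params)
                dst.writeframes(frames)
            written.append(out_path)
            index += 1
    return written
```

Each resulting file can then be passed to the transcription call one at a time. Cutting at fixed 30-second boundaries can split a word in half, so this is a workaround to try rather than a guaranteed fix.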
-
|
Hi, I've set up my environment like this: Wondering if anyone has any ideas or suggestions as to why this is happening. |