ChatterboxTurboTTS fails with RuntimeError: expected scalar type Double but found Float when specifying audio_prompt_path when numpy>=2.0

When running the following code, I get an error due to a mismatch between float types

```python
tts_model = ChatterboxTurboTTS.from_pretrained(device="cuda")

text = "Whereas recognition of the inherent dignity and of the equal and inalienable rights..."
wav = tts_model.generate(
    text,
    audio_prompt_path="voice_example.wav",
)
```
I am running this on python 3.13, so I have numpy>2.4.4 installed. I think the issue is in https://github.com/resemble-ai/chatterbox/blob/59bc590b3cad826e5d5987745bf6844627a21ad5/src/chatterbox/tts_turbo.py#L211

`gain_linear` is a `np.float64` and due to the [new type promotion rules](https://numpy.org/devdocs/numpy_2_0_migration_guide.html#changes-to-numpy-data-type-promotion), this promotes wav from an np.float32 to a np.float64.

In my use case, it's good enough to cast `gain_linear` to `np.float32`, but there may be more places where promotions happen. 

I'll raise a pull request.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChatterboxTurboTTS fails with RuntimeError: expected scalar type Double but found Float when specifying audio_prompt_path when numpy>=2.0 #499

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

ChatterboxTurboTTS fails with RuntimeError: expected scalar type Double but found Float when specifying audio_prompt_path when numpy>=2.0 #499

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions