Latency improvements with the new manual VAD algorithm

Done:

- [x] Implement monitoring metrics to follow improvements

Todo:

- [ ] Parallelize more TTS and database calls (study OTEL traces for opportunity confirmation)
- [ ] Reduce dependency calls before sending call to the LLM or defer them
- [ ] Compress the prompt (LLMlingua?)
- [ ] Use a LLM with a lower latency (Phi 4?)
- [ ] Trace the code executions with local debugger to pin points unseen optimizations


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Latency improvements with the new manual VAD algorithm #368

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Latency improvements with the new manual VAD algorithm #368

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions