← All notes

Why does speech need different dialogue management than text?

Speech-driven conversation requires different architectural choices than text due to acoustic noise, latency cascades, and unmeasured reasoning capabilities.

Topic Hub · 7 linked notes · 5 sections
View as

ASR and Dialogue Management Under Noisy Input

1 note

Why do dialogue systems need probabilistic reasoning?

Explores whether deterministic flowchart-based dialogue systems can handle realistic speech recognition error rates of 15-30 percent, and what alternative approaches might be necessary.

Explore related Read →

Speech-to-Speech Architectures and Latency

1 note

Speech Encoders and Articulatory Modeling

1 note

Do speech models learn language-specific sounds or universal physics?

Exploring whether self-supervised speech models encode phonetic categories tied to specific languages or instead capture the underlying vocal-tract physics common to all humans. This matters for understanding why these models transfer across languages without retraining.

Explore related Read →

Speech Evaluation

1 note