Artificial Intelligence @lemmy.sdf.org Tea @programming.dev 2d ago

OCTAVE: the first text-to-speech model that understands what it’s saying.

Source: www.hume.ai (Hume AI)

Today we’re launching OCTAVE, the first of a new generation of text-to-speech models. Built on the backbone of an LLM, OCTAVE understands what it’s saying, unlocking a range of new AI voice capabilities.

Today we’re launching Octave (Omni-capable text and voice engine), the first LLM for text-to-speech. Unlike conventional TTS that merely “reads” words, Octave is a speech-language model that understands what words mean in context, unlocking a new level of expressiveness and nuance as well as new AI voice capabilities. It can act out characters, generate voices from prompts, and take instructions to modify the emotion and style of a given utterance.
