Fix typo(s) in README.md
README.md
CHANGED

@@ -36,7 +36,7 @@ LFM2-Audio is an end-to-end multimodal speech and text language model, and as su
 Our model consists of a pretrained LFM2 model as its multimodal backbone, along with a FastConformer based audio encoder to handle continuous audio inputs, and a RQ-transformer generating discrete Mimi tokens as audio output.

 LFM2-Audio supports two distinct generation routines, each suitable for a set of tasks.
-Interleaved
+Interleaved generation enables real-time speech-to-speech conversational chatbot capabilities, where audio generation latency is key.
 Sequential generation is suited for non-conversational tasks such as ASR or TTS, and allows the model to switch generated modality on the fly.

 ## 📄 Model details