The model supports multilingual transcription, but voice output is only in English or English-like languages.
 
 
Models:
CSM: sesame/csm-1b
CSM-EXPRESSIVA(WHISPERING & NO VC): senstella/csm-expressiva-1b
LLAMA: meta-llama/Llama-3.2-1B
LLAMA-VIKHR: Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct
LLAMA-ULTRAVOX: fixie-ai/ultravox-v0_5-llama-3_2-1b
   
CSM:
CSM-EXPRESSIVA(WHISPERING & NO VC):
LLAMA:
LLAMA-VIKHR:
LLAMA-ULTRAVOX:
Model tree for Derur/csm-models
Base model
meta-llama/Llama-3.2-1B-Instruct
				Finetuned
	
	
Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct
						