RO-HOON OH
heiscold

AI & ML interests
TTS, Audio Editing, Speech Editing

Organizations
None yet

Music_Generation
- MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models (Paper • 2402.06178 • Published • 15)
- DITTO: Diffusion Inference-Time T-Optimization for Music Generation (Paper • 2401.12179 • Published • 21)
- Fast Timing-Conditioned Latent Audio Diffusion (Paper • 2402.04825 • Published • 8)
- Brain2Music: Reconstructing Music from Human Brain Activity (Paper • 2307.11078 • Published • 41)

TTS, VC
- Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like (Paper • 2402.07383 • Published • 16)
- Matcha-TTS: A fast TTS architecture with conditional flow matching (Paper • 2309.03199 • Published • 13)
- Natural language guidance of high-fidelity text-to-speech with synthetic annotations (Paper • 2402.01912 • Published • 12)
- Fast Timing-Conditioned Latent Audio Diffusion (Paper • 2402.04825 • Published • 8)

models: 0
None public yet

datasets: 0
None public yet