 weleen
			's Collections
			weleen
			's Collections
			
			
		aigc
		
	updated
			
 
				
				
 - Emu Video: Factorizing Text-to-Video Generation by Explicit Image
  Conditioning- 
			Paper
			 •- 
			2311.10709
			 •
			Published
				
			•- 
				26
			 
 - Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and
  Attribute Control- 
			Paper
			 •- 
			2405.12970
			 •
			Published
				
			•- 
				25
			 
 - FIFO-Diffusion: Generating Infinite Videos from Text without Training- 
			Paper
			 •- 
			2405.11473
			 •
			Published
				
			•- 
				57
			 
   - stabilityai/stable-diffusion-3-medium- 
			Text-to-Image
			 • 
		
	
				Updated
					
				
				•- 
					12.3k
				 • 
			
	
				•- 
					4.86k
				 
   - stabilityai/stable-diffusion-3-medium-tensorrt- 
			Text-to-Image
			 • 
		
	
				Updated
					
				
				•- 
					34
				
	
				 •- 
					150
				 
 - DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis- 
			Paper
			 •- 
			2405.14224
			 •
			Published
				
			•- 
				16
			 
 - Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory
  Score Matching- 
			Paper
			 •- 
			2405.11252
			 •
			Published
				
			•- 
				16
			 
 - OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any
  Person- 
			Paper
			 •- 
			2407.16224
			 •
			Published
				
			•- 
				29
			 
 - 
			Paper
			 •- 
			2407.15595
			 •
			Published
				
			•- 
				14
			 
 - Video Diffusion Alignment via Reward Gradients- 
			Paper
			 •- 
			2407.08737
			 •
			Published
				
			•- 
				49
			 
 - GTA: A Benchmark for General Tool Agents- 
			Paper
			 •- 
			2407.08713
			 •
			Published
				
			•- 
				17
			 
 - Lazy Diffusion Transformer for Interactive Image Editing- 
			Paper
			 •- 
			2404.12382
			 •
			Published
 - DDK: Distilling Domain Knowledge for Efficient Large Language Models- 
			Paper
			 •- 
			2407.16154
			 •
			Published
				
			•- 
				22
			 
 - SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View
  Consistency- 
			Paper
			 •- 
			2407.17470
			 •
			Published
				
			•- 
				16
			 
 - DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized
  Deepfake Detection- 
			Paper
			 •- 
			2406.00856
			 •
			Published
				
			•- 
				12
			 
 - Tora: Trajectory-oriented Diffusion Transformer for Video Generation- 
			Paper
			 •- 
			2407.21705
			 •
			Published
				
			•- 
				27
			 
 - The Llama 3 Herd of Models- 
			Paper
			 •- 
			2407.21783
			 •
			Published
				
			•- 
				116
			 
 - VidGen-1M: A Large-Scale Dataset for Text-to-video Generation- 
			Paper
			 •- 
			2408.02629
			 •
			Published
				
			•- 
				15
			 
 - xGen-MM (BLIP-3): A Family of Open Large Multimodal Models- 
			Paper
			 •- 
			2408.08872
			 •
			Published
				
			•- 
				100
			 
 - JPEG-LM: LLMs as Image Generators with Canonical Codec Representations- 
			Paper
			 •- 
			2408.08459
			 •
			Published
				
			•- 
				45
			 
 - TurboEdit: Instant text-based image editing- 
			Paper
			 •- 
			2408.08332
			 •
			Published
				
			•- 
				20
			 
 - Scalable Autoregressive Image Generation with Mamba- 
			Paper
			 •- 
			2408.12245
			 •
			Published
				
			•- 
				26
			 
 - MegaFusion: Extend Diffusion Models towards Higher-resolution Image
  Generation without Further Tuning- 
			Paper
			 •- 
			2408.11001
			 •
			Published
				
			•- 
				13
			 
 - TraDiffusion: Trajectory-Based Training-Free Image Generation- 
			Paper
			 •- 
			2408.09739
			 •
			Published
				
			•- 
				9