BuiDoan's Collections
		Great paper
 - Paper 2410.05258 • 179
 - PaliGemma 2: A Family of Versatile VLMs for Transfer • Paper 2412.03555 • 133
 - VisionZip: Longer is Better but Not Necessary in Vision Language Models • Paper 2412.04467 • 118
 - o1-Coder: an o1 Replication for Coding • Paper 2412.00154 • 44
 - SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance • Paper 2412.02687 • 113
 - TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video • Paper 2411.18671 • 20
 - Fully Open Source Moxin-7B Technical Report • Paper 2412.06845 • 11
 - Small Language Models: Survey, Measurements, and Insights • Paper 2409.15790 • 2
 - Paper 2407.10671 • 166
 - Paper 2412.08905 • 121
 - Apollo: An Exploration of Video Understanding in Large Multimodal Models • Paper 2412.10360 • 147
 - Byte Latent Transformer: Patches Scale Better Than Tokens • Paper 2412.09871 • 108
 - Paper 2412.15115 • 376
 - Search-o1: Agentic Search-Enhanced Large Reasoning Models • Paper 2501.05366 • 102
 - rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking • Paper 2501.04519 • 285
 - MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper 2501.08313 • 298
 - Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models • Paper 2501.09686 • 41
 - DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper 2501.12948 • 420
 - SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training • Paper 2501.17161 • 123
 - Baichuan-Omni-1.5 Technical Report • Paper 2501.15368 • 62
 - OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models • Paper 2502.01061 • 221
 - The Differences Between Direct Alignment Algorithms are a Blur • Paper 2502.01237 • 113
 - Hermes 3 Technical Report • Paper 2408.11857 • 56
 - From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens • Paper 2502.18890 • 30
 - SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking • Paper 2503.00955 • 28
 - InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models • Paper 2504.10479 • 298
 - Tina: Tiny Reasoning Models via LoRA • Paper 2504.15777 • 56
 - Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper 2505.03335 • 185