- Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions (Paper 2309.10150)
- In-Context Pretraining: Language Modeling Beyond Document Boundaries (Paper 2310.10638)
- Farzi Data: Autoregressive Data Distillation (Paper 2310.09983)
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents (Paper 2311.05437)
Mat Miller
matdmiller
AI & ML interests: None yet