Collections
Discover the best community collections!
Collections including paper arxiv:2109.10282 
						
					
				- 
	
	
	Can large language models explore in-context?Paper • 2403.15371 • Published • 33
- 
	
	
	GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative ModelingPaper • 2403.19655 • Published • 19
- 
	
	
	WavLLM: Towards Robust and Adaptive Speech Large Language ModelPaper • 2404.00656 • Published • 11
- 
	
	
	Enabling Memory Safety of C Programs using LLMsPaper • 2404.01096 • Published • 1
- 
	
	
	CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped WindowsPaper • 2107.00652 • Published • 2
- 
	
	
	Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text RenderingPaper • 2403.09622 • Published • 18
- 
	
	
	Veagle: Advancements in Multimodal Representation LearningPaper • 2403.08773 • Published • 10
- 
	
	
	mPLUG-Owl: Modularization Empowers Large Language Models with MultimodalityPaper • 2304.14178 • Published • 3
- 
	
	
	FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information ExtractionPaper • 2305.02549 • Published • 6
- 
	
	
	FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information ExtractionPaper • 2203.08411 • Published • 1
- 
	
	
	More efficient manual review of automatically transcribed tabular dataPaper • 2306.16126 • Published • 1
- 
	
	
	CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documentsPaper • 2004.12629 • Published • 3
- 
	
	
	Disentangling Writer and Character Styles for Handwriting GenerationPaper • 2303.14736 • Published • 3
- 
	
	
	A Transformer Architecture for Online Gesture Recognition of Mathematical ExpressionsPaper • 2211.02643 • Published • 2
- 
	
	
	A tailored Handwritten-Text-Recognition System for Medieval LatinPaper • 2308.09368 • Published • 3
- 
	
	
	Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabetsPaper • 2303.16256 • Published • 2
- 
	
	
	FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information ExtractionPaper • 2305.02549 • Published • 6
- 
	
	
	FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information ExtractionPaper • 2203.08411 • Published • 1
- 
	
	
	More efficient manual review of automatically transcribed tabular dataPaper • 2306.16126 • Published • 1
- 
	
	
	CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documentsPaper • 2004.12629 • Published • 3
- 
	
	
	Can large language models explore in-context?Paper • 2403.15371 • Published • 33
- 
	
	
	GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative ModelingPaper • 2403.19655 • Published • 19
- 
	
	
	WavLLM: Towards Robust and Adaptive Speech Large Language ModelPaper • 2404.00656 • Published • 11
- 
	
	
	Enabling Memory Safety of C Programs using LLMsPaper • 2404.01096 • Published • 1
- 
	
	
	CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped WindowsPaper • 2107.00652 • Published • 2
- 
	
	
	Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text RenderingPaper • 2403.09622 • Published • 18
- 
	
	
	Veagle: Advancements in Multimodal Representation LearningPaper • 2403.08773 • Published • 10
- 
	
	
	mPLUG-Owl: Modularization Empowers Large Language Models with MultimodalityPaper • 2304.14178 • Published • 3
- 
	
	
	Disentangling Writer and Character Styles for Handwriting GenerationPaper • 2303.14736 • Published • 3
- 
	
	
	A Transformer Architecture for Online Gesture Recognition of Mathematical ExpressionsPaper • 2211.02643 • Published • 2
- 
	
	
	A tailored Handwritten-Text-Recognition System for Medieval LatinPaper • 2308.09368 • Published • 3
- 
	
	
	Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabetsPaper • 2303.16256 • Published • 2
 
				