Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2510.17800

iVideoGPT: Interactive VideoGPTs are Scalable World Models

Paper • 2405.15223 • Published May 24, 2024 • 17
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24, 2024 • 55
An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90
Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34

Visual context for LLM

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published 11 days ago • 63

about 12 hours ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 26 days ago • 460
Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published 28 days ago • 93
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published 23 days ago • 47
StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 21 days ago • 49

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 73
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 428k • • 12.8k
deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 195k • • 3.98k
krutrim-ai-labs/Krutrim-2-instruct

Updated Mar 17 • 204 • 33

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 28
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 14
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Read Later Stack

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 18 days ago • 31
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published 23 days ago • 9
Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published 26 days ago • 22
DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 19 days ago • 26

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 267 • 96
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

iVideoGPT: Interactive VideoGPTs are Scalable World Models

Paper • 2405.15223 • Published May 24, 2024 • 17
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published May 24, 2024 • 55
An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90
Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 28
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 14
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Visual context for LLM

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published 11 days ago • 63

Read Later Stack

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published 18 days ago • 31
Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published 23 days ago • 9
Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published 26 days ago • 22
DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published 19 days ago • 26

about 12 hours ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published 26 days ago • 460
Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published 28 days ago • 93
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published 23 days ago • 47
StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published 21 days ago • 49

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 267 • 96
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 73
deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 428k • • 12.8k
deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 195k • • 3.98k
krutrim-ai-labs/Krutrim-2-instruct

Updated Mar 17 • 204 • 33

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs