2 56 14

Logan Bolton

loganbolton

LoganBolton

AI & ML interests

Computer Vision, Explainable AI, NLP

Recent Activity

upvoted a paper 17 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

upvoted a paper about 1 month ago

Qwen3-Omni Technical Report

liked a Space about 1 month ago

taesiri/arXiv

View all activity

Organizations

upvoted a paper 17 days ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 127

upvoted 2 papers about 1 month ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 133

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Paper • 2509.16941 • Published Sep 21 • 20

upvoted a paper 2 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 202

upvoted a paper 3 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 65

upvoted a collection 3 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 544

upvoted 5 papers 3 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30 • 46

Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Paper • 2507.14137 • Published Jul 18 • 34

upvoted 9 papers 4 months ago

Voxtral

Paper • 2507.13264 • Published Jul 17 • 29

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17 • 75

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15 • 32

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 42

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13 • 84

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers

Paper • 2507.10787 • Published Jul 14 • 11

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

One Token to Fool LLM-as-a-Judge

Paper • 2507.08794 • Published Jul 11 • 31

Logan Bolton

AI & ML interests

Recent Activity

Organizations

loganbolton's activity