Maxwell-Jia
's Collections
Daily arXiv
updated
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Paper
•
2407.06027
•
Published
•
11
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
•
2407.09025
•
Published
•
139
Toto: Time Series Optimized Transformer for Observability
Paper
•
2407.07874
•
Published
•
33
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Paper
•
2407.09413
•
Published
•
11
Paper
•
2407.10671
•
Published
•
166
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Paper
•
2407.11895
•
Published
•
7
Scaling Granite Code Models to 128K Context
Paper
•
2407.13739
•
Published
•
21
Vision language models are blind
Paper
•
2407.06581
•
Published
•
84
Data Mixture Inference: What do BPE Tokenizers Reveal about their
Training Data?
Paper
•
2407.16607
•
Published
•
23
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
•
2407.18961
•
Published
•
40
Self-Training with Direct Preference Optimization Improves
Chain-of-Thought Reasoning
Paper
•
2407.18248
•
Published
•
33
SAM 2: Segment Anything in Images and Videos
Paper
•
2408.00714
•
Published
•
116
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
79
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation
Extraction on an Academic Budget
Paper
•
2408.00103
•
Published
•
23
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge
Conflicts in LLM
Paper
•
2408.12076
•
Published
•
12
The AI Scientist: Towards Fully Automated Open-Ended Scientific
Discovery
Paper
•
2408.06292
•
Published
•
126
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs
with 1000x Input Token Reduction
Paper
•
2409.17422
•
Published
•
25
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case
Study
Paper
•
2409.17580
•
Published
•
9
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for
Data-Driven Scientific Discovery
Paper
•
2410.05080
•
Published
•
21
Cut Your Losses in Large-Vocabulary Language Models
Paper
•
2411.09009
•
Published
•
49
Open-Sora Plan: Open-Source Large Video Generation Model
Paper
•
2412.00131
•
Published
•
33
o1-Coder: an o1 Replication for Coding
Paper
•
2412.00154
•
Published
•
44
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
•
2412.03555
•
Published
•
133
Chain-of-Retrieval Augmented Generation
Paper
•
2501.14342
•
Published
•
58
Critique Fine-Tuning: Learning to Critique is More Effective than
Learning to Imitate
Paper
•
2501.17703
•
Published
•
58
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper
•
2501.18585
•
Published
•
61
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and
Understanding
Paper
•
2501.18362
•
Published
•
23
Diverse Inference and Verification for Advanced Reasoning
Paper
•
2502.09955
•
Published
•
18
FoNE: Precise Single-Token Number Embeddings via Fourier Features
Paper
•
2502.09741
•
Published
•
15
Injecting Domain-Specific Knowledge into Large Language Models: A
Comprehensive Survey
Paper
•
2502.10708
•
Published
•
4
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Paper
•
2502.14922
•
Published
•
32
LightThinker: Thinking Step-by-Step Compression
Paper
•
2502.15589
•
Published
•
31
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper
•
2503.14476
•
Published
•
141
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large
Language Models
Paper
•
2503.24235
•
Published
•
54
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for
Large Language Models
Paper
•
2503.24377
•
Published
•
18
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning
Systems in LLMs
Paper
•
2507.09477
•
Published
•
84
A Survey of Context Engineering for Large Language Models
Paper
•
2507.13334
•
Published
•
257
Reinforced Visual Perception with Tools
Paper
•
2509.01656
•
Published
•
31