Project-O1

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

zsqzz authored a paper 6 days ago

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

ZYHowell authored a paper 11 days ago

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

ZYHowell authored a paper 11 days ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

View all activity

zsqzz

authored a paper 6 days ago

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published 8 days ago • 8

ZYHowell

authored 7 papers 11 days ago

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

Paper • 2310.16355 • Published Oct 25, 2023

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

K2-Think: A Parameter-Efficient Reasoning System

Paper • 2509.07604 • Published Sep 9 • 10

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 15 days ago • 117

Snyhlxde

authored a paper 19 days ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published 22 days ago • 26

zsqzz

authored a paper 22 days ago

GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare

Paper • 2510.08872 • Published 25 days ago • 2

Viol2000

authored a paper 2 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 87

Viol2000

authored a paper 4 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24 • 12

Snyhlxde

authored a paper 6 months ago

lmgame-Bench: How Good are LLMs at Playing Games?

Paper • 2505.15146 • Published May 21 • 20

Snyhlxde

authored 2 papers 8 months ago

TrimLLM: Progressive Layer Dropping for Domain-Specific LLMs

Paper • 2412.11242 • Published Dec 15, 2024 • 1

ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration

Paper • 2502.00675 • Published Feb 2 • 2

Snyhlxde

authored a paper 9 months ago

GameArena: Evaluating LLM Reasoning through Live Computer Games

Paper • 2412.06394 • Published Dec 9, 2024 • 1

Viol2000

authored 3 papers 10 months ago

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Paper • 2406.05981 • Published Jun 10, 2024 • 16

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Paper • 2406.07368 • Published Jun 11, 2024 • 2

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37

zsqzz

authored a paper 10 months ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37

AI & ML interests

Recent Activity

Team members 4

Project-O1's activity