Ubec

hrw

https://github.com/UbeCc/

UbeCc

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

hrw/Omni-7B-grpo-mix-0920

published a model about 1 month ago

hrw/Omni-7B-grpo-mix-0920

updated a model about 1 month ago

hrw/Omni-7B-sft-mix-0920

upvoted a paper about 1 month ago

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Paper • 2509.07403 • Published Sep 9 • 58

upvoted 2 papers about 2 months ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3 • 24

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

upvoted a paper 8 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 75

upvoted a paper 9 months ago

SURGE: On the Potential of Large Language Models as General-Purpose Surrogate Code Executors

Paper • 2502.11167 • Published Feb 16 • 10

upvoted 2 papers 12 months ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 68

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation

Paper • 2411.00412 • Published Nov 1, 2024 • 10

upvoted a collection about 1 year ago

🎯DART-Math

Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math • 20 items • Updated Feb 19 • 7