8 14 39

Honglin Guo

KYLN24

KYLN24

AI & ML interests

None yet

Recent Activity

new activity 13 days ago

nebius/SWE-rebench:How can I find all instance_ids that come with a Docker image?

new activity about 2 months ago

AgentGym/AgentGym-RL-Data-ID:Upload webarena_train.json

new activity about 2 months ago

AgentGym/AgentGym-RL-Data-ID:Add comprehensive dataset card for AgentGym-RL-Data-ID

View all activity

Organizations

New activity in nebius/SWE-rebench 13 days ago

How can I find all instance_ids that come with a Docker image?

#10 opened 13 days ago by

KYLN24

New activity in AgentGym/AgentGym-RL-Data-ID about 2 months ago

Upload webarena_train.json

#3 opened about 2 months ago by

SixPlusSeven13

Add comprehensive dataset card for AgentGym-RL-Data-ID

#2 opened about 2 months ago by

nielsr

published a dataset about 2 months ago

AgentGym/AgentGym-RL-Data-ID

Viewer • Updated Sep 12 • 186k • 528 • 4

authored 2 papers 2 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4 • 23

New activity in AgentGym/AgentTraj-L 2 months ago

Update sciworld_train.json

#3 opened 2 months ago by

SixPlusSeven13

updated a dataset 2 months ago

AgentGym/AgentGym-RL-Data-ID

Viewer • Updated Sep 12 • 186k • 528 • 4

upvoted a paper 4 months ago

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4 • 23

commented a paper 4 months ago

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4 • 23 •

New activity in mlx-community/Kimi-VL-A3B-Thinking-4bit 6 months ago

Request for a 3bit version

#4 opened 6 months ago by

KYLN24

upvoted a paper 6 months ago

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 23

New activity in OpenVINO/DeepSeek-R1-Distill-Qwen-1.5B-int4-ov 6 months ago

Unable to serve using OpenVINO Model Server

#1 opened 6 months ago by

KYLN24

upvoted a paper 6 months ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2 • 13

liked 2 models 7 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • 16B • Updated Aug 18 • 6.94k • 441

moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • 16B • Updated Jul 30 • 76.7k • 238

authored 2 papers 8 months ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published Mar 2 • 13

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

upvoted an article 8 months ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

Jan 31

• 51

liked a model 8 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 402k • 1.52k

Honglin Guo

AI & ML interests

Recent Activity

Organizations

KYLN24's activity

How can I find all instance_ids that come with a Docker image?

Upload webarena_train.json

Add comprehensive dataset card for AgentGym-RL-Data-ID

Update sciworld_train.json

Request for a 3bit version

Unable to serve using OpenVINO Model Server

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial