4 37 94

Richard Lian

richardlian

dachenlian

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

Sentence Transformers is joining Hugging Face!

published a model 16 days ago

lopentu/Llama-3-8B-Taiwan-Llawa-TCxYZL-DPO-Beta-0.01-Instruct

updated a collection 16 days ago

Taiwan Legal LLMs

View all activity

Organizations

upvoted an article 13 days ago

Article

Sentence Transformers is joining Hugging Face!

14 days ago

• 72

upvoted an article about 1 month ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

Oct 1

• 120

upvoted a collection about 1 month ago

The Big Benchmarks Collection

Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 253

upvoted a paper 3 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

upvoted an article 5 months ago

Article

KV Cache from scratch in nanoVLM

Jun 4

• 98

upvoted a paper 5 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 185

upvoted a paper 6 months ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15 • 83

upvoted 2 articles 6 months ago

Article

The Transformers Library: standardizing model definitions

May 15

• 120

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 559

upvoted a collection 6 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 53 items • Updated about 24 hours ago • 235

upvoted 2 articles 7 months ago

Article

Introducing HELMET

Apr 16

• 40

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 340

upvoted 4 articles 8 months ago

Article

Rearchitecting Hugging Face Uploads and Downloads

Nov 26, 2024

• 50

Article

From Files to Chunks: Improving Hugging Face Storage Efficiency

Nov 20, 2024

• 66

Article

Xet is on the Hub

Mar 18

• 78

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 245

upvoted 2 papers 10 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 41

upvoted 2 articles 10 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 217

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 58

Richard Lian

AI & ML interests

Recent Activity

Organizations

richardlian's activity

Sentence Transformers is joining Hugging Face!

Introducing RTEB: A New Standard for Retrieval Evaluation

KV Cache from scratch in nanoVLM

The Transformers Library: standardizing model definitions

Vision Language Models (Better, Faster, Stronger)

Introducing HELMET

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Rearchitecting Hugging Face Uploads and Downloads

From Files to Chunks: Improving Hugging Face Storage Efficiency

Xet is on the Hub

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Train 400x faster Static Embedding Models with Sentence Transformers

Efficient LLM Pretraining: Packed Sequences and Masked Attention