5 1 3

Canwen Xu

canwenxu

https://www.canwenxu.net

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

liked a dataset about 1 year ago

Vezora/Open-Critic-GPT

liked a model over 1 year ago

bosonai/Higgs-Llama-3-70B

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 6 days ago • 108

liked a dataset about 1 year ago

Vezora/Open-Critic-GPT

Viewer • Updated Jul 28, 2024 • 55.1k • 122 • 96

liked a model over 1 year ago

bosonai/Higgs-Llama-3-70B

Text Generation • 71B • Updated Aug 20, 2024 • 8.36k • • 227

New activity in open-llm-leaderboard/open_llm_leaderboard over 2 years ago

Models for Human/GPT4 Eval

❤️ 2

#65 opened over 2 years ago by

natolambert

New activity in HuggingFaceH4/Falcon-vs-LLaMA over 2 years ago

Wrong way of using Baize v2

#3 opened over 2 years ago by

canwenxu

Wrong way of using Baize v2

#2 opened over 2 years ago by

canwenxu

authored 2 papers over 2 years ago

Small Models are Valuable Plug-ins for Large Language Models

Paper • 2305.08848 • Published May 15, 2023 • 4

Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data

Paper • 2304.01196 • Published Apr 3, 2023

liked a Space over 2 years ago

379

Chat with Baize

🐲

authored a paper over 2 years ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 34

Canwen Xu

AI & ML interests

Recent Activity

Organizations

canwenxu's activity

Models for Human/GPT4 Eval

Wrong way of using Baize v2

Wrong way of using Baize v2

Chat with Baize