2 9

Sehyun Choi

syncdoth

https://syncdoth.github.io

AI & ML interests

ML, LLM, Multimodal LLM, Video AI, Controllable AI

Recent Activity

authored a paper 2 months ago

Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers

authored a paper 2 months ago

Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset

authored a paper 2 months ago

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

View all activity

Organizations

authored 6 papers 2 months ago

Cross-Architecture Transfer Learning for Linear-Cost Inference Transformers

Paper • 2404.02684 • Published Apr 3, 2024

Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset

Paper • 2109.07679 • Published Sep 16, 2021

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

Paper • 2311.09174 • Published Nov 15, 2023

AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation

Paper • 2402.10646 • Published Feb 16, 2024

CKBP v2: Better Annotation and Reasoning for Commonsense Knowledge Base Population

Paper • 2304.10392 • Published Apr 20, 2023

Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging

Paper • 2504.18580 • Published Apr 23

updated 2 models over 1 year ago

NucleusAI/RetNet-1B-Hybrid-XATL

Updated Jul 28, 2024 • 3

NucleusAI/RetNet-410m-XATL

Text Generation • Updated Jul 28, 2024 • 8 • 2

liked a Space over 1 year ago

233

MMLU-Pro Leaderboard

🥇

More advanced and challenging multi-task evaluation

liked a model over 1 year ago

ahxt/LiteLlama-460M-1T

Text Generation • Updated Jan 8, 2024 • 6.7k • 165

authored a paper almost 2 years ago

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Paper • 2310.09044 • Published Oct 13, 2023

liked a model almost 2 years ago

likenneth/honest_llama2_chat_7B

Text Generation • Updated Oct 19, 2023 • 1 • 9

liked 2 models about 2 years ago

NucleusAI/nucleus-22B-token-500B

Text Generation • 22B • Updated Oct 12, 2023 • 458 • 25

declare-lab/flacuna-13b-v1.0

Text Generation • Updated Jul 7, 2023 • 2 • 19

commented a paper over 2 years ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 172 •

liked a model over 2 years ago

stabilityai/StableBeluga2

Text Generation • Updated Sep 18, 2023 • 683 • 883

commented a paper over 2 years ago

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 172 •

liked a Space over 2 years ago

13.6k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

liked a dataset over 2 years ago

Open-Orca/OpenOrca

Viewer • Updated Feb 19 • 2.94M • 8.05k • 1.46k

liked a model over 2 years ago

tloen/alpaca-lora-7b

Updated Apr 4, 2023 • 446

Sehyun Choi

AI & ML interests

Recent Activity

Organizations

syncdoth's activity

MMLU-Pro Leaderboard

Open LLM Leaderboard