Huseyin ABANOZ's picture

Huseyin ABANOZ

habanoz

·

AI & ML interests

LLM, RL

Recent Activity

upvoted an article 5 days ago

Finally, a Replacement for BERT: Introducing ModernBERT

liked a dataset about 2 months ago

metu-yks/yksbench

liked a dataset about 2 months ago

AtlasPolat/yks2024

View all activity

Organizations

None yet

upvoted an article 5 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 705

upvoted a collection about 2 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated 28 days ago • 292

upvoted 2 collections 4 months ago

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 17

DeepSeek-R1

10 items • Updated May 29 • 807

upvoted a collection 5 months ago

Qwen3-Reranker

3 items • Updated Jul 21 • 64

upvoted a collection 6 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Sep 18 • 94

upvoted a paper 12 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 54

upvoted an article about 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 420

upvoted a paper over 1 year ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

upvoted an article over 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 292

upvoted a paper over 1 year ago

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 26

upvoted 2 articles over 1 year ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 189

Article

CodeGemma - an official Google release for code LLMs

Apr 9, 2024

• 103

upvoted 3 collections over 1 year ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 92

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6, 2024 • 130

Zephyr 7B

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12, 2024 • 152

upvoted a paper over 1 year ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 46

upvoted 3 papers about 2 years ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 63

Effective Long-Context Scaling of Foundation Models

Paper • 2309.16039 • Published Sep 27, 2023 • 30

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 246