Eugene Klimov's picture

Eugene Klimov

Slach

·

Slach

AI & ML interests

None yet

Recent Activity

updated a collection about 19 hours ago

usefull opensource models

updated a collection 6 days ago

usefull opensource models

liked a model 7 days ago

moonshotai/Kimi-K2-Thinking

View all activity

Organizations

None yet

upvoted a collection about 1 month ago

usefull opensource models

41 items • Updated about 19 hours ago • 1

upvoted a collection 3 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated Sep 1 • 130

upvoted a paper 3 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 87

upvoted 2 papers 4 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 123

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 118

upvoted a collection 6 months ago

Qwen3

84 items • Updated Aug 6 • 1.42k

upvoted an article 6 months ago

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

By

and 7 others •

May 7

• 56

upvoted 2 collections 7 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 13 days ago • 231

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 13 days ago • 35

upvoted 2 papers 8 months ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 72

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 232

upvoted 2 papers 9 months ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published Feb 25 • 67

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 174

upvoted a collection 12 months ago

Hymba

A series of Hybrid Small Language Models. • 3 items • Updated 4 days ago • 32

upvoted a collection about 1 year ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 4 days ago • 61

upvoted a paper over 1 year ago

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

Paper • 2408.03910 • Published Aug 7, 2024 • 18

upvoted 3 collections over 1 year ago

AQLM

AQLM quantized LLMs • 21 items • Updated Feb 28 • 46

AQLM+PV

Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression": https://arxiv.org/abs/2405.14852 • 26 items • Updated Feb 28 • 21

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 79

upvoted a paper over 1 year ago

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 36