Article mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL • Sep 11 • 25
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 22 items • Updated 13 days ago • 119
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29 • 92
MatMulfree LM Collection Pre-trained models for MatMulfree LM. • 4 items • Updated Jun 10, 2024 • 26
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 64
Transformers meet Neural Algorithmic Reasoners Paper • 2406.09308 • Published Jun 13, 2024 • 44
Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published Jun 4, 2024 • 41
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU Paper • 2403.06504 • Published Mar 11, 2024 • 55
MoAI: Mixture of All Intelligence for Large Language and Vision Models Paper • 2403.07508 • Published Mar 12, 2024 • 77
LongAlign: A Recipe for Long Context Alignment of Large Language Models Paper • 2401.18058 • Published Jan 31, 2024 • 22
AppAgent: Multimodal Agents as Smartphone Users Paper • 2312.13771 • Published Dec 21, 2023 • 54
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 81
LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models Paper • 2308.16137 • Published Aug 30, 2023 • 40