Esmaeiliyan's picture

Esmaeiliyan

Mohammadreza

·

https://t.me/AI_360

AI & ML interests

VLM and LLM interest

Recent Activity

upvoted a paper 24 days ago

OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing

upvoted a paper about 1 month ago

Reinforcement Learning on Pre-Training Data

upvoted a paper about 1 month ago

LIMI: Less is More for Agency

View all activity

Organizations

upvoted a paper 24 days ago

OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing

Paper • 2509.24900 • Published 27 days ago • 53

upvoted 3 papers about 1 month ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 67

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 100

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 112

upvoted a paper 4 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19 • 17

upvoted a collection 8 months ago

SFTvsRL Models & Data

This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training. • 7 items • Updated Mar 13 • 9

upvoted a collection 10 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 166

upvoted a paper 10 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 156

upvoted an article 11 months ago

Article

Zero to Hero with the TRL learning link bomb 💣

By

•

Nov 25, 2024

• 7

upvoted a collection 11 months ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 252

upvoted a collection about 1 year ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15 • 123

upvoted 3 papers about 1 year ago

Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization

Paper • 2410.04717 • Published Oct 7, 2024 • 18

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 14

upvoted a collection about 1 year ago

Persian Models

This is the largest collection of Persian models available on Huggingface • 772 items • Updated Aug 23 • 16

upvoted 2 papers about 1 year ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 100

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 25

upvoted an article over 1 year ago

Article

Deploy hundreds of open source models on one GPU using LoRAX

By

•

Jul 18, 2024

• 4

upvoted a collection over 1 year ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 693

upvoted an article over 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 238