MOHAMMED ABDALLAH's picture

57 408

MOHAMMED ABDALLAH PRO

melsiddieg

·

melsiddieg

AI & ML interests

biomedical nlp, knowledge graphs, genomics

Recent Activity

liked a model 5 days ago

IQuestLab/IQuest-Coder-V1-40B-Instruct

liked a model 8 days ago

tencent/WeDLM-8B-Instruct

liked a model 11 days ago

nvidia/canary-qwen-2.5b

View all activity

Organizations

None yet

upvoted a paper 3 months ago

Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models

Paper • 2510.03561 • Published Oct 3, 2025 • 24

upvoted 2 collections 4 months ago

MultiCaRe

MultiCaRe: Open-Source Clinical Case Dataset • 4 items • Updated Sep 25, 2025 • 16

FastVLM

Efficient Vision Encoding for Vision Language Models • 9 items • Updated Sep 2, 2025 • 106

upvoted an article 7 months ago

Article

Introducing the SQL Console on Datasets

Sep 17, 2024

•

25

upvoted a collection 7 months ago

SARD: Synthetic Arabic Recognition Dataset

A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance VLM for Arabic Texts • 2 items • Updated May 19, 2025 • 5

upvoted a collection 10 months ago

BD3-LMs

https://m-arriola.com/bd3lms/ • 4 items • Updated Sep 2, 2025 • 27

upvoted a paper 11 months ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30, 2025 • 23

upvoted a collection about 1 year ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3, 2025 • 25

upvoted a paper over 1 year ago

Med42-v2: A Suite of Clinical LLMs

Paper • 2408.06142 • Published Aug 12, 2024 • 52

upvoted a collection over 1 year ago

FalconMamba 7B

This collection features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. • 15 items • Updated Nov 6, 2025 • 34

upvoted an article over 1 year ago

Article

Welcome Falcon Mamba: The first strong attention-free 7B model

+4

Aug 12, 2024

•

113

upvoted 2 papers over 1 year ago

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Paper • 2407.16607 • Published Jul 23, 2024 • 23

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

upvoted an article over 1 year ago

Article

Introducing the Open Arabic LLM Leaderboard

+3

May 14, 2024

•

101

upvoted 2 papers almost 2 years ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

upvoted a collection almost 2 years ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 89

upvoted 3 papers almost 2 years ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 627

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 134

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Paper • 2402.08609 • Published Feb 13, 2024 • 36