Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Mahmud ElHuseyni 🇵🇸's picture

Mahmud ElHuseyni 🇵🇸

MElHuseyni

dvilasuero's profile picture

ozayezerceli's profile picture

frascuchon's profile picture

·

mahmut_cv
MElHuseyni
melhussieni

AI & ML interests

Computer Vision NLP Machine Learning

Organizations

MElHuseyni 's collections 11

liqi03/whisper-large-v3-tr-fleurs3

2B • Updated Aug 8, 2024 • 14
Darveht/zenvion-voice-detector-v0.3

Audio Classification • Updated 29 days ago • 115 • 1

Instance Segmentation

fcakyon/yolov5n-seg-v7.0

Updated Dec 20, 2022 • 6

Image Segmentation Models 🍪

nvidia/segformer-b5-finetuned-cityscapes-1024-1024

Image Segmentation • Updated Aug 9, 2022 • 87.7k • • 36
nvidia/segformer-b0-finetuned-ade-512-512

Image Segmentation • 3.75M • Updated Jan 14, 2024 • 197k • • 178
facebook/maskformer-swin-base-ade

Image Segmentation • Updated Nov 10, 2022 • 2.38k • • 13
facebook/maskformer-swin-base-coco

Image Segmentation • 0.1B • Updated May 3, 2024 • 870 • 26

Object Detection Models 🍉

atalaydenknalbant/Yolov13

Object Detection • Updated Sep 15 • 3.91k • 19
Ultralytics/YOLO11

Updated Aug 4 • 6.79k • 156
nielsr/yolov12n

Object Detection • Updated Feb 20 • 5
Ultralytics/YOLOv8

Object Detection • Updated Jan 11 • 6.66k • 300

Vision Language Leader-boards 📈

Running

44

OCRBenchv2 Leaderboard

🏆

44

Display OCRBench leaderboard for text recognition models
Running

192

Vidore Leaderboard

🥇

192

Browse and compare visual document retrieval models
Running on CPU Upgrade

954

Open VLM Leaderboard

🌎

954

VLMEvalKit Evaluation Results Collection
Running

Featured

558

Vision Arena (Testing VLMs side-by-side)

🖼

558

Display image analysis results

LLM Inference 🚀

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

Paper • 2401.08671 • Published Jan 9, 2024 • 15
NanoFlow: Towards Optimal Large Language Model Serving Throughput

Paper • 2408.12757 • Published Aug 22, 2024 • 19
richard-park/llama3-deepspeed-v1.0

Text Generation • 8B • Updated Jul 4, 2024 • 15 • • 1

Arabic Models (LLM, VLM, Multimodel)

NAMAA-Space/GATE-Reranker-V1

Text Ranking • 0.1B • Updated Apr 3 • 84 • 10
NAMAA-Space/gliner_arabic-v2.1

Token Classification • Updated Apr 13 • 61 • 15
NAMAA-Space/AraModernBert-Base-V1.0

Fill-Mask • 0.1B • Updated Mar 3 • 246 • 13
NAMAA-Space/AraModernBert-Base-STS

Sentence Similarity • 0.1B • Updated Mar 9 • 31 • 6

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8 • 47.3k • 567
OpenGVLab/InternVL3-1B

Image-Text-to-Text • 0.9B • Updated Sep 11 • 112k • 77
OpenGVLab/InternVL3-2B

Image-Text-to-Text • 2B • Updated Sep 11 • 26.7k • 43
LiquidAI/LFM2-VL-450M

Image-Text-to-Text • 0.5B • Updated 25 days ago • 5.49k • 140

OCR Models 👀️📃

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31 • 847k • 1.17k
reducto/RolmOCR

Image-to-Text • 8B • Updated Apr 2 • 3.71k • 569
numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Nov 13 • 178k • 216
ChatDOC/OCRFlux-3B

Image-to-Text • 4B • Updated Jul 9 • 156k • 358

Visual Embedding Models 🖼️

jinaai/jina-embeddings-v4

Visual Document Retrieval • 4B • Updated Sep 2 • 113k • 441
vidore/colqwen2.5-v0.2

Visual Document Retrieval • Updated Jun 16 • 55.2k • 91
nomic-ai/colnomic-embed-multimodal-7b

Visual Document Retrieval • Updated Apr 15 • 3.24k • 96
nvidia/llama-nemoretriever-colembed-3b-v1

Visual Document Retrieval • 4B • Updated 7 days ago • 829 • 69

Speech Models 🎧

ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 137 • 417
AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 55
OpenMOSS-Team/SpeechGPT-7B-cm

Text Generation • Updated Sep 15, 2023 • 86 • 7
parler-tts/parler_tts_mini_v0.1

Text-to-Speech • 0.6B • Updated Apr 30, 2024 • 2.99k • 358

liqi03/whisper-large-v3-tr-fleurs3

2B • Updated Aug 8, 2024 • 14
Darveht/zenvion-voice-detector-v0.3

Audio Classification • Updated 29 days ago • 115 • 1

Arabic Models (LLM, VLM, Multimodel)

NAMAA-Space/GATE-Reranker-V1

Text Ranking • 0.1B • Updated Apr 3 • 84 • 10
NAMAA-Space/gliner_arabic-v2.1

Token Classification • Updated Apr 13 • 61 • 15
NAMAA-Space/AraModernBert-Base-V1.0

Fill-Mask • 0.1B • Updated Mar 3 • 246 • 13
NAMAA-Space/AraModernBert-Base-STS

Sentence Similarity • 0.1B • Updated Mar 9 • 31 • 6

Instance Segmentation

fcakyon/yolov5n-seg-v7.0

Updated Dec 20, 2022 • 6

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8 • 47.3k • 567
OpenGVLab/InternVL3-1B

Image-Text-to-Text • 0.9B • Updated Sep 11 • 112k • 77
OpenGVLab/InternVL3-2B

Image-Text-to-Text • 2B • Updated Sep 11 • 26.7k • 43
LiquidAI/LFM2-VL-450M

Image-Text-to-Text • 0.5B • Updated 25 days ago • 5.49k • 140

Image Segmentation Models 🍪

nvidia/segformer-b5-finetuned-cityscapes-1024-1024

Image Segmentation • Updated Aug 9, 2022 • 87.7k • • 36
nvidia/segformer-b0-finetuned-ade-512-512

Image Segmentation • 3.75M • Updated Jan 14, 2024 • 197k • • 178
facebook/maskformer-swin-base-ade

Image Segmentation • Updated Nov 10, 2022 • 2.38k • • 13
facebook/maskformer-swin-base-coco

Image Segmentation • 0.1B • Updated May 3, 2024 • 870 • 26

OCR Models 👀️📃

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31 • 847k • 1.17k
reducto/RolmOCR

Image-to-Text • 8B • Updated Apr 2 • 3.71k • 569
numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Nov 13 • 178k • 216
ChatDOC/OCRFlux-3B

Image-to-Text • 4B • Updated Jul 9 • 156k • 358

Object Detection Models 🍉

atalaydenknalbant/Yolov13

Object Detection • Updated Sep 15 • 3.91k • 19
Ultralytics/YOLO11

Updated Aug 4 • 6.79k • 156
nielsr/yolov12n

Object Detection • Updated Feb 20 • 5
Ultralytics/YOLOv8

Object Detection • Updated Jan 11 • 6.66k • 300

Visual Embedding Models 🖼️

jinaai/jina-embeddings-v4

Visual Document Retrieval • 4B • Updated Sep 2 • 113k • 441
vidore/colqwen2.5-v0.2

Visual Document Retrieval • Updated Jun 16 • 55.2k • 91
nomic-ai/colnomic-embed-multimodal-7b

Visual Document Retrieval • Updated Apr 15 • 3.24k • 96
nvidia/llama-nemoretriever-colembed-3b-v1

Visual Document Retrieval • 4B • Updated 7 days ago • 829 • 69

Vision Language Leader-boards 📈

Running

44

OCRBenchv2 Leaderboard

🏆

44

Display OCRBench leaderboard for text recognition models
Running

192

Vidore Leaderboard

🥇

192

Browse and compare visual document retrieval models
Running on CPU Upgrade

954

Open VLM Leaderboard

🌎

954

VLMEvalKit Evaluation Results Collection
Running

Featured

558

Vision Arena (Testing VLMs side-by-side)

🖼

558

Display image analysis results

Speech Models 🎧

ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 137 • 417
AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 55
OpenMOSS-Team/SpeechGPT-7B-cm

Text Generation • Updated Sep 15, 2023 • 86 • 7
parler-tts/parler_tts_mini_v0.1

Text-to-Speech • 0.6B • Updated Apr 30, 2024 • 2.99k • 358

LLM Inference 🚀

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

Paper • 2401.08671 • Published Jan 9, 2024 • 15
NanoFlow: Towards Optimal Large Language Model Serving Throughput

Paper • 2408.12757 • Published Aug 22, 2024 • 19
richard-park/llama3-deepspeed-v1.0

Text Generation • 8B • Updated Jul 4, 2024 • 15 • • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs