Bjorn Melin
BjornMelin
AI & ML interests
Large Language Models, AI Agents, Multi-Agent Orchestrations, Deep Learning, NLP, Local LLM Optimization.
Recent Activity
updated a collection about 1 month ago: Google
liked a model about 1 month ago: unsloth/gemma-4-26B-A4B-it-GGUF
updated a collection 6 months ago: Rerankers
Organizations
None yet
Datasets
Fine Tuning
- GGUF Model VRAM Calculator
  📈 Running • 68 • Calculate VRAM requirements for LLM models
- Model Memory Utility
  🚀 Runtime error • Agents • Featured • 1.01k • Calculate GPU memory needed for training Hugging Face models
- Can You Run It? LLM version
  🚀 Running • Featured • 1.05k • Calculate GPU needs for running LLMs on your hardware
Legendary VL Models
Smol Models
My favorite smaller models under 10B parameters.
- unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF
  Text Generation • 8B • Updated • 50.1k • 397
- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
  Text Generation • 8B • Updated • 17.2k • 222
- deepseek-ai/DeepSeek-R1-Distill-Llama-8B
  Text Generation • 8B • Updated • 377k • 861
- Qwen/Qwen2.5-Coder-7B-Instruct
  Text Generation • 8B • Updated • 2.43M • 707
Llama
- MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
  Text Generation • 3B • Updated • 120k • 15
- meta-llama/Llama-3.2-3B-Instruct
  Text Generation • 3B • Updated • 2.1M • 2.14k
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • Updated • 10.7M • 5.89k
- MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
  Text Generation • 8B • Updated • 133k • 34
LLMs
- deepseek-ai/DeepSeek-V3
  Text Generation • 685B • Updated • 1.19M • 4.07k
- sentence-transformers/static-retrieval-mrl-en-v1
  Sentence Similarity • Updated • 57
- internlm/internlm3-8b-instruct
  Text Generation • Updated • 77.6k • 231
- NovaSky-AI/Sky-T1-32B-Preview
  Text Generation • 33B • Updated • 39 • 551
Embedding Models
Single 4090 Laptop GPU
- nvidia/OpenReasoning-Nemotron-32B
  Text Generation • 33B • Updated • 1.86k • 124
- Qwen/Qwen3-32B-AWQ
  Text Generation • 33B • Updated • 901k • 133
- OpenHands/openhands-lm-32b-v0.1
  Text Generation • 33B • Updated • 81 • 391
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  Text Generation • 15B • Updated • 567k • 648
Leaderboards
Coding Models
Google
- google/gemma-3-27b-it-qat-q4_0-gguf
  Image-Text-to-Text • 27B • Updated • 514 • 400
- unsloth/gemma-3-27b-it-GGUF
  Image-Text-to-Text • 27B • Updated • 21.6k • 202
- google/gemma-3-27b-it
  Image-Text-to-Text • 27B • Updated • 1.13M • 1.97k
- google/gemma-3n-E4B-it
  Image-Text-to-Text • Updated • 27.8k • 914
Qwen
Rerankers
- Qwen/Qwen3-Reranker-0.6B
  Text Ranking • 0.6B • Updated • 1.4M • 352
- jinaai/jina-reranker-m0
  Text Classification • 2B • Updated • 82.2k • 118
- jinaai/jina-reranker-v2-base-multilingual
  Text Ranking • 0.3B • Updated • 1.58M • 351
- jinaai/jina-embeddings-v2-base-en
  Feature Extraction • 0.1B • Updated • 48.7k • 732