AI2 WildBench Leaderboard (V2)
Display and explore a leaderboard of language models
Display and explore a leaderboard of language models
Display LMArena Leaderboard
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Calculate GPU requirements for running LLMs
Identify key entities in text
Explore various text processing tasks with TURNA
Explore and submit LLM benchmarks
Generate text from document images
Analyze document layout from images
Extract text from documents using images or PDFs
Submit model evaluation results to leaderboard
Generate descriptions and answers about images
Efficient quantized retrieval over Wikipedia
Display and analyze reward model evaluation results
Highlight objects in images based on text descriptions
Display image analysis results
VLMEvalKit Evaluation Results Collection
Generate interactive web apps with Streamlit
Visualize Open vs. Proprietary LLM Progress
Upload a PDF and ask questions about its content
Submit and evaluate models on GAIA leaderboard
Identify named entities in text
Explore and analyze code completion benchmarks
Transform text files into a Hugging Face dataset
Generate speech from text in multiple languages
Generate captions and analyze images with various tasks
Display a React app with TypeScript
Video captioning/tracking
Explore visual document retrieval model rankings
In-browser speech recognition w/ word-level timestamps
Generate insights from charts using text prompts
Need to analyze data? Let a Llama-3.1 agent do it for you!
Display MTEB Arena interface
View and submit LLM benchmark evaluations
Detect objects in images using text prompts
VLMEvalKit Eval Results in video understanding benchmark
Extract text from images using various OCR modes
Generate a leaderboard for evaluating language models
remove background from any image
Vote on AI responses to rank models
What happened in open-source AI this year, and whatβs next?
Generate visual data analysis plots
Detect and estimate human poses in images and videos
Generate interactive Jupyter notebooks with user input
Ranking of LLMs for agentic tasks
OmniParser, turn your LLM into GUI agent
Enhance low-light images to improve clarity
PDF to Structured Data powered by Google DeepMind Gemini 2.0
Handwritten Signature Detection
Convert images and text to structured documents
Generate text and speech from text, audio, images, and videos
Detect faces in uploaded images
Convert PDFs to Markdown with open-source parsers
Remove background from images
A Unified Framework for Image Customization
Dolphin Demo
Create and enrich datasets with AI
Display OCRBench leaderboard with model scores
Hand-controlled arpeggiator, drum machine, and visualizer
nanonets ocr2 / olmocr / qwen2vl ocr / aya vision / rolmocr
Display OCRBench leaderboard for text recognition models
camel doc ocr / core ocr / docscope ocr / monkey ocr
nanonets ocr / smoldocling / monkey ocr / typhoon ocr
Run GGUF directly on your browser!
Extract text from images and XML files using OCR models
AI Image Detection Demo
Kontext image editing on FLUX[dev]
Classify text with zero-shot classification
GLiClass for Reranking Sentence Pairs
High-accuracy vision & reasoning for complex tasks
Run code and analyze data in a Jupyter notebook
Demo Space for EfficientLoFTR architecture in Transformers
Convert images to structured documents and answer questions
Interact with a multimodal chatbot using text, audio, images, or video
Vision-Language Models for Document Conversion
Extract and visualize layout from PDFs or images
Compare original and improved OCR text from historical documents
Chat with an AI assistant using text and images
Turkish Benchmark Leaderboard of LLM Models
In-browser tool calling with IBM Granite-4.0
Generate step-by-step solutions to complex queries
Let Us Detect your multilingual hallucinations!