AGI
-
🚀112
-
teknium/Mistral-Trismegistus-7B
Text Generation • Updated • 109 • 233 -
Latent Consistency Models
⚡345 -
XTTS
🐸2.77kGenerate speech from text using a reference voice
-
VALL E X
🎙366Generate audio from text using voice prompts
-
LLaMA Board
🦙216Fine-tuning large language model with Gradio UI
-
MusicGen
🎵5.07kGenerate music from text descriptions and optional melodies
-
MagicAnimate
💃1.43kGenerate animated videos from images and motion sequences
-
Seamless M4T v2
📞516Translate speech and text between languages
-
Stable Video Diffusion 1.1
📺1.98kGenerate a video from a single image
-
Video LLaVA
📚234 -
Mustango
🐢167 -
OpenAI TTS New
📊562 -
3D Arena
🏢354Vote and view 3D leaderboard
-
Distil Whisper Web
👀227Convert spoken words into text
-
Zero123++ Demo Space
🌒286 -
InstaFlow
🐨108 -
CLIP Interrogator 2
🕵1.33kGenerate descriptive prompts from images
-
ZoeDepth
🦀804Generate 3D models from images
-
LooseControl
📚41 -
Enhance This DemoFusion SDXL
🔍302Creative Upscaler High-Res Image Generation DemoFusion SDXL
-
axiong/PMC_LLaMA_13B
Text Generation • Updated • 600 • 33 -
axiong/pmc_llama_instructions
Viewer • Updated • 514k • 143 • 33 -
med-flamingo/med-flamingo
Updated • 55 -
wikimedia/wikisource
Viewer • Updated • 1.66M • 3.48k • 81 -
OutfitAnyone
🏢2.76kTry on clothes virtually with any model
-
Pixel Aligned Language Models
Paper • 2312.09237 • Published • 18 -
Gemini Playground
💬189Chat with Gemini Pro and upload images for responses
-
Singing Voice Conversion
🎼261Transform your voice into a singer's
-
Text To Speech
🔥59Generate speech from text with different voices
-
Text To Audio
🌖29 -
NaturalSpeech2
🎧52Generate speech with cloned timbre
-
AnyDoor Online
👁209Teleport objects into new backgrounds using masks
-
MotionCtrl
📊58 -
MotionGPT
🏃118Generate human motion from text or audio
-
MotionGPT: Human Motion as a Foreign Language
Paper • 2306.14795 • Published • 27 -
GPT-Academic
😻435Generate academic responses using GPT
-
M2UGen Demo
💻94 -
VCoder
✌63 -
IP-Adapter-FaceID
🧑988Generate images with your face
-
AnyText
👁268Generate images with text and edit existing images
-
osunlp/Mind2Web
Viewer • Updated • 253 • 1.19k • 113 -
FaceChain
🏆144Display Hugging Face status and loading animation
-
Dreamtalk
😛229Animate a portrait from audio speech
-
I2VGen-XL
🔥104 -
ReplaceAnything
📚954Replace objects in images using prompts or reference images
-
PhotoMaker
📷1.93kGenerate customized realistic human photos from images and prompts
-
Resemble Enhance
🚀424Enhance and denoise your audio files
-
DiffusionGPT
👁6Generate images from text prompts
-
DiffusionGPT XL
🐢15 -
InstantID
😻3.52kGenerate images preserving face identity
-
DuckDB NSQL 7B
🏢42Generate DuckDB SQL queries from natural language
-
InstructIR
💻205Improve images using text instructions
-
Image to Music v2
🎺560Get a music sample inspired by the mood of an image
-
BRIA RMBG 1.4
💻847Remove background from images
-
YOLO World
🔥491Detect objects in images or videos
-
Vision Arena (Testing VLMs side-by-side)
🖼557Display image analysis results
-
Stable Cascade
👁1.68kGenerate high-resolution images from text prompts
-
Diffusion Transformers (DiT)
🚀68 -
SDXL Lightning
⚡466Super-fast image generation on SDX
-
YOLO-World + EfficientSAM
🔥257Identify and segment objects in images and videos
-
Differential Diffusion
😻127Edit images using prompts and change maps
-
YOLOv9 Object Detection w/ Transformers.js
🖼53In-browser object detection w/ YOLOv9 and Transformers.js
-
Depth Anything Video
👁75Generate depth maps for video frames
-
Depth Anything
🌖552Generate depth maps from images
-
MeloTTS
🗣467Fast, efficient, & multilingual text-to-speech
-
Playground V2.5
🌍1.13kGenerate highly aesthetic images
-
MoMask
🎭113Generate 3D human motions from text descriptions
-
PhotoMaker Style
📷656Generate customized face images with styles
-
TCD
📈56Official Demo Space for Trajectory Consistency Distillation
-
TripoSR
🐳816 -
Magi Demo
🏢37Generate comic transcriptions from images
-
Animagine XL 3.1
🌍1.38kThe most opinionated, anime-themed SDXL model
-
Img2img Turbo Sketch
📚205 -
APISR
🏃134Enhance anime images with super-resolution
-
DynamiCrafter
🐨166Generate animated videos from images and text prompts
-
DynamiCrafter
🐨291Generate animated videos from images and prompts
-
DragAPart
🏢9 -
AI Comic Factory
👩10.8kCreate your own AI comic with a single prompt
-
GRM
🏆85Display a live demo website
-
AnyV2V
🎥73Video Editing
-
DesignEdit
🌿49 -
InstructPix2Pix
🚀1.54kTransform images based on text instructions
-
Parler-TTS
🥖831High-fidelity Text-To-Speech
-
MagicTime
🚀117MagicTime: Time-lapse Video Generation Models as Metamorphic
-
CustomNet
🐠45Customize objects in images with text prompts and viewpoints
-
PixArt Sigma
👁241Generate images from text prompts
-
Sd3 Api
😻47Generate images from text prompts
-
InstantMesh
📚1.56kCreate a 3D model from an image in 10 seconds!
-
Hyper SDXL 1Step T2I
🐠242Generate images from text prompts
-
IDM VTON
👕2.04kHigh-fidelity Virtual Try-on
-
IC Light
📈1.35kGenerate relit images with foreground condition
-
Phi-3 WebGPU
🚀289A private and powerful AI that runs locally in your browser
-
PaliGemma Demo
🤲314Annotate and describe images with text prompts
-
Yolov10
📉101Detect objects in images with YOLOv10
-
Open Sora Plan V1.1.0
⚡74 -
Chattts Zero
🐢339Generate audio from text with tuning options
-
ToonCrafter
😻1.03kGenerate a video from two images and text prompts
-
Bark
🐶2.37kGenerate realistic audio from text
-
MimicBrush
🐨126Transfers textures from a reference image to a masked region in a source image
-
SD3 ControlNet
⚡73Generate images using a reference image and text prompt
-
ChatTTS Speaker
🌍114Explore and download stable speaker embeddings for ChatTTS
-
SD3 Long Captioner
🏃260Generate detailed captions for images
-
MassivelyMultilingualTTS
🌍208Generate speech from text in multiple languages
-
Flash SD3
⚡197Generate images from text prompts
-
Florence 2
📉810Generate captions and analyze images with various tasks
-
ExVideo SVD 128f V1
🐨112Generate a video from an image
-
InternLM XComposer
🏢163Display a web page
-
Llm Pricing
📊273Display a React app with TypeScript
-
FoleyCrafter
📚131Generate audio for silent videos
-
PhotoMaker V2
📷1.21kGenerate customized realistic photos from face images
-
Live Portrait
🤪3.64kApply the motion of a video on a portrait
-
Diffree
🖼137 -
ViPer
😻13Generate personalized images based on comments
-
Stable Fast 3D
🎮1.13kGenerate a 3D mesh model from an image
-
Background Removal
🌘2.55kRemove background from images
-
LongWriter
💬162LLM for long context
-
Kolors Virtual Try-On
👕9.88kTry on clothes virtually by uploading images
-
CogVideoX-5B
🎥1.02kText-to-Video
-
Svd Keyframe Interpolation
🐨67Generate a video by interpolating between two images
-
OpenAudio S1
🏆660Generate speech from text
-
Finegrain Object Cutter
✂509Create HD cutouts from any image with just a prompt
-
GOT Online
💬359Extract text from images using various OCR modes
-
Dream Machine
🦀21 -
PDF2Audio
📚451Transform text into engaging podcast dialogues or detailed reports
-
Llama-Vision-11B
🚀391Generate text by uploading images and asking questions
-
Llama 3.2 90b Text Preview Groq
🌖8 -
Whisper Turbo
🤯991Transcribe audio or YouTube videos into text
-
Open NotebookLM
🎙1.1kPersonalised Podcasts For All - Available in 13 Languages
-
Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature
🚀69Generate a podcast from text, URLs, PDFs, and images
-
PMRF
🖼314A gradio demo for Posterior-Mean Rectified Flow (PMRF)
-
F5-TTS
🗣2.69kF5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
MaskGCT TTS Demo
😻260MaskGCT TTS Demo
-
Fish Agent
💬147An end-to-end (e2e) Voice Language Model by Fish Audio.
-
Qwen2.5 Coder Artifacts
🐢1.69kGenerate code for applications
-
SeedEdit-APP-V1.0
🎨449Generate and edit images using text instructions
-
OOTDiffusion
🥼1.1kHigh-quality virtual try-on ~ Your cyber fitting room
-
MagicQuill
🪶2.18kGenerate edited images using scribble inputs
-
OminiControl
🌍922Generate an edited image based on text and input image
-
IC Light V2-Vary
📈1.16kExecute custom code from environment variable
-
TryOffDiff
🔥60Extract garment images from everyday images!
-
Janus Pro 7b
🌍110A unified multimodal understanding and generation model.
-
Chat With Janus-Pro-7B
🌍2.01kA unified multimodal understanding and generation model.
-
Llasa 3b Tts
🔥311Zero Shot voice cloning with llasa 3b (Unofficial Demo)
-
Paligemma2 Mix
🌖95Generate text and segment images using PaliGemma 2
-
Gemini Co-Drawing
✏461Gemini 2.0 native image generation co-doodling
-
Di♪♪Rhythm
🎶656Blazingly Fast and Embarrassingly Simple Song Generation
-
InfiniteYou-FLUX
📸1.09kFlexible Photo Recrafting While Preserving Your Identity
-
Text2Human
🏃76Generate human images from text descriptions
-
GFPGAN
😁1.1kEnhance and restore old photos and AI-generated faces
-
LiveCC
🐠43LiveCC-7B-Instruct
-
ACE Step
😻598A Step Towards Music Generation Foundation Model
-
MedGemma - Radiology Explainer Demo
🩺209Radiology Image & Report Explainer Demo. Built with MedGemma
-
PlayDiffusion
🎨118Generate modified audio from text and voice
-
MiniMax M1
💬346Generate code from text prompts
-
NetaLumina T2I Playground
😻73Neta's latest text-to-image model
-
PaddleOCR-VL Online Demo
📈183Parse and recognize text in images
-
Tx1 Demo
🚀4Upload your anndata, get Tx1 embeddings in minutes