Gyanateet Dutta
AI & ML interests
Recent Activity
Organizations
-
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Paper โข 2307.14620 โข Published โข 15 -
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs
Paper โข 2306.05410 โข Published โข 4 -
ashawkey/nerf2mesh
Updated โข 14 - Build errorFeatured25
NeRF
๐ฎ25
-
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper โข 2312.11514 โข Published โข 264 -
3D-LFM: Lifting Foundation Model
Paper โข 2312.11894 โข Published โข 15 -
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper โข 2312.15166 โข Published โข 61 -
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper โข 2312.16862 โข Published โข 31
-
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks
Paper โข 2402.00892 โข Published โข 13 - Running on ZeroMCPFeatured293
MusicGen Streaming
๐ฅ293Generate music from text descriptions in real-time
- Runtime errorAgents145
Whisper JAX
๐145Transcribe or translate audio from microphone, file, or YouTube
-
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
Paper โข 2406.03344 โข Published โข 22
- Runtime errorAgentsFeatured1.16k
Stable Fast 3D
๐ฎ1.16kGenerate a 3D mesh model from an image
- Runtime errorAgentsFeatured184
Roblox 3D Assets Generator v1
๐ช184Create a 3D model from an image in 10 seconds!
- Running on ZeroAgentsFeatured148
LLaMA Mesh
๐148Create 3D mesh by chatting.
-
stabilityai/stable-point-aware-3d
Image-to-3D โข 2B โข Updated โข 1.02k โข 343
-
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper โข 2309.05793 โข Published โข 51 -
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Paper โข 2308.04079 โข Published โข 199 -
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image โข Updated โข 1.98M โข โข 7.65k -
Ryukijano/lora-trained-xl-kaggle-p100
Text-to-Image โข Updated โข 4 โข 1
-
Ryukijano/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning โข Updated -
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer
Reinforcement Learning โข Updated โข 3 -
Ryukijano/poca-SoccerTwos
Reinforcement Learning โข Updated โข 9 -
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Paper โข 2308.03526 โข Published โข 29
-
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
Paper โข 2311.12229 โข Published โข 25 - Running on ZeroAgentsFeatured1k
IP-Adapter-FaceID
๐ง1kGenerate AI images that blend your face with any prompt
-
Design2Code: How Far Are We From Automating Front-End Engineering?
Paper โข 2403.03163 โข Published โข 98
-
Unsupervised Universal Image Segmentation
Paper โข 2312.17243 โข Published โข 20 -
Denoising Vision Transformers
Paper โข 2401.02957 โข Published โข 31 -
timm/ViT-B-16-SigLIP
Zero-Shot Image Classification โข Updated โข 90.1k โข 37 - Runtime errorAgents19
Slimsam
๐19Small yet powerful mask generation application โก๏ธ
- Running on ZeroAgents68
MeshAnythingV2
๐68Generate artist-style 3D mesh from your input model
- Runtime errorAgents10
En3D
๐10 - Running on ZeroAgents55
MASt3R
๐55Generate 3D models from images
-
naver/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric
Image-to-3D โข 0.7B โข Updated โข 34.8k โข 18
-
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper โข 2309.05793 โข Published โข 51 -
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Paper โข 2308.04079 โข Published โข 199 -
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image โข Updated โข 1.98M โข โข 7.65k -
Ryukijano/lora-trained-xl-kaggle-p100
Text-to-Image โข Updated โข 4 โข 1
-
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
Paper โข 2307.14620 โข Published โข 15 -
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs
Paper โข 2306.05410 โข Published โข 4 -
ashawkey/nerf2mesh
Updated โข 14 - Build errorFeatured25
NeRF
๐ฎ25
-
Ryukijano/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning โข Updated -
Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer
Reinforcement Learning โข Updated โข 3 -
Ryukijano/poca-SoccerTwos
Reinforcement Learning โข Updated โข 9 -
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Paper โข 2308.03526 โข Published โข 29
-
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
Paper โข 2311.12229 โข Published โข 25 - Running on ZeroAgentsFeatured1k
IP-Adapter-FaceID
๐ง1kGenerate AI images that blend your face with any prompt
-
Design2Code: How Far Are We From Automating Front-End Engineering?
Paper โข 2403.03163 โข Published โข 98
-
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper โข 2312.11514 โข Published โข 264 -
3D-LFM: Lifting Foundation Model
Paper โข 2312.11894 โข Published โข 15 -
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper โข 2312.15166 โข Published โข 61 -
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper โข 2312.16862 โข Published โข 31
-
Unsupervised Universal Image Segmentation
Paper โข 2312.17243 โข Published โข 20 -
Denoising Vision Transformers
Paper โข 2401.02957 โข Published โข 31 -
timm/ViT-B-16-SigLIP
Zero-Shot Image Classification โข Updated โข 90.1k โข 37 - Runtime errorAgents19
Slimsam
๐19Small yet powerful mask generation application โก๏ธ
-
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks
Paper โข 2402.00892 โข Published โข 13 - Running on ZeroMCPFeatured293
MusicGen Streaming
๐ฅ293Generate music from text descriptions in real-time
- Runtime errorAgents145
Whisper JAX
๐145Transcribe or translate audio from microphone, file, or YouTube
-
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
Paper โข 2406.03344 โข Published โข 22
- Runtime errorAgentsFeatured1.16k
Stable Fast 3D
๐ฎ1.16kGenerate a 3D mesh model from an image
- Runtime errorAgentsFeatured184
Roblox 3D Assets Generator v1
๐ช184Create a 3D model from an image in 10 seconds!
- Running on ZeroAgentsFeatured148
LLaMA Mesh
๐148Create 3D mesh by chatting.
-
stabilityai/stable-point-aware-3d
Image-to-3D โข 2B โข Updated โข 1.02k โข 343
- Running on ZeroAgents68
MeshAnythingV2
๐68Generate artist-style 3D mesh from your input model
- Runtime errorAgents10
En3D
๐10 - Running on ZeroAgents55
MASt3R
๐55Generate 3D models from images
-
naver/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric
Image-to-3D โข 0.7B โข Updated โข 34.8k โข 18