SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 202
Vision Language Models Quantization Collection Vision Language Models (VLMs) quantized by Neural Magic • 20 items • Updated Mar 4, 2025 • 6
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 10 days ago • 34
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually augmented version of Moshi v0.1 for conversing about image inputs. • 9 items • Updated 10 days ago • 23
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Mar 12, 2025 • 480