6 21 24

Jihwan Kim

jjihwannn

https://jjihwan.github.io/

AI & ML interests

Computer Vision, Diffusion Models, Generative Models

Recent Activity

liked a model 18 days ago

OpenGVLab/InternVL3_5-8B-Flash

upvoted a paper 27 days ago

VideoNSA: Native Sparse Attention Scales Video Understanding

upvoted a paper 2 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

View all activity

Organizations

None yet

liked a model 18 days ago

OpenGVLab/InternVL3_5-8B-Flash

Image-Text-to-Text • 9B • Updated Sep 28 • 1.07k • 4

upvoted a paper 27 days ago

VideoNSA: Native Sparse Attention Scales Video Understanding

Paper • 2510.02295 • Published 29 days ago • 9

upvoted a paper 2 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 202

New activity in ShareGPT4Video/ShareGPT4Video 3 months ago

4.8M videos

#26 opened 3 months ago by

jjihwannn

upvoted a collection 3 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 544

liked 2 datasets 3 months ago

OpenGVLab/InternVideo2_Vid_Text

Viewer • Updated Jul 10, 2024 • 40.5M • 9 • 13

OpenGVLab/InternVid-Full

Viewer • Updated Jun 5, 2024 • 47.6M • 207 • 15

upvoted a paper 4 months ago

STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing

Paper • 2506.22868 • Published Jun 28 • 5

upvoted a paper 5 months ago

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Paper • 2505.15045 • Published May 21 • 54

upvoted a collection 5 months ago

Unofficial Mamba2 for Hf Transformers

Collection

Just the original weights converted to be compatible with transformers. • 5 items • Updated Oct 16, 2024 • 1

upvoted a paper 6 months ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 85

upvoted 2 papers 7 months ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 76

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

liked a model 7 months ago

LGAI-EXAONE/EXAONE-Deep-32B

Text Generation • 32B • Updated Mar 19 • 708 • 297

upvoted 4 papers 9 months ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4 • 66

Weak-to-Strong Diffusion with Reflection

Paper • 2502.00473 • Published Feb 1 • 23

Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach

Paper • 2502.03639 • Published Feb 5 • 9

Scaling Embedding Layers in Language Models

Paper • 2502.01637 • Published Feb 3 • 24

updated a dataset about 1 year ago

jjihwannn/Calvin-ABCD-Gen

Updated Oct 11, 2024 • 1

upvoted a paper about 1 year ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179

Jihwan Kim

AI & ML interests

Recent Activity

Organizations

jjihwannn's activity

4.8M videos