6 171 192

Inui

Norm

https://normxu.github.io/

AI & ML interests

Video Diffusion; Large Language Model; Object Detection; OCR

Recent Activity

liked a dataset 3 days ago

wsdwJohn1231/DreamLIP_capion_csv_w_key

liked a dataset 3 days ago

Jyuhamdik/RealSyn15M

upvoted a paper about 2 months ago

Revisiting Multimodal Positional Encoding in Vision-Language Models

View all activity

Organizations

liked 2 datasets 3 days ago

wsdwJohn1231/DreamLIP_capion_csv_w_key

Viewer • Updated about 1 month ago • 13M • 23 • 1

Jyuhamdik/RealSyn15M

Viewer • Updated 14 days ago • 15.2M • 77 • 1

upvoted 2 papers about 2 months ago

Revisiting Multimodal Positional Encoding in Vision-Language Models

Paper • 2510.23095 • Published Oct 27, 2025 • 20

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published Oct 31, 2025 • 22

liked a model about 2 months ago

meituan-longcat/LongCat-Flash-Omni

Any-to-Any • 561B • Updated Nov 11, 2025 • 42 • 102

upvoted a paper 3 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 500

liked a model 3 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 776k • 1.17k

liked a model 4 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 21.8k • 513

upvoted a paper 4 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 138

liked a model 4 months ago

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Sep 1, 2025 • 623k • 2.12k

upvoted a paper 4 months ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12, 2025 • 37

liked a model 4 months ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 46k • • 2.24k

upvoted 3 papers 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published Jul 25, 2025 • 28

Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding

Paper • 2507.19427 • Published Jul 25, 2025 • 18

upvoted 2 papers 6 months ago

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8, 2025 • 113

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

upvoted 3 papers 7 months ago

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16, 2025 • 26

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Inui

AI & ML interests

Recent Activity

Organizations

Norm's activity