- Yume-1.5: A Text-Controlled Interactive World Generation Model (arXiv:2512.22096, published 4 days ago)
- LLaDA2.0: Scaling Up Diffusion Language Models to 100B (arXiv:2512.15745, published 21 days ago)
- DeContext as Defense: Safe Image Editing in Diffusion Transformers (arXiv:2512.16625, published 12 days ago)
- IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning (arXiv:2512.15635, published 13 days ago)
- Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length (arXiv:2512.04677, published 26 days ago)
- WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models (arXiv:2510.22276, published Oct 25; companion WAON collection, 4 items, updated Oct 28)
- UltraGen: High-Resolution Video Generation with Hierarchical Attention (arXiv:2510.18775, published Oct 21)
- Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset (arXiv:2510.15742, published Oct 17)
- RAE: collection for Diffusion Transformers with Representation Autoencoders (1 item, updated Oct 14)
- Self-Forcing++: Towards Minute-Scale High-Quality Video Generation (arXiv:2510.02283, published Oct 2)
- SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer (arXiv:2509.24695, published Sep 29)
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations (arXiv:2509.09676, published Sep 11)