linyiqi

linyq

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published 4 days ago • 25

upvoted a paper 4 days ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published 10 days ago • 49

upvoted a paper 24 days ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published 25 days ago • 101

upvoted a paper about 2 months ago

Code2Video: A Code-centric Paradigm for Educational Video Generation

Paper • 2510.01174 • Published Oct 1 • 33

upvoted a paper 4 months ago

Multi-human Interactive Talking Dataset

Paper • 2508.03050 • Published Aug 5 • 9

upvoted a paper 7 months ago

LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper • 2504.16030 • Published Apr 22 • 36

upvoted a paper 8 months ago

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

upvoted 4 papers 9 months ago

Impossible Videos

Paper • 2503.14378 • Published Mar 18 • 61

Automated Movie Generation via Multi-Agent CoT Planning

Paper • 2503.07314 • Published Mar 10 • 44

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 89

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published Mar 3 • 44

upvoted 2 papers 10 months ago

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published Feb 12 • 28

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11 • 46

liked 2 datasets 10 months ago

CSU-JPG/TextAtlasEval

Viewer • Updated Aug 22 • 4k • 125 • 9

CSU-JPG/TextAtlas5M

Viewer • Updated Oct 14 • 5.4M • 2.18k • 34

upvoted 2 papers 10 months ago

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7 • 24

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

Paper • 2502.01572 • Published Feb 3 • 21

upvoted a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 27

updated a model 12 months ago

linyq/test_livecommand3

Feature Extraction • 8B • Updated Dec 10, 2024 • 2

upvoted a paper about 1 year ago

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 87
