YiRan

YiRan-KJ

AI & ML interests

None yet

Recent Activity

liked a model 6 months ago

sand-ai/MAGI-1

Organizations

None yet

upvoted 2 papers 11 days ago

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published 12 days ago • 61

TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model

Paper • 2510.16449 • Published 14 days ago • 34

upvoted a paper 6 months ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 158

upvoted 17 papers 8 months ago

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Paper • 2407.04693 • Published Jul 5, 2024 • 3

LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking

Paper • 2407.04020 • Published Jul 4, 2024 • 4

PartCraft: Crafting Creative Objects by Parts

Paper • 2407.04604 • Published Jul 5, 2024 • 6

Understanding Visual Feature Reliance through the Lens of Complexity

Paper • 2407.06076 • Published Jul 8, 2024 • 7

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7, 2024 • 10

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

Paper • 2407.06027 • Published Jul 8, 2024 • 11

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

Paper • 2407.06191 • Published Jul 8, 2024 • 14

InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct

Paper • 2407.05700 • Published Jul 8, 2024 • 14

Compositional Video Generation as Flow Equalization

Paper • 2407.06182 • Published Jun 10, 2024 • 14

UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

Paper • 2407.05282 • Published Jul 7, 2024 • 15

Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction

Paper • 2407.03651 • Published Jul 4, 2024 • 18

ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Paper • 2407.06135 • Published Jul 8, 2024 • 23

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Paper • 2407.03471 • Published Jul 3, 2024 • 31

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 36

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Paper • 2407.05975 • Published Jul 8, 2024 • 37

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 56