Alpha-VLLM

company

https://github.com/Alpha-VLLM

Alpha-VLLM

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Dakerqi authored a paper about 18 hours ago

Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis

Dakerqi authored a paper about 18 hours ago

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark

Dakerqi authored a paper about 18 hours ago

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

View all activity

Papers

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

View all Papers

Dakerqi

authored 3 papers about 18 hours ago

Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis

Paper • 2510.15710 • Published Oct 17 • 6

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark

Paper • 2402.02242 • Published Feb 3, 2024

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Paper • 2512.19433 • Published 4 days ago • 3

XiN0919

updated a collection 2 days ago

Lumina-DiMOO Family

Collection

Open-Sourced Large Diffusion Language Model for Multi-Modal Generation and Understanding • 4 items • Updated 2 days ago • 5

XiN0919

updated a collection 3 days ago

Lumina-DiMOO Family

Collection

Open-Sourced Large Diffusion Language Model for Multi-Modal Generation and Understanding • 4 items • Updated 2 days ago • 5

Dakerqi

updated a collection 7 days ago

Lumina-Image 2.0

Collection

3 items • Updated 7 days ago

RuoyiDu

authored 2 papers 23 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 29 days ago • 212

Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

Paper • 2511.22677 • Published 29 days ago • 28

Cxxs

authored a paper 25 days ago

ImageBind-LLM: Multi-modality Instruction Tuning

Paper • 2309.03905 • Published Sep 7, 2023 • 17

zsLin

authored a paper 25 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 29 days ago • 212

Cxxs

authored a paper 25 days ago

Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

Paper • 2511.22677 • Published 29 days ago • 28

JackyZhuo

authored 4 papers about 1 month ago

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Paper • 2504.04903 • Published Apr 7

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Paper • 2510.05091 • Published Oct 6 • 19

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 54

PICABench: How Far Are We from Physically Realistic Image Editing?

Paper • 2510.17681 • Published Oct 20 • 62

Dakerqi

in Alpha-VLLM/Lumina-Image-2.0 about 2 months ago

Any controlnets for this model?

#18 opened about 2 months ago by

krigeta

xianbao

authored a paper about 2 months ago

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

Paper • 2510.17950 • Published Oct 20 • 7

stzhao

authored a paper about 2 months ago

TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning

Paper • 2511.01833 • Published Nov 3 • 15

AI & ML interests

Recent Activity

Papers

Team members 17

Alpha-VLLM's activity

Any controlnets for this model?