OpenVLA 0.5B (Prismatic)
OpenVLA 0.5B is a compact 0.5B-parameter model trained following the settings of the original OpenVLA.
It is initialized from a smaller Prismatic VLM checkpointβQwen2.5 0.5Bβfrom MiniVLA.
This Prismatic-compatible checkpoint is particularly useful if you wish to fully fine-tune OpenVLA using native PyTorch Fully Sharded Data Parallel (FSDP) with the Prismatic VLM training scripts.
We employ this pretrained model as the initialization for CronusVLA-0.5B, a multi-frame visual-language-action (VLA) model.
π§© Usage
Please refer to the official repositories for detailed usage instructions:
These repositories include guidelines for loading, fine-tuning, and evaluating this checkpoint.
π Citation
If you find this model useful, please cite our work:
@article{li2025cronusvla,
  title={CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation},
  author={Li, Hao and Yang, Shuai and Chen, Yilun and Tian, Yang and Yang, Xiaoda and Chen, Xinyi and Wang, Hanqing and Wang, Tai and Zhao, Feng and Lin, Dahua and others},
  journal={arXiv preprint arXiv:2506.19816},
  year={2025}
}
- Downloads last month
- 23