Parakeet TDT 0.6B V2 - OpenVINO

Discord GitHub Repo stars

OpenVINO-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model for high-performance automatic speech recognition on Intel NPUs and CPUs.

Benchmark Results

Hardware: Intel Core Ultra 7 155H (Meteor Lake) with Intel AI Boost NPU Dataset: LibriSpeech test-clean (2,620 files, 5.4 hours) Software: OpenVINO 2025.x

Metric Value
Average WER 2.87%
Median WER 0.00%
Average CER 1.07%
RTFx (NPU) 37.8×
RTFx (CPU) 5-8×
Total processing time 514.7s

Performance Comparison

Implementation Device RTFx
eddy (OpenVINO) Intel Core Ultra 7 155H NPU 37.8×
Parakeet (PyTorch) Intel Arc 140V GPU 19.8×
eddy (OpenVINO) Intel Core Ultra 7 155H CPU 5-8×

Note: Benchmarked on HP EliteBook Ultra G1i. eddy NPU is 1.9× faster than PyTorch on Intel Arc GPU, with lower power consumption.

Usage

Python usage via ctypes available - see eddy repository for details.

Model Details

  • Parameters: 600M
  • Architecture: FastConformer-RNNT (4-model pipeline)
  • Language: English only
  • Blank token ID: 1024
  • Context window: 10s chunks with 3s overlap
  • Features: LSTM state continuity, token deduplication, per-token timestamps

License

CC-BY-4.0 - See LICENSE for details.

Links

Acknowledgments

Based on NVIDIA's Parakeet TDT model. OpenVINO conversion and optimization by the FluidInference team.

Downloads last month
84
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for FluidInference/parakeet-tdt-0.6b-v2-ov

Finetuned
(15)
this model

Collection including FluidInference/parakeet-tdt-0.6b-v2-ov