Parakeet TDT 0.6B V2 - OpenVINO

OpenVINO-optimized version of NVIDIA's Parakeet TDT 0.6B V2 model for high-performance automatic speech recognition on Intel NPUs and CPUs.

Benchmark Results

Hardware: Intel Core Ultra 7 155H (Meteor Lake) with Intel AI Boost NPU
Dataset: LibriSpeech test-clean (2,620 files, 5.4 hours)
Software: OpenVINO 2025.x

Metric	Value
Average WER	2.87%
Median WER	0.00%
Average CER	1.07%
RTFx (NPU)	37.8×
RTFx (CPU)	5-8×
Total processing time	514.7s
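
For readers who want to sanity-check the accuracy metrics above, here is a minimal sketch of how WER and CER are commonly computed from word-level and character-level edit distance. It is not the evaluation script used for this benchmark: the function names and the three sample sentences are invented for illustration, and text normalization (casing, punctuation) is left out. It also shows how a median WER of 0.00% can sit next to a non-zero average, since a median of zero only requires that at least half of the utterances are transcribed without error.

```python
import statistics


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Hypothetical sample data: two perfect transcripts and one with a single error.
references = ["the cat sat", "hello world", "a quick brown fox"]
hypotheses = ["the cat sat", "hello world", "a quick brown box"]

scores = [wer(r, h) for r, h in zip(references, hypotheses)]
print(f"mean WER:   {statistics.mean(scores):.2%}")
print(f"median WER: {statistics.median(scores):.2%}")
```

Whether the reported average is the mean of per-utterance scores or a corpus-level ratio is not stated here; the sketch uses per-utterance scores.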

Performance Comparison

Implementation	Device	RTFx
eddy (OpenVINO)	Intel Core Ultra 7 155H NPU	37.8×
Parakeet (PyTorch)	Intel Arc 140V GPU	19.8×
eddy (OpenVINO)	Intel Core Ultra 7 155H CPU	5-8×

Note: Benchmarked on an HP EliteBook Ultra G1i. The eddy NPU run is about 1.9× faster than PyTorch on the Intel Arc GPU (37.8× vs. 19.8× RTFx), with lower power consumption.
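
RTFx (inverse real-time factor) is audio duration divided by processing time, so the reported figures can be cross-checked with a few lines of arithmetic. The sketch below assumes that the 514.7 s total processing time refers to the NPU run, which is consistent with the 37.8× figure; the function and variable names are illustrative only.

```python
def rtfx(audio_seconds, processing_seconds):
    """Inverse real-time factor: seconds of audio processed per second of compute."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


# Figures taken from the benchmark tables above.
audio_seconds = 5.4 * 3600      # 5.4 hours of LibriSpeech test-clean
processing_seconds = 514.7      # assumed to be the NPU run

npu_rtfx = rtfx(audio_seconds, processing_seconds)
speedup = 37.8 / 19.8           # NPU RTFx relative to the PyTorch GPU RTFx

print(f"NPU RTFx: {npu_rtfx:.1f}x")
print(f"Speedup over PyTorch on GPU: {speedup:.1f}x")
```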

Usage

Python usage is available via ctypes; see the eddy repository for details.

Model Details

Parameters: 600M
Architecture: FastConformer-RNNT (4-model pipeline)
Language: English only
Blank token ID: 1024
Context window: 10s chunks with 3s overlap
Features: LSTM state continuity, token deduplication, per-token timestamps
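
To make the chunking and deduplication entries above concrete, the sketch below computes 10 s windows with a 3 s overlap (a 7 s hop) and merges per-chunk tokens by timestamp. It rests on several assumptions: a 16 kHz sample rate, a simple timestamp cutoff as the deduplication rule, and hypothetical function names and token tuples. The actual eddy implementation may differ in all of these.

```python
def chunk_windows(total_samples, sample_rate=16000, chunk_s=10.0, overlap_s=3.0):
    """Return (start, end) sample indices for overlapping chunks.

    sample_rate=16000 is an assumption, not a value taken from this model card.
    """
    if total_samples <= 0:
        return []
    chunk = int(chunk_s * sample_rate)
    hop = chunk - int(overlap_s * sample_rate)
    if hop <= 0:
        raise ValueError("overlap must be shorter than the chunk length")
    windows = []
    start = 0
    while True:
        end = min(start + chunk, total_samples)
        windows.append((start, end))
        if end >= total_samples:
            break
        start += hop
    return windows


def merge_chunk_tokens(chunks):
    """Merge per-chunk (token_id, absolute_time_s) lists.

    Illustrative rule only: tokens from a later chunk are dropped when their
    timestamp is not later than the last token already kept, so tokens decoded
    twice in the overlap region appear once.
    """
    merged = []
    for tokens in chunks:
        cutoff = merged[-1][1] if merged else float("-inf")
        merged.extend(t for t in tokens if t[1] > cutoff)
    return merged


# 25 s of audio at the assumed 16 kHz sample rate.
for start, end in chunk_windows(25 * 16000):
    print(f"{start / 16000:5.1f}s -> {end / 16000:5.1f}s")

# Hypothetical tokens: the second chunk re-decodes two tokens from the overlap.
first = [(5, 0.5), (9, 8.2), (12, 9.1)]
second = [(9, 8.2), (12, 9.1), (30, 11.0)]
print(merge_chunk_tokens([first, second]))
```

In practice, timestamps for the same token can shift slightly between chunks, so a real implementation would likely need a tolerance or a token-level comparison rather than a strict cutoff.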

License

CC-BY-4.0 - See LICENSE for details.

Acknowledgments

Based on NVIDIA's Parakeet TDT model. OpenVINO conversion and optimization by the FluidInference team.
