Yiddish Whisper Training Collection Yiddish-based Whisper post-training - Crowd Sourced Open Data • 10 items • Updated 9 days ago • 2
Qwen 3 VL - CATMuS Collection A collection of finetunes of Qwen 3 VL. These models were finetuned on the CATMuS dataset via TRL SFT. • 3 items • Updated 9 days ago • 2
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published Nov 5, 2024 • 70
Whisper Zulu ASR Models Collection This is a collection of Whisper models for transcribing audio/video in the Zulu language. • 4 items • Updated Aug 20, 2024 • 1
TrOCR Medieval HTR Collection This is a collection of models trained to recognize medieval scripts. • 10 items • Updated Jul 8, 2024 • 5
Medieval NER Collection This is a collection of Medieval NER datasets and models. • 7 items • Updated Jul 4, 2024 • 2
Historic Newspaper Datasets Collection Historic Newspaper Datasets on the Hub • 16 items • Updated May 8 • 6