PLOD: An Abbreviation Detection Dataset for Scientific Documents Paper • 2204.12061 • Published Apr 26, 2022
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis Paper • 2406.08920 • Published Jun 13, 2024
Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content? Paper • 2410.06338 • Published Oct 8, 2024
PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation Paper • 2412.07754 • Published Dec 10, 2024
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages Paper • 2501.04473 • Published Jan 8, 2025
Towards a Robust Framework for Multimodal Hate Detection: A Study on Video vs. Image-based Content Paper • 2502.07138 • Published Feb 11, 2025
BESSTIE: A Benchmark for Sentiment and Sarcasm Classification for Varieties of English Paper • 2412.04726 • Published Dec 6, 2024
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music Paper • 2502.18309 • Published Feb 25, 2025
The Mind's Eye: A Multi-Faceted Reward Framework for Guiding Visual Metaphor Generation Paper • 2508.18569 • Published Aug 26, 2025
RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages Paper • 2412.15248 • Published Dec 14, 2024