MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper • 2508.20453 • Published Aug 28
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model Paper • 2310.04445 • Published Oct 2, 2023
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks Paper • 2309.17002 • Published Sep 29, 2023
Exploring Domain-Specific Enhancements for a Neural Foley Synthesizer Paper • 2309.04641 • Published Sep 8, 2023
Improving Data Efficiency via Curating LLM-Driven Rating Systems Paper • 2410.10877 • Published Oct 9, 2024
Enhancing Retrieval for ESGLLM via ESG-CID -- A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS Paper • 2503.10674 • Published Mar 10
Deciphering GunType Hierarchy through Acoustic Analysis of Gunshot Recordings Paper • 2506.20609 • Published Jun 25
ProRefine: Inference-time Prompt Refinement with Textual Feedback Paper • 2506.05305 • Published Jun 5
Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection Paper • 2410.03904 • Published Oct 4, 2024
Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond Paper • 2408.11338 • Published Aug 21, 2024
Harnessing Business and Media Insights with Large Language Models Paper • 2406.06559 • Published Jun 2, 2024
Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms Paper • 2310.07161 • Published Oct 11, 2023