NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation Paper • 2312.11361 • Published Dec 18, 2023 • 1
On the importance of Data Scale in Pretraining Arabic Language Models Paper • 2401.07760 • Published Jan 15, 2024 • 1
Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models Paper • 2402.02244 • Published Feb 3, 2024 • 1
QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning Paper • 2402.10462 • Published Feb 16, 2024
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation Paper • 2203.09391 • Published Mar 17, 2022 • 1
SortedNet, a Place for Every Network and Every Network in its Place: Towards a Generalized Solution for Training Many-in-One Neural Networks Paper • 2309.00255 • Published Sep 1, 2023 • 1
Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding Paper • 2205.10687 • Published May 21, 2022
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation Paper • 2210.07558 • Published Oct 14, 2022 • 1
Towards Fine-tuning Pre-trained Language Models with Integer Forward and Backward Propagation Paper • 2209.09815 • Published Sep 20, 2022 • 1
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages Paper • 2210.09984 • Published Oct 18, 2022 • 2
CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search Paper • 2406.05013 • Published Jun 7, 2024
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems Paper • 2405.15110 • Published May 24, 2024
S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models Paper • 2407.01955 • Published Jul 2, 2024
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models Paper • 2409.14595 • Published Sep 22, 2024
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models Paper • 2305.14775 • Published May 24, 2023
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models Paper • 2503.05005 • Published Mar 6 • 1