6 1

Mehdi Rezagholizadeh

mrgzadeh

AI & ML interests

Natural Language Processing

Recent Activity

new activity about 1 month ago

amd/Zebra-Llama-1B-8MLA-8Mamba-DPO:Please use safetensors format

updated a model about 1 month ago

amd/Zebra-Llama-8B-8MLA-24Mamba-SFT

updated a model about 1 month ago

amd/X-EcoMLA-3B3B-dynamic-0.95-DPO

View all activity

Organizations

authored 20 papers about 1 month ago

Evaluating Embedding APIs for Information Retrieval

Paper • 2305.06300 • Published May 10, 2023 • 1

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation

Paper • 2312.11361 • Published Dec 18, 2023 • 1

On the importance of Data Scale in Pretraining Arabic Language Models

Paper • 2401.07760 • Published Jan 15, 2024 • 1

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

Paper • 2402.02244 • Published Feb 3, 2024 • 1

JABER and SABER: Junior and Senior Arabic BERt

Paper • 2112.04329 • Published Dec 8, 2021

QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning

Paper • 2402.10462 • Published Feb 16, 2024

When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation

Paper • 2203.09391 • Published Mar 17, 2022 • 1

SortedNet, a Place for Every Network and Every Network in its Place: Towards a Generalized Solution for Training Many-in-One Neural Networks

Paper • 2309.00255 • Published Sep 1, 2023 • 1

Dynamic Position Encoding for Transformers

Paper • 2204.08142 • Published Apr 18, 2022 • 1

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding

Paper • 2205.10687 • Published May 21, 2022

DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation

Paper • 2210.07558 • Published Oct 14, 2022 • 1

Towards Fine-tuning Pre-trained Language Models with Integer Forward and Backward Propagation

Paper • 2209.09815 • Published Sep 20, 2022 • 1

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages

Paper • 2210.09984 • Published Oct 18, 2022 • 2

CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search

Paper • 2406.05013 • Published Jun 7, 2024

CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems

Paper • 2405.15110 • Published May 24, 2024

S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models

Paper • 2407.01955 • Published Jul 2, 2024

EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models

Paper • 2409.14595 • Published Sep 22, 2024

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

Paper • 2305.14775 • Published May 24, 2023

ReGLA: Refining Gated Linear Attention

Paper • 2502.01578 • Published Feb 3

Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models

Paper • 2503.05005 • Published Mar 6 • 1

Mehdi Rezagholizadeh

AI & ML interests

Recent Activity

Organizations

mrgzadeh's activity