imenelydiaker (Imene Kerboua)

upvoted an article about 1 month ago

Introducing RTEB: A New Standard for Retrieval Evaluation

Article • Oct 1 • 132

upvoted 2 papers about 2 months ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10 • 105

Value Drifts: Tracing Value Alignment During LLM Post-Training

Paper • 2510.26707 • Published Oct 30 • 12

upvoted 2 papers 2 months ago

HUME: Measuring the Human-Model Performance Gap in Text Embedding Task

Paper • 2510.10062 • Published Oct 11 • 8

FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents

Paper • 2510.03204 • Published Oct 3 • 6

upvoted 2 papers 6 months ago

Build the web for agents, not agents for the web

Paper • 2506.10953 • Published Jun 12 • 21

LineRetriever: Planning-Aware Observation Reduction for Web Agents

Paper • 2507.00210 • Published Jun 30 • 6

upvoted an article 8 months ago

MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before

Article • Apr 24 • 16

upvoted a paper 9 months ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14 • 20

upvoted a paper 10 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 43

upvoted an article 11 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 262

upvoted a paper 11 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 252

upvoted a paper over 1 year ago

Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents

Paper • 2406.04028 • Published Jun 6, 2024 • 2

upvoted 4 papers almost 2 years ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 152

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 50

GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer

Paper • 2311.08526 • Published Nov 14, 2023 • 12

Grandmaster-Level Chess Without Search

Paper • 2402.04494 • Published Feb 7, 2024 • 69
