Anshuman Suri

iamgroot42

https://anshumansuri.com/

AI & ML interests

Privacy, Distribution Inference, Membership Inference

Recent Activity

liked a model about 3 hours ago

kernels-community/vllm-flash-attn3

liked a model 4 days ago

princeton-nlp/QuRater-1.3B

upvoted a paper 5 days ago

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

upvoted a paper 5 days ago

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 22

upvoted a paper 18 days ago

Simple Projection Variants Improve ColBERT Performance

Paper • 2510.12327 • Published 20 days ago • 5

upvoted 2 collections 26 days ago

Chart-RVR

Collection

Models trained using GRPO for enhanced Chart Reasoning • 3 items • Updated Aug 24 • 1

Steering the CensorShip

Collection

3 items • Updated Sep 28 • 1

upvoted an article about 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11 • 161

upvoted a paper 3 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 65

upvoted a paper 4 months ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17 • 13

upvoted 2 articles 4 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11 • 82

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8 • 709

upvoted a paper 6 months ago

Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control

Paper • 2504.17130 • Published Apr 23 • 1

upvoted a paper 8 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

upvoted 2 papers 9 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 246

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 172

upvoted a paper about 1 year ago

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Paper • 2402.09391 • Published Feb 14, 2024 • 2
