QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 174
QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models Paper • 2509.17428 • Published Sep 22 • 9
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering Paper • 2509.17396 • Published Sep 22 • 19
Interleaved Reasoning for Large Language Models via Reinforcement Learning Paper • 2505.19640 • Published May 26 • 14
KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction Paper • 2505.23416 • Published May 29 • 11
RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy Paper • 2412.01129 • Published Dec 2, 2024
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding Paper • 2506.15745 • Published Jun 18 • 13
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization Paper • 2311.05161 • Published Nov 9, 2023 • 1
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment Paper • 2407.03051 • Published Jul 3, 2024
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs Paper • 2410.01518 • Published Oct 2, 2024 • 4
A Controlled Study on Long Context Extension and Generalization in LLMs Paper • 2409.12181 • Published Sep 18, 2024 • 45
Characterizing Prompt Compression Methods for Long Context Inference Paper • 2407.08892 • Published Jul 11, 2024 • 11
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46