Hasan Arif

hasanar1f

AI & ML interests

Efficient training and inference

Recent Activity

liked 2 datasets about 2 months ago

OpenAssistant/oasst1

Viewer • Updated May 2, 2023 • 88.8k • 8.99k • 1.46k

allenai/WildChat-1M

Viewer • Updated Oct 17, 2024 • 838k • 12.1k • 404

upvoted 2 papers 3 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 177

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10, 2025 • 50

upvoted 2 papers 7 months ago

Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference

Paper • 2502.15294 • Published Feb 21, 2025 • 1

From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval

Paper • 2505.23059 • Published May 29, 2025 • 13

liked a dataset 9 months ago

anon8231489123/ShareGPT_Vicuna_unfiltered

Updated Apr 12, 2023 • 69.6k • 837

updated a collection 9 months ago

ML Optimization Papers

19 items • Updated Apr 4, 2025 • 1

upvoted a paper 9 months ago

Adaptive Layer-skipping in Pre-trained LLMs

Paper • 2503.23798 • Published Mar 31, 2025 • 5

updated a collection 9 months ago

Fundamentals

6 items • Updated Mar 29, 2025

upvoted a paper 9 months ago

Unified Multimodal Discrete Diffusion

Paper • 2503.20853 • Published Mar 26, 2025 • 9

updated 2 collections 10 months ago

Video LLMs

3 items • Updated Mar 27, 2025

Fundamentals

6 items • Updated Mar 29, 2025

upvoted a paper 10 months ago

M3: 3D-Spatial MultiModal Memory

Paper • 2503.16413 • Published Mar 20, 2025 • 15

updated a collection 10 months ago

ML Optimization Papers

19 items • Updated Apr 4, 2025 • 1

upvoted 2 papers 10 months ago

Multi Agent based Medical Assistant for Edge Devices

Paper • 2503.05397 • Published Mar 7, 2025 • 8

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 170

liked a model 10 months ago

kuleshov-group/bd3lm-owt-block_size16

Text Generation • 0.2B • Updated Apr 13, 2025 • 1.15k • 16