Slad

Sladwell

AI & ML interests

None yet

Recent Activity

updated a collection 27 days ago

Organizations

None yet

upvoted 2 papers 25 days ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19 • 52

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19 • 45

upvoted a paper 27 days ago

When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance

Paper • 2509.22193 • Published Sep 26 • 37

upvoted a paper 28 days ago

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Paper • 2509.19894 • Published Sep 24 • 32

upvoted 2 papers about 1 month ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 132

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Paper • 2509.13313 • Published Sep 16 • 77

upvoted 4 articles about 1 month ago

Small Language Models (SLM): A Comprehensive Overview

Article • Feb 22 • 92

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Article • Aug 18 • 83

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 240

`LeRobotDataset`: Bringing large-scale datasets to lerobot

Article • Sep 16 • 44

upvoted 4 papers about 2 months ago

Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 49

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22 • 52

LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Paper • 2509.05263 • Published Sep 5 • 10

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5 • 45

upvoted 2 papers 2 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 126

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86