Kazuhisa Arakawa
kazuuu
AI & ML interests
None yet
Organizations
None yet
RL
-
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 49 -
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Paper • 2403.03950 • Published • 15 -
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Paper • 2403.02709 • Published • 9
Archtecture
-
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Paper • 2403.14520 • Published • 35 -
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Paper • 2403.15360 • Published • 13 -
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Paper • 2403.19888 • Published • 12
llm app
Vision and Language
RL
-
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 49 -
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Paper • 2403.03950 • Published • 15 -
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
Paper • 2403.02709 • Published • 9
Vision
Archtecture
-
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Paper • 2403.14520 • Published • 35 -
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Paper • 2403.15360 • Published • 13 -
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Paper • 2403.19888 • Published • 12