Jaward Sesay

Jaward

AI & ML interests

Building Lectūra Labs | CS Grad Student @BIT | AI/ML Research: Autonomous Agents, LLMs | Building The Cursor for Learning | Role Model Karpathy

Recent Activity

updated a model 21 days ago

Jaward/afri-aya-vision-8b-test

updated a model 21 days ago

Jaward/afri-aya-vision-krio-8b

liked a model 22 days ago

Jaward/afri-aya-vision-8b-test

View all activity

Organizations

upvoted 2 papers about 1 month ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23 • 67

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10 • 56

upvoted 5 papers 2 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 154

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 94

DINOv3

Paper • 2508.10104 • Published Aug 13 • 274

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published Aug 14 • 28

upvoted 2 papers 3 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 186

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 306

upvoted 5 papers 4 months ago

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 121

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning

Paper • 2506.16012 • Published Jun 19 • 22

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 63

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 26

upvoted a paper 5 months ago

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9 • 50

upvoted an article 5 months ago

Article

KV Cache from scratch in nanoVLM

Jun 4

• 98

upvoted 3 papers 5 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 107

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 305

upvoted a paper 6 months ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 85

Jaward Sesay

AI & ML interests

Recent Activity

Organizations

Jaward's activity

KV Cache from scratch in nanoVLM