llm training

Feb 20, 2026 · 45 min read · Intelligence Cartography

Re-visiting Mid-training Stage: for & with Agentic RL

Re-examining mid-training as the strategic centerpiece of the LLM pipeline: how it builds the knowledge foundation for agentic RL, and how RL signals now flow backward to improve mid-training itself


Feb 13, 2026 · 38 min read · Intelligence Cartography

Inside the Agentic RL Training Loop

A Step-by-Step Walkthrough Using Slime and SWE-Bench as an Example


Feb 8, 2026 · 43 min read · Intelligence Cartography

RL Infra for Large-Scale Agentic Training

From GPU Memory Budgets to Framework Architectures


Jan 15, 2026 · 4 min read · Intelligence Cartography

JustTinker: Minimal RLVR for Building Reasoning Models Under $150

Low-Resource RLVR for Transforming Instruct Models into Reasoning Models


Dec 18, 2025 · 46 min read · Intelligence Cartography

Beyond Attention: SSMs, Linear Attention & Hybrid Architectures

Efficient Architectures for Agentic Tasks
