Feb 20, 2026 · 45 min read Intelligence Cartography
Re-visiting Mid-training Stage: for & with Agentic RL Re-examining mid-training as the strategic centerpiece of the LLM pipeline — how it builds the knowledge foundation for agentic RL, and how RL signals are now flowing backward to improve mid-training itself
Read Post