- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
- Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression
- Think Outside the Policy: In-Context Steered Policy Optimization
- TheMCPCompany: Creating General-purpose Agents with Task-specific Tools
- The Universal Landscape of Human Reasoning