- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
- Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction
- Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics
- Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling
- Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization