- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Prompts Generalize with Low Data: Non-vacuous Generalization Bounds for Optimizing Prompts with More Informative Priors
-
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
-
Process-Supervised Reinforcement Learning for Interactive Multimodal Tool-Use Agents
-
Predicting Task Performance with Context-aware Scaling Laws
-
Pre-training under infinite compute