- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
-
Diffusion Language Models are Super Data Learners
-
Detecting Data Contamination in LLMs via In-Context Learning
-
Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
-
DELTA: Decoupling Long-Tailed Online Continual Learning