- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
SAM 2: Segment Anything in Images and Videos
-
s1: Simple test-time scaling
-
RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training
-
Robust Layerwise Scaling Rules by Proper Weight Decay Tuning
-
RLHF: A comprehensive Survey for Cultural, Multimodal and Low Latency Alignment Methods