- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training
- RewardDance: Reward Scaling in Visual Generation
- Reusing Pre-Training Data at Test Time is a Compute Multiplier
- Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation
- Retrieval-Augmented Generation for Large Language Models: A Survey