- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
-
Artificial Hippocampus Networks for Efficient Long-Context Modeling
-
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
-
Are Large Language Models Sensitive to the Motives Behind Communication?
-
Are Agents Just Automata? On the Formal Equivalence Between Agentic AI and the Chomsky Hierarchy