- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- Transition Models: Rethinking the Generative Learning Objective
- Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
- Transformer Enhanced Relation Classification: A Comparative Analysis of Contextuality, Data Efficiency and Sequence Complexity
- Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning
- Train on Validation (ToV): Fast data selection with applications to fine-tuning