- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- Why Less is More (Sometimes): A Theory of Data Curation
- Who Said Neural Networks Aren't Linear?
- What's the next frontier for Data-centric AI? Data Savvy Agents
- What Limits Agentic Systems Efficiency?
- What is the objective of reasoning with reinforcement learning?
- Weight-sparse transformers have interpretable circuits
- WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research
- Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
- Voyager: An Open-Ended Embodied Agent with Large Language Models
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
- Virtual Width Networks
- Virtual Agent Economies
- VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
- VibeVoice Technical Report
- VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
- Valid Survey Simulations with Limited Human Data: The Roles of Prompting, Fine-Tuning, and Rectification
- Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
- Unifying Tree Search Algorithm and Reward Design for LLM Reasoning: A Survey
- Unifying Large Language Models and Knowledge Graphs: A Roadmap
- UNIFORM: Unifying Knowledge from Large-scale and Diverse Pre-trained Models
- Understanding the Role of Training Data in Test-Time Scaling
- Understanding Robustness of Model Editing in Code LLMs: An Empirical Study
- Understanding R1-Zero-Like Training: A Critical Perspective
- Uncovering Scaling Laws for Large Language Models via Inverse Problems
- UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
- TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
- Tree Training: Accelerating Agentic LLMs Training via Shared Prefix Reuse
- Tree Search for LLM Agent Reinforcement Learning
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models
- Transition Models: Rethinking the Generative Learning Objective
- Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
- Transformer Enhanced Relation Classification: A Comparative Analysis of Contextuality, Data Efficiency and Sequence Complexity
- Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning
- Train on Validation (ToV): Fast data selection with applications to fine-tuning
- Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations
- Towards Unbiased Calibration using Meta-Regularization
- Towards Flash Thinking via Decoupled Advantage Policy Optimization
- Towards a Unified View of Large Language Model Post-Training
- Towards a Science of Scaling Agent Systems
- TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
- Tongyi DeepResearch Technical Report
- Thought Communication in Multiagent Collaboration
- Thinking Augmented Pre-training
- Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
- Think Right: Learning to Mitigate Under-Over Thinking via Adaptive, Attentive Compression
- Think Outside the Policy: In-Context Steered Policy Optimization
- TheMCPCompany: Creating General-purpose Agents with Task-specific Tools
- The Universal Landscape of Human Reasoning
- The Rise and Potential of Large Language Model Based Agents: A Survey