- The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
- The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences
- The Path Not Taken: RLVR Provably Learns Off the Principals
- The Missing Layer of AGI: From Pattern Alchemy to Coordination Physics
- The Llama 3 Herd of Models
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
- The human biological advantage over AI
- The FM Agent
- The Era of Agentic Organization: Learning to Organize with Language Models
- The Art of Scaling Reinforcement Learning Compute for LLMs
- The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
- The 2025 Foundation Model Transparency Index
- SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis
- SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
- Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First
- Supervised learning pays attention
- Students' Voices on Generative AI: Perceptions, Benefits, and Challenges in Higher Education
- Stronger Normalization-Free Transformers
- Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention
- StarCoder: may the source be with you!
- Staircase Streaming for Low-Latency Multi-Agent Inference
- Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
- SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning
- Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval
- SpecAttn: Speculating Sparse Attention
- SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
- Sparse Attention Post-Training for Mechanistic Interpretability
- Spanish Pre-trained BERT Model and Evaluation Data
- Soft Adaptive Policy Optimization
- SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding
- SkyRL-Agent: Efficient RL Training for Multi-turn LLM Agent
- SimPO: Simple Preference Optimization with a Reference-Free Reward
- Sigmoid Loss for Language Image Pre-Training
- Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
- Short-Context Dominance: How Much Local Context Natural Language Actually Needs?
- SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs
- SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
- Sentence-Anchored Gist Compression for Long-Context LLMs
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
- Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling
- SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
- Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers
- Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- Scaling Latent Reasoning via Looped Language Models
- Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey
- Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding
- Scaling and context steer LLMs along the same computational path as the human brain
- Sampling and Loss Weights in Multi-Domain Training
- SAM 2: Segment Anything in Images and Videos