Tutorials

每日AI最新进展分享。

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

2 min read · March 29, 2026

2026
The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences

3 min read · March 29, 2026

2026
The Path Not Taken: RLVR Provably Learns Off the Principals

1 min read · March 29, 2026

2026
The Missing Layer of AGI: From Pattern Alchemy to Coordination Physics

1 min read · March 29, 2026

2026
The Llama 3 Herd of Models

7 min read · March 29, 2026

2026
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

7 min read · March 29, 2026

2026
The human biological advantage over AI

1 min read · March 29, 2026

2026
The FM Agent

3 min read · March 29, 2026

2026
The Era of Agentic Organization: Learning to Organize with Language Models

3 min read · March 29, 2026

2026
The Art of Scaling Reinforcement Learning Compute for LLMs

2 min read · March 29, 2026

2026
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

2 min read · March 29, 2026

2026
The 2025 Foundation Model Transparency Index

1 min read · March 29, 2026

2026
SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis

3 min read · March 29, 2026

2026
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

4 min read · March 29, 2026

2026
Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First

2 min read · March 29, 2026

2026
Supervised learning pays attention

2 min read · March 29, 2026

2026
Students' Voices on Generative AI: Perceptions, Benefits, and Challenges in Higher Education

3 min read · March 29, 2026

2026
Stronger Normalization-Free Transformers

2 min read · March 29, 2026

2026
Stream: Scaling up Mechanistic Interpretability to Long Context in LLMs via Sparse Attention

2 min read · March 29, 2026

2026
StarCoder: may the source be with you!

4 min read · March 29, 2026

2026
Staircase Streaming for Low-Latency Multi-Agent Inference

2 min read · March 29, 2026

2026
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

2 min read · March 29, 2026

2026
SSL4RL: Revisiting Self-supervised Learning as Intrinsic Reward for Visual-Language Reasoning

4 min read · March 29, 2026

2026
Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

3 min read · March 29, 2026

2026
SpecAttn: Speculating Sparse Attention

2 min read · March 29, 2026

2026
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot

5 min read · March 29, 2026

2026
Sparse Attention Post-Training for Mechanistic Interpretability

1 min read · March 29, 2026

2026
Spanish Pre-trained BERT Model and Evaluation Data

2 min read · March 29, 2026

2026
Soft Adaptive Policy Optimization

1 min read · March 29, 2026

2026
SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding

3 min read · March 29, 2026

2026
SkyRL-Agent: Efficient RL Training for Multi-turn LLM Agent

1 min read · March 29, 2026

2026
SimPO: Simple Preference Optimization with a Reference-Free Reward

3 min read · March 29, 2026

2026
Sigmoid Loss for Language Image Pre-Training

4 min read · March 29, 2026

2026
Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards

2 min read · March 29, 2026

2026
Short-Context Dominance: How Much Local Context Natural Language Actually Needs?

2 min read · March 29, 2026

2026
SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

2 min read · March 29, 2026

2026
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents

4 min read · March 29, 2026

2026
Sentence-Anchored Gist Compression for Long-Context LLMs

1 min read · March 29, 2026

2026
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

5 min read · March 29, 2026

2026
Seesaw: Accelerating Training by Balancing Learning Rate and Batch Size Scheduling

3 min read · March 29, 2026

2026
SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning

3 min read · March 29, 2026

2026
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers

3 min read · March 29, 2026

2026
Scaling Test-Time Compute to Achieve IOI Gold Medal with Open-Weight Models

1 min read · March 29, 2026

2026
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

2 min read · March 29, 2026

2026
Scaling Latent Reasoning via Looped Language Models

3 min read · March 29, 2026

2026
Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

1 min read · March 29, 2026

2026
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding

3 min read · March 29, 2026

2026
Scaling and context steer LLMs along the same computational path as the human brain

2 min read · March 29, 2026

2026
Sampling and Loss Weights in Multi-Domain Training

2 min read · March 29, 2026

2026
SAM 2: Segment Anything in Images and Videos

4 min read · March 29, 2026

2026