- Prompt Repetition Improves Non-Reasoning LLMs
- Prefill vs. Decode Bottlenecks: SRAM-Frequency Tradeoffs and the Memory-Bandwidth Ceiling
- Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs)
- Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
- On the Convergence Rate of LoRA Gradient Descent
- NVIDIA Nemotron 3: Efficient and Open Intelligence
- NRGPT: An Energy-based Alternative for GPT
- NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
- Nested Learning: The Illusion of Deep Learning Architectures
- MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models
- Monitoring Monitorability
- Monadic Context Engineering
- MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs
- Modular Prompt Optimization: Optimizing Structured Prompts with Section-Local Textual Gradients
- Modeling Language as a Sequence of Thoughts
- Mixture-of-Depths Attention
- MiMo-V2-Flash Technical Report
- mHC: Manifold-Constrained Hyper-Connections
- MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
- Mesh-Attention: A New Communication-Efficient Distributed Attention with Improved Data Locality
- MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory
- Memory in the Age of AI Agents
- Memorization Dynamics in Knowledge Distillation for Language Models
- Memoria: A Scalable Agentic Memory Framework for Personalized Conversational AI
- MemEvolve: Meta-Evolution of Agent Memory Systems
- Mechanisms of Introspective Awareness
- MAI-UI Technical Report: Real-World Centric Foundation GUI Agents
- LLM Router: Prefill is All You Need
- LLM-in-Sandbox Elicits General Agentic Intelligence
- LinMU: Multimodal Understanding Made Linear
- Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs
- Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
- Learning to Discover at Test Time
- Learning from Synthetic Data: Limitations of ERM
- Large language models are not about language
- Large language models and the entropy of English
- Kling-Omni Technical Report
- Kimi K2.5: Visual Agentic Intelligence
- Kascade: A Practical Sparse Attention Method for Long-Context LLM Inference
- Increasing the Thinking Budget is Not All You Need
- Improving Recursive Transformers with Mixture of LoRAs
- How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns
- Hindsight is 20/20: Building Agent Memory that Retains, Recalls, and Reflects
- HiFi-RAG: Hierarchical Content Filtering and Two-Pass Generation for Open-Domain RAG
- GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
- Geometric and Dynamic Scaling in Deep Transformers
- GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
- From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents
- Forgetful but Faithful: A Cognitive Memory Architecture and Benchmark for Privacy-Aware Generative Agents