- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
-
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
-
Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
-
Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
-
Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI