- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
QLoRA: Efficient Finetuning of Quantized LLMs
-
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
-
QAgent: A modular Search Agent with Interactive Query Understanding
-
Putting on the Thinking Hats: A Survey on Chain of Thought Fine-tuning from the Perspective of Human Reasoning Mechanism
-
Prune4Web: DOM Tree Pruning Programming for Web Agent