Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

📅 2025-04-28    ⚓ Hacker News    🌐 Source    🖼️ Load Image