Read news on sampling with our app.
Read more in the app
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
Batched reward model inference and Best-of-N sampling