sampling

Read news on sampling with our app.

Read more in the app

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Batched reward model inference and Best-of-N sampling