
S1: Simple Test-Time Scaling


s1: Simple test-time scaling. GitHub repository: simplescaling/s1.

A minimal recipe for test-time scaling and strong reasoning performance, matching o1-preview with just 1,000 examples and budget forcing.

To run training, use the script at train/sft.py. You can invoke it via one of the train/sft*sh scripts, which in turn can be launched via train/launch.sh if you are on a SLURM cluster (this requires editing the file for your cluster setup).

To compute statistics (average thinking tokens, etc.) for an evaluation run, use: python eval/compute_sample_stats.py path_to_samples_file.jsonl
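
To make the idea of per-run statistics concrete, here is a minimal Python sketch of averaging a thinking-token count over a JSONL samples file. It is not the repository's eval/compute_sample_stats.py, and the real samples schema is not described here: the field name `thinking_tokens` and both helper functions are hypothetical, chosen only for illustration.

```python
import json
from statistics import mean


def load_jsonl(lines):
    """Parse an iterable of JSONL lines into a list of dicts, skipping blank lines."""
    records = []
    for line in lines:
        line = line.strip()
        if line:
            records.append(json.loads(line))
    return records


def average_thinking_tokens(records, field="thinking_tokens"):
    """Average a numeric field over records that contain it.

    ASSUMPTION: "thinking_tokens" is a hypothetical field name; the actual
    samples file produced by an evaluation run may use a different schema.
    """
    values = [r[field] for r in records if field in r]
    return mean(values) if values else 0.0


if __name__ == "__main__":
    # Inline sample data stands in for a real samples file.
    sample_lines = [
        '{"thinking_tokens": 100}',
        "",
        '{"thinking_tokens": 300}',
    ]
    print(average_thinking_tokens(load_jsonl(sample_lines)))
```

For a real file, you could pass an open file handle to `load_jsonl`, since iterating a file yields its lines; the field name would need to be adjusted to whatever the samples actually contain.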
