The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
-
allenai/Olmo-3.1-32B-Think
Text Generation β’ 32B β’ Updated β’ 5.19k β’ β’ 67 -
allenai/Olmo-3.1-32B-Instruct-SFT
32B β’ Updated β’ 2.12k β’ 6 -
allenai/Olmo-3.1-32B-Instruct-DPO
Text Generation β’ 32B β’ Updated β’ 638 β’ 4 -
allenai/Olmo-3.1-32B-Instruct
Text Generation β’ 32B β’ Updated β’ 9.98k β’ β’ 49