-
-
-
-
-
-
Inference Providers
Active filters:
Reward
Text Classification
•
2B
•
Updated
•
8
•
2
mradermacher/SmolTulu-1.7b-RM-GGUF
2B
•
Updated
•
167
mradermacher/SmolTulu-1.7b-RM-i1-GGUF
2B
•
Updated
•
229
Teen-Different/squiral_maze
Reinforcement Learning
•
Updated
Text Classification
•
Updated
•
62
•
8
Text Classification
•
Updated
•
50
•
1
Text Classification
•
Updated
•
86
•
25
Text Classification
•
Updated
•
62
•
5
wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel
Text Generation
•
8B
•
Updated
•
6
•
2
wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel
Text Generation
•
3B
•
Updated
•
23
mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-GGUF
3B
•
Updated
•
33
mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-i1-GGUF
3B
•
Updated
•
185
mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-GGUF
8B
•
Updated
•
56
•
1
mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-i1-GGUF
8B
•
Updated
•
193
•
1