Active filters: RLHF
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO • Text Generation • 47B • Updated • 7.92k • 453
NousResearch/Nous-Hermes-2-Mistral-7B-DPO • Text Generation • 7B • Updated • 1.12k • 217
aaditya/Llama3-OpenBioLLM-8B • Text Generation • Updated • 4.23k • 227
NousResearch/Hermes-2-Pro-Llama-3-8B • Text Generation • 8B • Updated • 12.8k • 437
11B • Updated • 252 • 4
OpenAssistant/reward-model-deberta-v3-base • Text Classification • Updated • 390 • 13
OpenAssistant/reward-model-electra-large-discriminator • Text Classification • Updated • 17 • 5
OpenAssistant/reward-model-deberta-v3-large • Text Classification • Updated • 260 • 26
OpenAssistant/reward-model-deberta-v3-large-v2 • Text Classification • Updated • 15.8k • 244
Text Ranking • 0.4B • Updated • 1 • 3
nicholasKluge/RewardModelPT • Text Classification • 0.1B • Updated • 16
nicholasKluge/RewardModel • Text Classification • 0.1B • Updated • 88 • 1
fb700/chatglm-fitness-RLHF • Updated • 268
fb700/Bofan-chatglm-Best-lora • Updated • 5 • 11
kubernetes-bad/Ligma-L2-13b • Updated • 2 • 3
Text Generation • Updated • 167 • 205
berkeley-nest/Starling-LM-7B-alpha • Text Generation • 7B • Updated • 1.66k • 557
berkeley-nest/Starling-RM-7B-alpha • Updated • 15 • 103
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • Updated
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • Updated • 3 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • Updated • 3 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • Updated • 1 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • Updated • 1 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • 7B • Updated • 951 • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • 7B • Updated • 41 • 9
second-state/Starling-LM-7B-alpha-GGUF • Text Generation • 7B • Updated • 94 • 3
TheBloke/Starling-LM-7B-alpha-GPTQ • Text Generation • 7B • Updated • 19 • 10
bartowski/Starling-LM-7B-alpha-old-exl2 • Text Generation • Updated
tastypear/chatglm-fitness-RLHF-GGML
CallComply/Starling-LM-11B-alpha • Text Generation • 11B • Updated • 556 • 15