-
-
-
-
-
-
Inference Providers
Active filters:
rl
ContextualAI/archangel_sft-ppo_llama13b
Text Generation
•
13B
•
Updated
•
4
ContextualAI/archangel_sft-ppo_llama30b
Text Generation
•
33B
•
Updated
•
5
ContextualAI/archangel_sft-csft_pythia1-4b
Text Generation
•
1B
•
Updated
•
3
ContextualAI/archangel_sft-slic_pythia1-4b
Text Generation
•
1B
•
Updated
•
4
ContextualAI/archangel_csft_pythia1-4b
Text Generation
•
1B
•
Updated
•
4
ContextualAI/archangel_sft-csft_pythia2-8b
Text Generation
•
3B
•
Updated
•
4
ContextualAI/archangel_sft-slic_pythia2-8b
Text Generation
•
3B
•
Updated
•
6
ContextualAI/archangel_csft_pythia2-8b
Text Generation
•
3B
•
Updated
•
5
•
2
ContextualAI/archangel_sft-csft_pythia6-9b
Text Generation
•
7B
•
Updated
•
3
ContextualAI/archangel_sft-slic_pythia6-9b
Text Generation
•
7B
•
Updated
•
3
ContextualAI/archangel_csft_pythia6-9b
Text Generation
•
7B
•
Updated
•
3
ContextualAI/archangel_sft-csft_pythia12-0b
Text Generation
•
12B
•
Updated
•
3
ContextualAI/archangel_sft-slic_pythia12-0b
Text Generation
•
12B
•
Updated
•
3
ContextualAI/archangel_csft_pythia12-0b
Text Generation
•
12B
•
Updated
•
6
ContextualAI/archangel_sft-csft_llama7b
Text Generation
•
7B
•
Updated
•
4
ContextualAI/archangel_sft-slic_llama7b
Text Generation
•
7B
•
Updated
•
2
ContextualAI/archangel_csft_llama7b
Text Generation
•
7B
•
Updated
•
3
ContextualAI/archangel_sft-csft_llama13b
Text Generation
•
13B
•
Updated
•
4
ContextualAI/archangel_sft-slic_llama13b
Text Generation
•
13B
•
Updated
•
4
ContextualAI/archangel_csft_llama13b
Text Generation
•
13B
•
Updated
•
3
ContextualAI/archangel_sft-csft_llama30b
Text Generation
•
33B
•
Updated
•
3
ContextualAI/archangel_csft_llama30b
Text Generation
•
33B
•
Updated
•
3
Text Generation
•
3B
•
Updated
•
2
•
1
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
7B
•
Updated
•
69
•
32
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF
7B
•
Updated
•
339
•
2
mradermacher/archangel_sft-kto_llama30b-GGUF
33B
•
Updated
•
116
•
1
mradermacher/archangel_sft-kto_llama30b-i1-GGUF
33B
•
Updated
•
37
lithiumice/motion_imitation
Updated
tristan-deep/dqn-needle-tracker
Reinforcement Learning
•
Updated
•
1