-
-
-
-
-
-
Inference Providers
Active filters:
ppo
jvelja/gemma-2-2b-it_imdb_seeded_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2bit_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_1
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2bit_1
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2bit_2
Reinforcement Learning
•
Updated
jvelja/ppo-gemma-2-2b-it-unseeded_1
Reinforcement Learning
•
Updated
jvelja/ppo-gemma-2-2b-it-unseeded_2
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2bit_3
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it_imdb_2bit_4
Reinforcement Learning
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
•
1
Reinforcement Learning
•
0.1B
•
Updated
•
1
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
•
1
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
Reinforcement Learning
•
0.1B
•
Updated
•
1
Reinforcement Learning
•
0.1B
•
Updated
•
1
Reinforcement Learning
•
0.1B
•
Updated
jvelja/gemma-2-2b-it_imdb_probits_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it-seed-1_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it-paraphrase_0
Reinforcement Learning
•
Updated
jvelja/gemma-2-2b-it-seed-1_2bit_seed1_0
Reinforcement Learning
•
Updated