Rohanify commited on
Commit
1c458ef
·
verified ·
1 Parent(s): 500fa55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -17,13 +17,22 @@ tags:
17
  ---
18
 
19
  # 🚀 PyBlissa-Coder-50M
 
20
 
21
  A 50M-parameter Python code generation model trained from scratch on a single RTX 5080. Built as part of the **PRIME** lineup of small, locally-runnable AI systems.
22
 
23
  Despite its size, PyBlissa punches well above its weight on Python instruction-following tasks. Trained near-Chinchilla optimal (~13 tokens/parameter) for maximum capacity utilization.
24
 
 
 
 
 
 
 
25
  ![Loss Visualization](loss_visualization.png)
26
 
 
 
27
  ## Stats
28
 
29
  | | |
 
17
  ---
18
 
19
  # 🚀 PyBlissa-Coder-50M
20
+ # !! BENCHMARK ON HumaEval DATASET: 10.4% !!
21
 
22
  A 50M-parameter Python code generation model trained from scratch on a single RTX 5080. Built as part of the **PRIME** lineup of small, locally-runnable AI systems.
23
 
24
  Despite its size, PyBlissa punches well above its weight on Python instruction-following tasks. Trained near-Chinchilla optimal (~13 tokens/parameter) for maximum capacity utilization.
25
 
26
+ This model punched a solid 10.4% score in OpenAI's HumanEval dataset, with is an amazing number for this model's size!
27
+ However, this model technically *can* generate bad outputs. You'd need to tweak the temperature. But that's a rare case!
28
+
29
+ ![Benchmark](pyblissa_humaneval_benchmark.png)
30
+
31
+
32
  ![Loss Visualization](loss_visualization.png)
33
 
34
+
35
+
36
  ## Stats
37
 
38
  | | |