migtissera
/

HelixNet

Model card Files Files and versions

Migel Tissera commited on Nov 4, 2023

Commit

11fc037

·

1 Parent(s): b5f1ded

adding media folder

Files changed (1) hide show

README.md +19 -1

README.md CHANGED Viewed

@@ -18,6 +18,18 @@ HelixNet regenerates very pleasing and accurate responses, due to the entropy pr
 The actor network was trained with Supervised Fine-Tuning, on 250K very high-quality samples. It has 75K of Open-Orca's Chain-of-Thought data, and a mixture of Dolphin (GPT-4), SynthIA's Tree-of-Thought data.
 ## Phase 2: Critic
 To train the critic, the following process was followed:
@@ -127,4 +139,10 @@ while True:
     regenerator_response = generate_text(prompt_regenerator, model_regenerator, tokenizer_regenerator)
     print(f"REGENERATION: {regenerator_response}")
-```

 The actor network was trained with Supervised Fine-Tuning, on 250K very high-quality samples. It has 75K of Open-Orca's Chain-of-Thought data, and a mixture of Dolphin (GPT-4), SynthIA's Tree-of-Thought data.
+Here are the results for the Actor network on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+||||
+|:------:|:--------:|:-------:|
+|**Task**|**Metric**|**Value**|
+|*arc_challenge*|acc_norm|62.28|
+|*hellaswag*|acc_norm|83.22|
+|*mmlu*|acc_norm|63.10|
+|*truthfulqa_mc*|mc2|50.10|
+|**Total Average**|-|**0.64675**||
 ## Phase 2: Critic
 To train the critic, the following process was followed:
     regenerator_response = generate_text(prompt_regenerator, model_regenerator, tokenizer_regenerator)
     print(f"REGENERATION: {regenerator_response}")
+```
+![HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/media/sample-answer.png)
+![HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/media/sample-critique.png)
+![HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/media/sample-regeneration.png)