Update README.md
README.md
CHANGED
@@ -11,4 +11,11 @@ PontAvignon is a specialized reasoning model using a special tokens to encode in
Any excerpt of a show program has to be submitted between the tags '<|text_start|>' and '<|text_end|>'.
+After '<|text_end|>', the model will generate an answer in three steps:
+* Initial fuzzy "thinking" (between <|thinking_start|> and <|thinking_end|>):
+
+Some warnings apply:
+* Given the intensive reinforcement learning specialization, the model will mostly work on theater shows in French, preferably ones structured similarly to the original sources from the Festival d'Avignon. More diverse sources would be needed to make the model more source-agnostic.
+* To improve the accuracy of results, we trained the model on filtered versions of the original sources, each containing all the information about a single show. Multi-show submissions will result in the model either focusing on only one show or, more problematically, mixing information across shows.
+* Reasoning traces help increase the accuracy of the model and provide some level of explainability. They can still be counterintuitive and may contain wrong assumptions that are not necessarily kept in the final output.
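The tag convention described in this change can be illustrated with a minimal Python sketch. It only handles strings: it wraps a single show excerpt in the input tags and separates the "thinking" span from whatever the model generates afterwards. The helper names `build_prompt` and `split_thinking` are hypothetical and not part of the README. The sketch also assumes that the opening tag is spelled `<|text_start|>`, that no whitespace is required between tags and text, and that the thinking span appears at most once; none of these points is confirmed by the source.

```python
import re

# Tag strings taken from the README text; the exact spelling of the
# opening input tag is an assumption.
TEXT_START = "<|text_start|>"
TEXT_END = "<|text_end|>"
THINK_START = "<|thinking_start|>"
THINK_END = "<|thinking_end|>"


def build_prompt(excerpt: str) -> str:
    """Wrap ONE show excerpt in the input tags (hypothetical helper)."""
    return f"{TEXT_START}{excerpt.strip()}{TEXT_END}"


def split_thinking(generated: str) -> tuple[str, str]:
    """Split generated text into (thinking, remainder).

    Returns an empty thinking string when the tags are absent.
    """
    pattern = re.escape(THINK_START) + r"(.*?)" + re.escape(THINK_END)
    match = re.search(pattern, generated, flags=re.DOTALL)
    if match is None:
        return "", generated.strip()
    return match.group(1).strip(), generated[match.end():].strip()


def build_prompts(excerpts: list[str]) -> list[str]:
    """Build one prompt per show, since multi-show input is discouraged."""
    return [build_prompt(e) for e in excerpts]
```

Building one prompt per show follows the warning about multi-show submissions. How a festival program is split into per-show excerpts is left to the caller, since the README does not describe it.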