Update evaluation_stats.json via script
Browse files- evaluation_stats.json +11 -11
evaluation_stats.json
CHANGED
|
@@ -1,6 +1,5 @@
|
|
| 1 |
{
|
| 2 |
"perplexity_history": [
|
| 3 |
-
291.7562255859375,
|
| 4 |
291.74176025390625,
|
| 5 |
291.7083740234375,
|
| 6 |
291.69140625,
|
|
@@ -19,19 +18,20 @@
|
|
| 19 |
289.20306396484375,
|
| 20 |
288.07843017578125,
|
| 21 |
287.02593994140625,
|
| 22 |
-
286.1083679199219
|
|
|
|
| 23 |
],
|
| 24 |
"last_examples": {
|
| 25 |
-
"Story Continuation": "How do I get the best picture of what we
|
| 26 |
"Simple Instruction": "I have been trying to make progress and challenges? What about my biography?\nI wanted to be very creative! I would love to go back to work with myself, but not too much effort or lessons than others, but it helps me feel better when we try to find ourselves happy and selfishness!\nWe are doing a good job here!!",
|
| 27 |
-
"Creative Prompt": "We are pleased to announce the launch of the first prototype projector is bringing us to
|
| 28 |
-
"Question Answering (Basic)": "As an example we've been using the same pattern threaded threading patches of new threads that can be removed from thread threads. These thread threads were used to thread thread thread threads removed them onto thread threads or thread threads. It is
|
| 29 |
-
"Code Generation (Simple Python)": "We are looking forward to seeing us in the future today
|
| 30 |
-
"Reasoning (Simple)": "
|
| 31 |
},
|
| 32 |
-
"last_update_time": "2025-05-08 15:
|
| 33 |
-
"datasets_processed_count":
|
| 34 |
-
"texts_processed_count":
|
| 35 |
-
"tokens_processed_count":
|
| 36 |
"lighteval_results": {}
|
| 37 |
}
|
|
|
|
| 1 |
{
|
| 2 |
"perplexity_history": [
|
|
|
|
| 3 |
291.74176025390625,
|
| 4 |
291.7083740234375,
|
| 5 |
291.69140625,
|
|
|
|
| 18 |
289.20306396484375,
|
| 19 |
288.07843017578125,
|
| 20 |
287.02593994140625,
|
| 21 |
+
286.1083679199219,
|
| 22 |
+
284.8231201171875
|
| 23 |
],
|
| 24 |
"last_examples": {
|
| 25 |
+
"Story Continuation": "How do I get the best picture of what we have now?\nI am getting a lot of awesome photos that can be amazing! Thanks for sharing!!!\nMonday!!!!!!!!! Thanks for all!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!\nSorry this is not the time!!!!!!\nHi guys!!!!",
|
| 26 |
"Simple Instruction": "I have been trying to make progress and challenges? What about my biography?\nI wanted to be very creative! I would love to go back to work with myself, but not too much effort or lessons than others, but it helps me feel better when we try to find ourselves happy and selfishness!\nWe are doing a good job here!!",
|
| 27 |
+
"Creative Prompt": "We are pleased to announce the launch of the first prototype projector is bringing us to life in the next few weeks.\nThe company has announced plans to launch its new design for the launching phase on Mars\u2019s journey through its newest production and growth potential future plans, but it will be soon for launch into orbit around 2015.\nS",
|
| 28 |
+
"Question Answering (Basic)": "As an example we've been using the same pattern threaded threading patches of new threads that can be removed from thread threads. These thread threads were used to thread thread thread threads removed them onto thread threads or thread threads. It is thread threads attached thread threads thread thread thread threads for thread thread thread threads when thread thread threads thread thread threadbone thread thread threads thread",
|
| 29 |
+
"Code Generation (Simple Python)": "We are looking forward to seeing us in the future today!\nWhat are your favorite images?\nI hope you\u2019ll see some of our favourite characters. Please note that this is the best way to get started. Thank you so much for your feedback!\nYou can check out our site regularly. Thanks!",
|
| 30 |
+
"Reasoning (Simple)": "This is a big task force to get ready for an amazingly awesome event!!! I just love this! We were really excited about this amazing trip! I\u2019m happy with my family and friends. I have been lucky enough to find new places where I go on tour!!\nI am not having fun on holiday day so I can relax as much yummy"
|
| 31 |
},
|
| 32 |
+
"last_update_time": "2025-05-08 15:42:00 UTC",
|
| 33 |
+
"datasets_processed_count": 5,
|
| 34 |
+
"texts_processed_count": 30,
|
| 35 |
+
"tokens_processed_count": 15360,
|
| 36 |
"lighteval_results": {}
|
| 37 |
}
|