Commit 89edd7f (verified), committed by jnjj
1 Parent(s): 2d0f692

Update evaluation_stats.json via script

Files changed (1)
  1. evaluation_stats.json +12 -12
evaluation_stats.json CHANGED
@@ -1,6 +1,5 @@
  {
  "perplexity_history": [
- 291.82647705078125,
  291.81048583984375,
  291.7937927246094,
  291.7740173339844,
@@ -19,19 +18,20 @@
  291.5338439941406,
  291.50592041015625,
  291.4782409667969,
- 291.44378662109375
  ],
  "last_examples": {
- "Story Continuation": "We are lucky enough to miss the newest newborns\nThe best way to go about what is now available for the worldwide launches today!\nWe've been excited to announce the arrival of our newest Christmas tree plantations, which will be the first time ever since. I think we would like to thank them so much for joining us and",
- "Simple Instruction": "The world\u2019s most excited and amazing! We're not a big surprise when we see this project is going to be around us again!\nHere are the picture of the best part about how our lives have been.\nI started thinking about the newest people I am happy to share with you all ages!\nWe wanted to know what I want is my",
- "Creative Prompt": "\u2605\u2605\u2605\u2605\u2605\u2605\u2605\u2605\u2605\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606\u2606",
- "Question Answering (Basic)": "The questionnaires are a powerful way to build your website?\nThe best way to find a way to get rid of your life! This is a great way to try out new things. It\u2019s not only used for animals but also good luck in their lives. If you want to take advantage of this unique bird, you will need something like that.\nHere",
- "Code Generation (Simple Python)": "The most popular toolpiece for us are looking forward to seeing our newest product we've got to share our best results!\n|Reviews||102035849284 *||Dec 26, 2017||Jul 2016||The Ultimate Guide to the Best Price|\n|",
- "Reasoning (Simple)": "The only thing we\u2019ve seen since the end of this weekend. It was so funny and we have just finished us in the next few weeks. But when we were starting to miss out on the busiest days, but now that is why we weren't expecting the best part of our lives! We are working hard, because we can continue to move"
  },
- "last_update_time": "2025-05-08 02:51:03 UTC",
- "datasets_processed_count": 27,
- "texts_processed_count": 7060,
- "tokens_processed_count": 82944,
  "lighteval_results": {}
  }
 
  {
  "perplexity_history": [
  291.81048583984375,
  291.7937927246094,
  291.7740173339844,

  291.5338439941406,
  291.50592041015625,
  291.4782409667969,
+ 291.44378662109375,
+ 289.20306396484375
  ],
  "last_examples": {
+ "Story Continuation": "How do I get the best picture of what we have now?\nI am getting a lot of awesome photos that can be amazing! Thanks for sharing!!!\nMonday!!!!!!!!! Thanks for all!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!\nSorry this is not the time!!!!!!\nHi guys!!!!",
+ "Simple Instruction": "I have been trying to make progress and challenges? What about my biography?\nI wanted to be very creative! I would love to go back to work with myself, but not too much effort or lessons than others, but it helps me feel better when we try to find ourselves happy patients will get a little bit more information on what I want here.",
+ "Creative Prompt": "We are pleased to announce the launch of the first prototype projector is bringing us to Mars Earth.\nThe mission is to be launching our new spacecrafts, which will come up with the launch of Marscraftia SpaceXicon Spacecraft and asteroid orbital spacecraft.\nWe have a mission to make Earthcraft asteroids through NASA Earth Day\u2122",
+ "Question Answering (Basic)": "As an example we've been using the same pattern threaded threading patches of new threads that can be removed from thread threads. These thread threads were used to thread thread thread threads removed them onto thread threads or thread threads. It is threadbare threads thread threads thread thread thread threads for thread thread thread threads when thread thread threads thread thread threadbone thread thread threads thread",
+ "Code Generation (Simple Python)": "We are looking forward to seeing us in the future today! This is one of our favourite gameplaying gameplayers! We have been lucky enough to play this game when we come back to the point where we need to learn how to get started, but what happens after we gotta go.\nWe had a great day ahead of time. It was fun",
+ "Reasoning (Simple)": "The time of day we were trying to get our latest updates. It\u2019s been amazed because we are going to see all the way upcoming events happening in Europe but hopefully we know some new people have lost their lives!\nIn recent years as they go on to see what happens when we are doing.\nThe biggest event has been so far away from the start"
  },
+ "last_update_time": "2025-05-08 15:36:17 UTC",
+ "datasets_processed_count": 1,
+ "texts_processed_count": 6,
+ "tokens_processed_count": 3072,
  "lighteval_results": {}
  }
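
The change to "perplexity_history" is consistent with a fixed-length rolling window: the oldest value (291.82647705078125) is dropped, the newest (289.20306396484375) is appended, and the list stays at 20 entries. The script that produced this commit is not shown, so the following is only a minimal sketch of how such an update might look. The function name `update_stats` and the constant `HISTORY_LIMIT` are hypothetical, and the window size of 20 is inferred from the diff rather than confirmed.

```python
import json
from datetime import datetime, timezone

# Assumption: window size inferred from the 20 history entries in the diff.
HISTORY_LIMIT = 20


def update_stats(stats: dict, new_perplexity: float, limit: int = HISTORY_LIMIT) -> dict:
    """Return a copy of `stats` with `new_perplexity` appended to a capped history.

    Hypothetical helper; not taken from the repository's actual script.
    `limit` is expected to be a positive integer.
    """
    history = list(stats.get("perplexity_history", []))
    history.append(new_perplexity)

    updated = dict(stats)
    # Keep only the most recent `limit` values, dropping the oldest first.
    updated["perplexity_history"] = history[-limit:]
    # Same timestamp layout as the "last_update_time" field in the diff.
    updated["last_update_time"] = datetime.now(timezone.utc).strftime(
        "%Y-%m-%d %H:%M:%S UTC"
    )
    return updated


if __name__ == "__main__":
    old = {
        "perplexity_history": [291.82647705078125, 291.81048583984375],
        "lighteval_results": {},
    }
    # A window of 2 keeps the demo short; the oldest value is dropped.
    new = update_stats(old, 289.20306396484375, limit=2)
    print(json.dumps(new, indent=2))
```

Returning a new dictionary instead of mutating the input keeps the previous stats available for comparison before the file is rewritten.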