autoevaluator HF Staff commited on
Commit
554bdf6
1 Parent(s): 74daef7

Add evaluation results on the default config and train split of social_i_qa

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 馃憢!\
Your model has been evaluated on the default config and train split of the [social_i_qa](https://huggingface.co/datasets/social_i_qa) dataset by @kingmbc , using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-social_i_qa-default-cabb3b-37040145034).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=social_i_qa).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=social_i_qa).

Files changed (1) hide show
  1. README.md +25 -1
README.md CHANGED
@@ -6,7 +6,31 @@ datasets:
6
  - squad
7
  model-index:
8
  - name: roberta_test
9
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
6
  - squad
7
  model-index:
8
  - name: roberta_test
9
+ results:
10
+ - task:
11
+ type: question-answering
12
+ name: Question Answering
13
+ dataset:
14
+ name: social_i_qa
15
+ type: social_i_qa
16
+ config: default
17
+ split: train
18
+ metrics:
19
+ - type: f1
20
+ value: 6.806
21
+ name: F1
22
+ verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMmZlNTYzZTUxNTJjMjMwZmNhNWY4NzA1MGYwYmQzYTA3Y2ZlMGJlZmI2ZTMzZTRmMTM0OTMwYjE2OTlhYTliMSIsInZlcnNpb24iOjF9.7YOULka9SVa2rZJmQSnW_wVSKmC7_fVWRwD6wZcQVV6-iPTq2r_t05ro33DKkSaqgarNN7kI5HY0I64ESmsuCQ
24
+ - type: exact_match
25
+ value: 0.1515
26
+ name: Exact Match
27
+ verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZmE4NTM4M2RhYWRhZTc2ZWQ5NjQ0OTJmZjEyZGZlNTdlMGY3YWZlMTBjOWE5NjU3YmFhMDBiZDc2ZWNkM2M2ZCIsInZlcnNpb24iOjF9.rsw_dOa8qSKtNtZ7ojqjvUB0S0Kxocdilgm6kew9bxPVX193eoKxMK42UzkdyBFA4wRBXsY8km6uPgSycoMGDw
29
+ - type: loss
30
+ value: 5.458096981048584
31
+ name: loss
32
+ verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMmQzMjAwZGRjOGU0YmFhNDhiMTEwNjcwZDUwYWJlZDU2MmZlZDI4ODg4MTQ3Y2JkMDAwZWE5ODJjYzdlYjg2ZCIsInZlcnNpb24iOjF9.Z4jNTgWkE6Uw_-oeSx6leN1NPfRWCRmHE6bEbSsnABk8d76PjtoVHDaeFrCcB-Di-3GTOlHXk84JnjOTA0_pCA
34
  ---
35
 
36
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You