Add evaluation results on conll2003 dataset
Beep boop, I am a bot from Hugging Face's automatic model evaluator π!
Your model has been evaluated on the conll2003 dataset by
@douwekiela
, using the predictions stored here.
Accept this pull request to see the results displayed on the Hub leaderboard.
Evaluate your model on more datasets here.
@douwekiela
@lewtun
. QQ: Why is the f1 so much higher than on when i evaluated?
Good question :) This job was run on the validation split, so maybe the results you have were reported on the test split instead?
on the model card there are results for validation and test
could you please share the evaluation function you used @philschmid ? :)
thats the script i used for training evaluation: https://github.com/philschmid/distilroberta-token-classification/blob/master/src/training/train.py
π