evamxb's picture
Update README.md
1b82f56 verified
---
library_name: transformers
license: mit
datasets:
- evamxb/soft-cite-intent
language:
- en
pipeline_tag: text-classification
---
# Software Citation Intent Classifier
The Software Citation Intent Classifier (`soft-cite-intent-cls`) can be used to predict the citation or "reference" intent
behind a text reference or citation to a piece of software in an academic article.
Possible values include:
- `used`
- `mentioned`
- `created`
- `other`
For example, given the sentence: "The XYZ code and software, with an example input dataset and detailed instructions are available from GitHub (https://github.com/user/repo)"
should be predicted as "created" as the authors are directly referencing their own created code.
_The specific name of the code and software and username and repository have been removed for privacy._
In comparison, the sentence: "For the statistical analyses of the data in this study, the Statistical Package for the Social Sciences (SPSS) version 22 (IBM Corp, Armonk, New York) was used"
should be predicted as "used" as the authors are directly informing the reader that this is not their own software but rather software they used for analysis.
This was originally created during the [CZI Software Impact Hackathon](https://github.com/chanzuckerberg/software-impact-hackathon-2023) by Ana-Maria Istrate, Joshua Fisher, Xinyu Yang, Kara Moraw, Kai Li, Donghui Li, and Martin Klein.
Their original work can be found in the [SoftwareCitationIntent Repository](https://github.com/karacolada/SoftwareImpactHackathon2023_SoftwareCitationIntent).
Eva Maxfield Brown recreated and uploaded this version of the model to Huggingface Hub for her own work, her scripts for recreating this model can be found in the [grobid-soft-proc repository](https://github.com/evamaxfield/grobid-soft-proc).
## Model Details
### Model Description
<!-- Provide a longer summary of what this model is. -->
- **Developed by:** Originally created by Ana-Maria Istrate, Joshua Fisher, Xinyu Yang, Kara Moraw, Kai Li, Donghui Li, and Martin Klein.
- **Made Available by** Eva Maxfield Brown
- **Language(s) (NLP):** en (English)
- **License:** MIT
- **Finetuned from model [optional]:** [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base)
### Model Sources [optional]
<!-- Provide the basic links for the model. -->
- **Repository:** [Original Repository](https://github.com/karacolada/SoftwareImpactHackathon2023_SoftwareCitationIntent), [Distribution Repository](https://github.com/evamaxfield/grobid-soft-proc/tree/main/training)
- **Paper [optional]:** [Scientific Software Citation Intent Classification using Large Language Models](https://github.com/NFDI4DS/nslp2024/blob/main/accepted_papers/NSLP_2024_paper_20.pdf)
## Training Details
### Training Data
- **HuggingFace Dataset:** [soft-cite-intent](https://huggingface.co/datasets/evamxb/soft-cite-intent)
- **CSV from Repo:** [soft-cite-intent](https://github.com/evamaxfield/grobid-soft-proc/blob/main/training/sci-impact-hack-soft-cite-intent.csv)
### Training Procedure
- **Training Script:** [train-and-upload-best](https://github.com/evamaxfield/grobid-soft-proc/blob/main/training/train-and-upload-best.py)
### Results
- Accuracy: 0.916
- Precision: 0.916
- Recall: 0.916
- F1: 0.916
## Citation [optional]
See their paper [Scientific Software Citation Intent Classification using Large Language Models](https://github.com/NFDI4DS/nslp2024/blob/main/accepted_papers/NSLP_2024_paper_20.pdf) at NSLP2024.