T5-REF-CORRUPT-EN / README.md
elizaveta-dev's picture
Update README.md
386d840 verified
---
---
pipeline_tag: text-generation
language: en
library_name: transformers
tags:
- t5
- grammar-correction
- text-generation
---
# T5-REF-CORRUPT-EN: Automatic Error Correction of Academic Referencing According to Institutional Guidelines of the Center for Translation Studies (CTS) of University of Vienna
**Objective:** This model corrects errors in academic referencing. For example:
*Input (wrong sentence)*: According to Smith **&** Peterson **2016 56**, the translation reveals patterns that suggest underlying semantic shifts
*Output (clean sentence)*: According to Smith **and** Peterson **(2016: 56)**, the translation reveals patterns that suggest underlying semantic shifts.
**Model Details:**
- **Model name:** T5-REF-CORRUPT-EN
- **Base model:** T5-base
- **Language:** English
- **Training data:** Synthetically generated using LLMs and synthetically corrupted real student sentences.
**Usage Cases:** Error correction of academic references according to CTS guidelines.
## Example
```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
model_id = "elizaveta-dev/T5-REF-CORRUPT-EN"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
text = "According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts."
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
## Use-Cases
The model can perform automatic corrections of various referencing errors, including:
### 1. Incorrect Citation Type (Parenthetical vs. Narrative)
*Example of mistake:* (Lopez 2018; Chen 2012) found that cultural context strongly influences translation strategies.
*Example of correction:* Lopez (2018) and Chen (2012) found that cultural context strongly influences translation strategies.
*Example of mistake:* This topic has been widely researched Baker (2006).
*Example of correction:* This topic has been widely researched (Baker 2006).
---
### 2. Incorrect Citation for Two Authors
*Example of mistake:* The concept of functional equivalence was analyzed by Baker & Green (2007).
*Example of correction:* The concept of functional equivalence was analyzed by Baker and Green (2007).
*Example of mistake:* Previous research (Müller, Schmidt 2001) highlights challenges in literary translation.
*Example of correction:* Previous research (Müller & Schmidt 2001) highlights challenges in literary translation.
---
### 3. Incorrect Placement of Citations
*Example of mistake:* According to Williams, translation theory continues to evolve (2011: 77).
*Example of correction:* According to Williams (2011: 77), translation theory continues to evolve.
---
### 4. Redundant Entities
*Example of mistake:* As Lee (2009) explains, equivalence is central in translation (Lee 2009).
*Example of correction:* As Lee (2009) explains, equivalence is central in translation.