---
pipeline_tag: text-generation
language: en
library_name: transformers
tags:
- t5
- grammar-correction
- text-generation
---

# T5-REF-CORRUPT-EN: Automatic Error Correction of Academic Referencing According to the Institutional Guidelines of the Center for Translation Studies (CTS) at the University of Vienna

**Objective:** This model corrects errors in academic referencing. For example:  

*Input (wrong sentence)*: According to Smith **&** Peterson **2016 56**, the translation reveals patterns that suggest underlying semantic shifts

*Output (clean sentence)*: According to Smith **and** Peterson **(2016: 56)**, the translation reveals patterns that suggest underlying semantic shifts.

**Model Details:**  

- **Model name:** T5-REF-CORRUPT-EN  
- **Base model:** T5-base  
- **Language:** English  
- **Training data:** Sentences generated synthetically with LLMs, together with real student sentences that were synthetically corrupted.

**Intended use:** Correcting errors in academic references according to CTS guidelines.

## Example

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "elizaveta-dev/T5-REF-CORRUPT-EN"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

text = "According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts."
inputs = tokenizer(text, return_tensors="pt")

outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

## Use-Cases

The model automatically corrects several types of referencing errors, including:

### 1. Incorrect Citation Type (Parenthetical vs. Narrative)

*Example of mistake:* (Lopez 2018; Chen 2012) found that cultural context strongly influences translation strategies.

*Example of correction:* Lopez (2018) and Chen (2012) found that cultural context strongly influences translation strategies.


*Example of mistake:* This topic has been widely researched Baker (2006).

*Example of correction:* This topic has been widely researched (Baker 2006).

---

### 2. Incorrect Citation for Two Authors

*Example of mistake:* The concept of functional equivalence was analyzed by Baker & Green (2007).

*Example of correction:* The concept of functional equivalence was analyzed by Baker and Green (2007).


*Example of mistake:* Previous research (Müller, Schmidt 2001) highlights challenges in literary translation.

*Example of correction:* Previous research (Müller & Schmidt 2001) highlights challenges in literary translation.
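To make the rule itself concrete: narrative two-author citations take "and", while parenthetical ones take "&". The toy regex sketch below illustrates just this one pattern; it is not part of the model, which handles far more context than a pattern match can.

```python
import re

def fix_two_author_style(text: str) -> str:
    """Toy illustration of the CTS two-author rule (not the model's method):
    narrative citations use 'and'; parenthetical citations use '&'."""
    # Narrative: "Baker & Green (2007)" -> "Baker and Green (2007)"
    text = re.sub(r"(\b[A-ZÀ-Ž][\w-]+) & ([A-ZÀ-Ž][\w-]+ \(\d{4})",
                  r"\1 and \2", text)
    # Parenthetical: "(Müller, Schmidt 2001)" -> "(Müller & Schmidt 2001)"
    text = re.sub(r"\((\b[A-ZÀ-Ž][\w-]+), ([A-ZÀ-Ž][\w-]+ \d{4})\)",
                  r"(\1 & \2)", text)
    return text
```

A rule-based approach like this breaks down quickly (multiple authors, page numbers, unusual names), which is why the task is framed as sequence-to-sequence generation instead.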

---

### 3. Incorrect Placement of Citations

*Example of mistake:* According to Williams, translation theory continues to evolve (2011: 77).

*Example of correction:* According to Williams (2011: 77), translation theory continues to evolve.

---

### 4. Redundant Entities

*Example of mistake:* As Lee (2009) explains, equivalence is central in translation (Lee 2009).

*Example of correction:* As Lee (2009) explains, equivalence is central in translation.
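The redundancy rule can likewise be sketched in a few lines: if an author-year pair already appears narratively, a repeated parenthetical citation of the same work is dropped. This is only an illustrative heuristic with a hypothetical helper name, not how the model operates internally.

```python
import re

def drop_redundant_citation(text: str) -> str:
    """Toy sketch of the redundancy rule (not the model's method):
    remove a parenthetical '(Lee 2009)' when the same author and year
    already appear narratively as 'Lee (2009)'."""
    # Find narrative citations like "Lee (2009)"
    for author, year in re.findall(r"\b([A-ZÀ-Ž][\w-]+) \((\d{4})\)", text):
        # Drop any matching parenthetical repeat like " (Lee 2009)"
        # (names are used verbatim here; a robust version would re.escape them)
        text = re.sub(rf" ?\({author} {year}\)", "", text)
    return text
```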