elizaveta-dev
/

T5-REF-CORRUPT-EN

Text Generation

text2text-generation

grammar-correction

text-generation-inference

Model card Files Files and versions

T5-REF-CORRUPT-EN / README.md

elizaveta-dev's picture

Update README.md

386d840 verified 3 months ago

|

history blame contribute delete

3.04 kB

	---
	---
	pipeline_tag: text-generation
	language: en
	library_name: transformers
	tags:
	- t5
	- grammar-correction
	- text-generation
	---

	# T5-REF-CORRUPT-EN: Automatic Error Correction of Academic Referencing According to Institutional Guidelines of the Center for Translation Studies (CTS) of University of Vienna

	Objective: This model corrects errors in academic referencing. For example:

	Input (wrong sentence): According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts

	Output (clean sentence): According to Smith and Peterson (2016: 56), the translation reveals patterns that suggest underlying semantic shifts.

	Model Details:

	- Model name: T5-REF-CORRUPT-EN
	- Base model: T5-base
	- Language: English
	- Training data: Synthetically generated using LLMs and synthetically corrupted real student sentences.

	Usage Cases: Error correction of academic references according to CTS guidelines.

	## Example

	```python
	from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

	model_id = "elizaveta-dev/T5-REF-CORRUPT-EN"

	tokenizer = AutoTokenizer.from_pretrained(model_id)
	model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

	text = "According to Smith & Peterson 2016 56, the translation reveals patterns that suggest underlying semantic shifts."
	inputs = tokenizer(text, return_tensors="pt")

	outputs = model.generate(**inputs, max_new_tokens=128)
	print(tokenizer.decode(outputs[0], skip_special_tokens=True))
	```

	## Use-Cases

	The model can perform automatic corrections of various referencing errors, including:

	### 1. Incorrect Citation Type (Parenthetical vs. Narrative)

	Example of mistake: (Lopez 2018; Chen 2012) found that cultural context strongly influences translation strategies.

	Example of correction: Lopez (2018) and Chen (2012) found that cultural context strongly influences translation strategies.


	Example of mistake: This topic has been widely researched Baker (2006).

	Example of correction: This topic has been widely researched (Baker 2006).

	---

	### 2. Incorrect Citation for Two Authors

	Example of mistake: The concept of functional equivalence was analyzed by Baker & Green (2007).

	Example of correction: The concept of functional equivalence was analyzed by Baker and Green (2007).


	Example of mistake: Previous research (Müller, Schmidt 2001) highlights challenges in literary translation.

	Example of correction: Previous research (Müller & Schmidt 2001) highlights challenges in literary translation.

	---

	### 3. Incorrect Placement of Citations

	Example of mistake: According to Williams, translation theory continues to evolve (2011: 77).

	Example of correction: According to Williams (2011: 77), translation theory continues to evolve.

	---

	### 4. Redundant Entities

	Example of mistake: As Lee (2009) explains, equivalence is central in translation (Lee 2009).

	Example of correction: As Lee (2009) explains, equivalence is central in translation.