agentlans
/

mdeberta-v3-base-readability

Text Classification

Model card Files Files and versions

agentlans commited on Oct 12, 2024

Commit

de65283

·

verified ·

1 Parent(s): 702a19c

Update README.md

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -32,6 +32,30 @@ The model was trained on [agentlans/tatoeba-english-translations](https://huggin
 ## Usage
 ## Results
 In this study, 10 English text samples of varying readability were generated and translated into Arabic, Chinese, French, Russian, and Spanish using Google Translate. This resulted in a total of 50 translated samples, which were subsequently analyzed by a trained classifier to predict their readability scores.

 ## Usage
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+model_name="agentlans/mdeberta-v3-base-readability"
+# Put model on GPU or else CPU
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+model = model.to(device)
+def readability(text):
+    """Processes the text using the model and returns its logits.
+    In this case, it's reading grade level in years of education
+    (the higher the number, the harder it is to read the text)."""
+    inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True).to(device)
+    with torch.no_grad():
+        logits = model(**inputs).logits.squeeze().cpu()
+    return logits.tolist()
+readability("Your text here.")
+```
 ## Results
 In this study, 10 English text samples of varying readability were generated and translated into Arabic, Chinese, French, Russian, and Spanish using Google Translate. This resulted in a total of 50 translated samples, which were subsequently analyzed by a trained classifier to predict their readability scores.