Model Card for byt5-scan-gl-sg

Metrical scansion in Galician (lexical to metrical syllabification). Fine-tuned byT5.

Operates on a single line (without addidtional context lines, unlike the models ending with -cx in this collection.

Input format: E / os / *her- / mos / re- / ver- / *de- / cen / do / es- / *pri- / to / on- / de / mo- / *ra- / ren

Output for the above: E os / *her- / mos / re- / ver- / *de- / cen / do es- / *pri- / to on- / de / mo- / *ra- / ren

Use the code below to get started with the model.

import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "compellit/byt5-scan-gl-sg"

device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

text = "E / os / *her- / mos / re- / ver- / *de- / cen / do / es- / *pri- / to / on- / de / mo- / *ra- / ren"

inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_length=128,
        num_beams=1,
        do_sample=False
    )

print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Downloads last month: 5

Safetensors

Model size

0.3B params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for compellit/byt5-scansion-gl-sg

Base model

google/byt5-small

Finetuned

(290)

this model

Collection including compellit/byt5-scansion-gl-sg

Scansion Models

Collection

Automatic metrical scansion of poetry in Galician. Best in the series is byt5-scansion-gl-cx. • 6 items • Updated Apr 15