LocalLaws/LOCUS-Function

A ModernBERT classifier for the Primary Function axis of the LOCUS (Local Ordinances Corpus, United States) dataset.

Fine-tuned from answerdotai/ModernBERT-base on LocalLaws/LOCUS-v1.0.

Labels

  • Context
  • Enforcement
  • Process
  • Rules
  • Structural

Training

Base model answerdotai/ModernBERT-base
Max length 1024
Classifier pooling mean
Train / val / test 79106 / 10447 / 10447

Evaluation

Metric macro-F1
Validation macro-F1 0.8443
Test macro-F1 0.8428
Test accuracy 0.8849
              precision    recall  f1-score   support

     Context     0.8399    0.9138    0.8753      1033
 Enforcement     0.7561    0.8682    0.8083      1032
     Process     0.6038    0.7691    0.6765       654
       Rules     0.9308    0.8570    0.8924      4896
  Structural     0.9675    0.9555    0.9614      2832

    accuracy                         0.8849     10447
   macro avg     0.8196    0.8727    0.8428     10447
weighted avg     0.8940    0.8849    0.8876     10447

Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

tok = AutoTokenizer.from_pretrained("LocalLaws/LOCUS-Function")
model = AutoModelForSequenceClassification.from_pretrained("LocalLaws/LOCUS-Function")
model.eval()

text = "No person shall keep any swine within the city limits."
enc = tok(text, return_tensors="pt", truncation=True, max_length=1024)
with torch.no_grad():
    logits = model(**enc).logits
pred = logits.argmax(-1).item()
print(model.config.id2label[pred])
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LocalLaws/LOCUS-Function

Finetuned
(1234)
this model

Dataset used to train LocalLaws/LOCUS-Function

Collection including LocalLaws/LOCUS-Function