RoBERTa Fine-tuned for Legal Contract Clause Extraction

Model Description

This model is a fine-tuned version of RoBERTa on the CUAD dataset for extracting 6 key clause types from legal contracts:

  • Governing Law
  • Expiration Date
  • Effective Date
  • Anti-Assignment
  • Cap On Liability
  • License Grant

Training Details

  • Base Model: roberta-base
  • Training Data: 30% of CUAD dataset (357 contracts)
  • Epochs: 2
  • Final Training Loss: 0.348

Usage

from transformers import pipeline

qa_pipeline = pipeline("question-answering", model="srraghuram/roberta-cuad-clause-extraction")

result = qa_pipeline( question="Highlight the parts related to Governing Law", context="Your contract text here..." ) print(result)

Limitations

  • Trained on commercial contracts, may not generalize to other legal document types
  • Performance varies by clause type
Downloads last month
31
Safetensors
Model size
81.5M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train srraghuram/roberta-cuad-clause-extraction