Dataset: Peter512/patents-50k-green
Binary classifier for green patent detection (Y02 CPC codes). Fine-tuned from AI-Growth-Lab/PatentSBERTa using a 3-agent CrewAI debate system (Advocate / Skeptic / Judge).
Label: `is_green` (Y02 CPC codes). Output format: `{"label": 0/1, "confidence": "low/medium/high", "rationale": "..."}`.
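As a minimal sketch of how a record in the format above could be checked before it is used as a training label, the following uses only the Python standard library. The names `parse_judge_record` and `needs_review`, and the rule that a `"low"` confidence record is flagged for a human check, are assumptions made for illustration; the card does not publish the pipeline code.

```python
import json

# Confidence levels listed in the documented output format.
ALLOWED_CONFIDENCE = {"low", "medium", "high"}


def parse_judge_record(raw: str) -> dict:
    """Parse one JSON record and validate its three documented fields."""
    record = json.loads(raw)
    label = record.get("label")
    # Reject booleans explicitly: in Python, True == 1 would otherwise pass.
    if type(label) is not int or label not in (0, 1):
        raise ValueError(f"label must be 0 or 1, got {label!r}")
    if record.get("confidence") not in ALLOWED_CONFIDENCE:
        raise ValueError(f"unexpected confidence: {record.get('confidence')!r}")
    if not isinstance(record.get("rationale"), str):
        raise ValueError("rationale must be a string")
    return record


def needs_review(record: dict) -> bool:
    """Hypothetical rule: send low-confidence records to a human reviewer."""
    return record["confidence"] == "low"


example = '{"label": 1, "confidence": "high", "rationale": "Claims a solar cell."}'
record = parse_judge_record(example)
print(record["label"], needs_review(record))
```

Validating at parse time keeps malformed agent output from silently becoming a label.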
100% agent agreement (0 human overrides; no low-confidence outputs).

| Metric | Value |
|---|---|
| F1 | 0.8115 |
| Precision | 0.8224 |
| Recall | 0.8010 |
| Accuracy | 0.8142 |
Assignment 2 baseline: F1=0.8099 | Original baseline: F1=0.7696
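As a quick sanity check on the numbers above, F1 can be recomputed as the harmonic mean of precision and recall. This is only a sketch: the card does not say how the metrics were averaged, and the inputs are rounded to four decimals, so the recomputed value is expected to match the reported F1 only to within about 1e-4. The function name `f1_from_precision_recall` is illustrative.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (0.0 if both are 0)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the metrics table and baseline line above.
precision, recall, reported_f1 = 0.8224, 0.8010, 0.8115
assignment2_f1, original_f1 = 0.8099, 0.7696

f1 = f1_from_precision_recall(precision, recall)
print(f"F1 recomputed from rounded P/R: {f1:.4f}")
print(f"Gain over Assignment 2 baseline: {reported_f1 - assignment2_f1:+.4f}")
print(f"Gain over original baseline:     {reported_f1 - original_f1:+.4f}")
```

By the same arithmetic, the reported F1 is 0.0016 above the Assignment 2 baseline and 0.0419 above the original baseline.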
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
tokenizer = AutoTokenizer.from_pretrained("AI-Growth-Lab/PatentSBERTa", use_fast=False)
model = AutoModelForSequenceClassification.from_pretrained("Peter512/patentsbert-green-a3")
model.eval()
text = "A photovoltaic cell comprising a perovskite absorber layer..."
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)
with torch.no_grad():
    logits = model(**inputs).logits
label = logits.argmax().item() # 0=not_green, 1=green
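The snippet above returns only the arg-max label. If a probability for the green class is also wanted, for example to apply a custom decision threshold, the two logits can be passed through a softmax. The sketch below does this in plain Python so it runs without downloading the model. The names `green_probability` and `predict` and the default threshold of 0.5 are illustrative assumptions, and index 1 is taken to be the green class, as in the comment above. With the real model, the input would be `logits[0].tolist()`.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def green_probability(logits):
    """Probability of class 1 (assumed to be 'green')."""
    return softmax(logits)[1]


def predict(logits, threshold=0.5):
    """Return the thresholded label together with the green probability."""
    p_green = green_probability(logits)
    return {"label": int(p_green >= threshold), "p_green": p_green}


# Made-up logits for illustration; not output from the actual model.
print(predict([-1.0, 2.0]))
```

Raising the threshold trades recall for precision; any value other than 0.5 should be chosen on held-out data.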
Base model
AI-Growth-Lab/PatentSBERTa