jacobpol's picture
Upload folder using huggingface_hub
33a10fc verified
metadata
base_model: Qwen/Qwen3-4B-Instruct-2507
library_name: peft
pipeline_tag: text-generation
tags:
  - ner
  - named-entity-recognition
  - early-modern-english
  - historical-texts
  - commodity
  - lora
  - transformers
language:
  - en
license: mit

EarlyModernNER - COMMODITY Adapter

A LoRA adapter specialized for extracting COMMODITY entities from Early Modern English documents (1500-1800).

Entity Type: COMMODITY

Extracts trade goods, materials, and foodstuffs:

  • Agricultural products (sugar, tobacco, cotton, indigo)
  • Foodstuffs (butter, salt, bread, fish, spices)
  • Raw materials (timber, iron, wool, silk)
  • Spices and luxury goods (pepper, cinnamon, nutmeg)
  • Manufactured goods (cloth, linen)

Does NOT extract:

  • Currency terms (money, guineas, pounds)
  • Abstract concepts
  • Nationalities

Performance

Evaluated on 100 gold-standard annotated documents:

Metric Score
Precision 0.8473
Recall 0.8043
F1 0.8253

Training Details

  • Base Model: Qwen/Qwen3-4B-Instruct-2507
  • Method: QLoRA (4-bit quantization)
  • LoRA Rank: 64
  • Epochs: 2
  • Training Data: Augmented silver-standard annotations with synthetic hard negatives

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen3-4B-Instruct-2507",
    device_map="auto",
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-4B-Instruct-2507")
model = PeftModel.from_pretrained(base_model, "path/to/commodity_lora")

Or use via the EarlyModernNER package:

python -m earlymodernner --input your_docs/ --output results.jsonl

Limitations

  • Distinguishing commodities from other nouns requires context
  • Historical commodity names may differ from modern terms
  • Recipe/household texts may have different commodity distributions than trade documents

License

MIT License

Author

Jacob Polay, MA Student, University of Saskatchewan

Citation

@software{earlymodernner,
  title = {EarlyModernNER: Named Entity Recognition for Early Modern English},
  author = {Polay, Jacob},
  year = {2026}
}