# BIBFRAME-OLMo-1B-v2
Fine-tuned OLMo-1B for BIBFRAME RDF/XML correction. Trained on ~8,500 Library of Congress BIBFRAME records.
## Model Details

### Model Description
This model corrects malformed or incomplete BIBFRAME RDF/XML to produce valid, well-formed output following Library of Congress conventions. It was trained using LoRA (Low-Rank Adaptation) on real BIBFRAME records from id.loc.gov.
- Developed by: Jim Hahn
- Model type: Causal Language Model with LoRA adapter
- Language(s): RDF/XML (BIBFRAME vocabulary)
- License: Apache 2.0
- Finetuned from model: amd/AMD-OLMo-1B (native transformers format for ONNX/WebGPU compatibility)
### Model Sources
- Repository: https://github.com/jimfhahn/bibframe-olmo
- Training Dataset: https://huggingface.co/datasets/jimfhahn/bibframe-corrections
- Previous version: https://huggingface.co/jimfhahn/bibframe-olmo-1b
## Uses

### Direct Use
Correcting malformed BIBFRAME RDF/XML records to valid Library of Congress format.
### Downstream Use
- Integration with BIBFRAME validation pipelines
- Post-processing AI-generated BIBFRAME records
- Cleaning bulk catalog imports
- Part of the mcp4rdf-core validation and correction service
### Out-of-Scope Use
- Generating BIBFRAME from natural language descriptions (not trained for this)
- Non-BIBFRAME RDF vocabularies (Schema.org, Dublin Core, etc.)
- MARC record processing
## Bias, Risks, and Limitations
- Trained exclusively on Library of Congress records; may not generalize to other BIBFRAME implementations
- Cannot fix semantic errors (e.g., wrong subject headings), only structural/syntactic issues
- Large RDF documents may exceed context length (4096 tokens)
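The context-length limit can be screened for cheaply before invoking the model. A minimal sketch using a characters-per-token heuristic (the ratio and the headroom factor are assumptions; for an exact count, run the model's tokenizer on the text instead):

```python
def likely_fits_context(rdf_xml: str, context_limit: int = 4096,
                        chars_per_token: float = 3.0) -> bool:
    """Rough pre-check against the 4096-token context limit.

    The chars-per-token ratio is a heuristic assumption (RDF/XML is
    URI-heavy, so tokens tend to be short). Leaves headroom for the
    instruction prompt and the corrected output.
    """
    estimated_tokens = len(rdf_xml) / chars_per_token
    return estimated_tokens < context_limit * 0.4
```

Documents that fail this check can be split or routed to a human reviewer rather than truncated mid-record.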
### Recommendations
- Validate model output with SHACL shapes before use in production
- Use as part of a pipeline with human review for critical cataloging
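Before running full SHACL validation, a cheap structural pre-check can filter out obviously broken output. A sketch using only the standard library (the helper name is an assumption; this complements, and does not replace, SHACL shape validation):

```python
import xml.etree.ElementTree as ET

BF_NS = "http://id.loc.gov/ontologies/bibframe/"

def quick_check(rdf_xml: str) -> bool:
    """Return True if the output parses as XML and contains at least
    one element in the BIBFRAME namespace; otherwise False."""
    try:
        root = ET.fromstring(rdf_xml)
    except ET.ParseError:
        return False
    return any(el.tag.startswith("{" + BF_NS + "}") for el in root.iter())
```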
## How to Get Started with the Model
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load base model + LoRA adapter
model = AutoModelForCausalLM.from_pretrained('amd/AMD-OLMo-1B')
model = PeftModel.from_pretrained(model, 'jimfhahn/bibframe-olmo-1b-v2')
tokenizer = AutoTokenizer.from_pretrained('amd/AMD-OLMo-1B')

# Example: correct malformed BIBFRAME
prompt = """<|system|>
You are a BIBFRAME expert. Fix the following malformed RDF/XML to produce valid BIBFRAME.
<|user|>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bf="http://id.loc.gov/ontologies/bibframe/">
<bf:Work>
<bf:title>Example Book</bf:title>
</bf:Work>
</rdf:RDF>
<|assistant|>
"""

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
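Since `generate()` echoes the prompt, the decoded string contains the system/user turns followed by the completion. A small helper to keep only the correction (a sketch; if the chat markers are registered as special tokens, `skip_special_tokens=True` may already strip them, in which case slice by input length instead):

```python
def extract_correction(decoded: str, marker: str = "<|assistant|>") -> str:
    """Return only the text after the last assistant marker.

    Falls back to the full string when the marker is absent.
    """
    return decoded.rsplit(marker, 1)[-1].strip()
```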
## Training Details

### Training Data
- Source: Library of Congress (id.loc.gov)
- Records: ~4,100 Works + ~5,000 Instances
- Diversity: 102 facets covering subjects, languages, time periods, formats, and genres
Training pairs were generated by:
- Collecting valid BIBFRAME Works and Instances from id.loc.gov
- Applying synthetic corruptions (missing elements, invalid URIs, syntax errors)
- Training the model to restore the original valid RDF/XML
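The corruption step above can be sketched as a small function that applies one random perturbation to a valid record; the (corrupted, original) pair then becomes a training example. The specific corruption functions here are illustrative assumptions, not the actual pipeline:

```python
import random
import re

def corrupt(rdf_xml: str, rng: random.Random) -> str:
    """Apply one synthetic corruption to a valid BIBFRAME record."""
    corruptions = [
        # Missing element: drop one closing bf: tag.
        lambda s: re.sub(r"</bf:\w+>", "", s, count=1),
        # Invalid URI: break a namespace URI scheme.
        lambda s: s.replace("http://id.loc.gov", "http//id.loc.gov", 1),
        # Syntax error: damage a self-closing tag, or truncate.
        lambda s: s.replace("/>", "/", 1) if "/>" in s else s[:-1],
    ]
    return rng.choice(corruptions)(rdf_xml)
```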
### Training Procedure

#### Training Hyperparameters
- Training regime: bf16 mixed precision
- Optimizer: AdamW
- Learning rate: 2e-4
- Batch size: 4 (with gradient accumulation 4, effective batch 16)
- Epochs: 3
- LoRA rank: 64
- LoRA alpha: 128
- LoRA target modules: att_proj, attn_out, ff_proj, ff_out
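The LoRA hyperparameters above map onto a `peft.LoraConfig` roughly as follows (a sketch; `task_type` and the default dropout are assumptions not stated in this card):

```python
from peft import LoraConfig

# Adapter configuration reconstructed from the hyperparameters above.
lora_config = LoraConfig(
    r=64,
    lora_alpha=128,
    target_modules=["att_proj", "attn_out", "ff_proj", "ff_out"],
    task_type="CAUSAL_LM",  # assumption: not stated in the card
)
```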
#### Speeds, Sizes, Times
- Training time: ~7.5 hours
- Hardware: NVIDIA A100-SXM4-80GB
- Final loss: 0.118
- Adapter size: 168 MB
## Evaluation

### Metrics
- Training loss: 0.118 (final)
- Additional evaluation with SHACL validation pending
## Environmental Impact
- Hardware Type: NVIDIA A100-SXM4-80GB
- Hours used: 7.5
- Cloud Provider: Illinois Campus Cluster (NCSA)
- Compute Region: Illinois, USA
## Technical Specifications

### Model Architecture and Objective

OLMo-1B base model with LoRA adapters, trained with a causal language modeling objective on the BIBFRAME correction task.
### Compute Infrastructure

#### Hardware
NVIDIA A100-SXM4-80GB (1 GPU)
#### Software
- PyTorch 2.9.1
- Transformers 4.57.5
- PEFT 0.7.0
- ai2-olmo
## Citation

BibTeX:

```bibtex
@misc{bibframe-olmo-2026,
  author    = {Hahn, Jim},
  title     = {BIBFRAME-OLMo-1B-v2: Fine-tuned OLMo for BIBFRAME Correction},
  year      = {2026},
  publisher = {HuggingFace},
  url       = {https://huggingface.co/jimfhahn/bibframe-olmo-1b-v2}
}
```
## Model Card Authors
Jim Hahn
## Model Card Contact