Malolmalsky's picture
Upload README.md
e36a1ef verified
metadata
license: mit
base_model: facebook/rag-sequence-base
datasets:
  - Malolmalsky/new-commits
library_name: transformers
pipeline_tag: text-generation
tags:
  - rag
  - commit-message-generation
  - hyperbolic-geometry
  - software-maintenance
  - reproducible-research

RAG-Hyp Commit Message Generation Checkpoint

This repository stores the heavyweight checkpoint for the RAG-Hyp dissertation artifact. The source code, reproduction scripts, experiment matrix, and method-to-code traceability documentation are kept in the companion code repository.

Files

File Size, bytes SHA-256
checkpoint-170000/model.safetensors 2061032996 4f1b9e1837998652bdbf6fdf1aa9fc3e006b99d72d312fcb11eab7048e73b1ef
checkpoint-170000/config.json 5959 d4d3f41b44c41c7795a2717e6f5c8d0bebf93f5cf0f3f0e6c0ebad720aaaf93b

Data

The public commit dataset used by the reproduction pipeline is:

Base Model

The checkpoint is based on facebook/rag-sequence-base and is intended to be loaded by the RAG-Hyp runtime from the companion reproducibility repository.

Loading

python3 - <<'PY'
from huggingface_hub import snapshot_download

path = snapshot_download(
    repo_id="Malolmalsky/rag-hyp-commit-message-generation",
    allow_patterns=["checkpoint-170000/*", "artifact_manifest.json"],
)
print(path)
PY

Then point the runtime to the downloaded checkpoint:

export RAG_HYP_MODEL_PATH=/path/to/snapshot/checkpoint-170000

Reproducibility

artifact_manifest.json records file sizes, SHA-256 hashes, the source dataset, and the base model identifier.