File size: 1,705 Bytes
e064df7
 
 
 
 
 
823c2a4
e064df7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
---
license: mit
base_model: facebook/rag-sequence-base
datasets:
- Malolmalsky/new-commits
library_name: transformers
pipeline_tag: text-generation
tags:
- rag
- commit-message-generation
- hyperbolic-geometry
- software-maintenance
- reproducible-research
---

# RAG-Hyp Commit Message Generation Checkpoint

This repository stores the heavyweight checkpoint for the RAG-Hyp dissertation
artifact. The source code, reproduction scripts, experiment matrix, and
method-to-code traceability documentation are kept in the companion code
repository.

## Files

| File | Size, bytes | SHA-256 |
|---|---:|---|
| `checkpoint-170000/model.safetensors` | `2061032996` | `4f1b9e1837998652bdbf6fdf1aa9fc3e006b99d72d312fcb11eab7048e73b1ef` |
| `checkpoint-170000/config.json` | `5959` | `d4d3f41b44c41c7795a2717e6f5c8d0bebf93f5cf0f3f0e6c0ebad720aaaf93b` |

## Data

The public commit dataset used by the reproduction pipeline is:

- `Malolmalsky/new-commits`
- <https://huggingface.co/datasets/Malolmalsky/new-commits>

## Base Model

The checkpoint is based on `facebook/rag-sequence-base` and is intended to be loaded by the
RAG-Hyp runtime from the companion reproducibility repository.

## Loading

```bash
python3 - <<'PY'
from huggingface_hub import snapshot_download

path = snapshot_download(
    repo_id="Malolmalsky/rag-hyp-commit-message-generation",
    allow_patterns=["checkpoint-170000/*", "artifact_manifest.json"],
)
print(path)
PY
```

Then point the runtime to the downloaded checkpoint:

```bash
export RAG_HYP_MODEL_PATH=/path/to/snapshot/checkpoint-170000
```

## Reproducibility

`artifact_manifest.json` records file sizes, SHA-256 hashes, the source dataset,
and the base model identifier.