Document public BERTose IAR usage
- README.md +20 -1
- SHA256SUMS +1 -1
README.md
CHANGED
@@ -14,6 +14,25 @@ tags:
 
 This repository contains the contrastively refined BERTose checkpoint used for iterative ambiguity resolution (IAR) over ambiguous WURCS BPE tokens.
 
+## Quick Start
+
+The recommended user path is the companion notebook:
+
+```python
+from huggingface_hub import hf_hub_download
+
+checkpoint = hf_hub_download(
+    repo_id="supanthadey1/bertose-iar-resolver",
+    filename="checkpoints/bertose_iar_resolver.pt",
+)
+ambiguity_map = hf_hub_download(
+    repo_id="supanthadey1/bertose-iar-resolver",
+    filename="vocab/bpe_ambiguity_tokens.json",
+)
+```
+
+No Hugging Face token is required for this BERTose IAR checkpoint now that the repository is public.
+
 ## Files
 
 - `checkpoints/bertose_iar_resolver.pt` - BERTose IAR checkpoint.
@@ -35,4 +54,4 @@ Token-level ambiguity-resolution predictions with confidence scores. The compani
 
 The resolver provides model-backed token updates and confidence values for ambiguous positions. It does not claim to reconstruct a final canonical WURCS string by itself, and it does not perform IUPAC-condensed to WURCS conversion.
 
-
+License metadata is currently `other`; update it when the final release license and citation text are chosen.
SHA256SUMS
CHANGED
@@ -1,5 +1,5 @@
 622368f62c23e97e9137c277eaadcc93ee3901cbb420b591422bb1c2e19689a5 ./.gitattributes
-
+67775250024ca8da24e0556d70279e9c6dae01f2f6ccc0fbf22b18b788ea360d ./README.md
 ae468f4e8c06dc0c3848138a474dc43249aa6d14dfd0df8f58d68fcaad371152 ./checkpoints/bertose_iar_resolver.pt
 5b52e098ec662b84900c0c4570404379f8abb15d07c1cea7fe512c036a670131 ./config.json
 6a56e6f73b8f874470ecde6e538f3f5029ae23aa6c10559817d1c2a8b59b7c0f ./requirements.txt
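Because this commit lists a SHA-256 digest for `checkpoints/bertose_iar_resolver.pt`, a downloaded copy can be checked against it. The following is a minimal sketch, not part of the repository: the helper names `parse_sha256sums`, `sha256_of`, and `verify` are hypothetical, and it assumes `SHA256SUMS` keeps the `<digest> ./<path>` layout shown in the diff.

```python
import hashlib


def parse_sha256sums(text: str) -> dict[str, str]:
    """Map each listed path (without a leading './') to its hex digest."""
    digests = {}
    for line in text.splitlines():
        parts = line.split(maxsplit=1)
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        digest, path = parts
        path = path.strip().lstrip("*")  # tolerate sha256sum's binary-mode marker
        if path.startswith("./"):
            path = path[2:]
        digests[path] = digest.lower()
    return digests


def sha256_of(path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a large checkpoint need not fit in memory."""
    hasher = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify(local_path, listed_name: str, sums_text: str) -> bool:
    """Return True when the local file matches the digest listed for listed_name."""
    expected = parse_sha256sums(sums_text).get(listed_name)
    return expected is not None and sha256_of(local_path) == expected
```

With the `checkpoint` path returned by `hf_hub_download` and the contents of `SHA256SUMS` read into a string `sums_text`, the check would be `verify(checkpoint, "checkpoints/bertose_iar_resolver.pt", sums_text)`. A `False` result means either the file differs from the listed digest or the name is not listed.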