Walter committed
Commit · dfcbccd
Parent(s): 5c45f77
initial commit
This view is limited to 50 files because it contains too many changes.
- .gitattributes +6 -0
- README.md +122 -3
- discriminator/base/config.json +3 -0
- discriminator/base/pytorch_model.bin +3 -0
- discriminator/base/vocab.txt +0 -0
- discriminator/large-B96-L1e4-2M/config.json +3 -0
- discriminator/large-B96-L1e4-2M/pytorch_model.bin +3 -0
- discriminator/large-B96-L1e4-2M/vocab.txt +0 -0
- discriminator/small/config.json +3 -0
- discriminator/small/pytorch_model.bin +3 -0
- discriminator/small/vocab.txt +0 -0
- generator/base/config.json +3 -0
- generator/base/pytorch_model.bin +3 -0
- generator/base/vocab.txt +0 -0
- generator/large-B96-L1e4-2M/config.json +3 -0
- generator/large-B96-L1e4-2M/pytorch_model.bin +3 -0
- generator/large-B96-L1e4-2M/vocab.txt +0 -0
- generator/small/config.json +3 -0
- generator/small/pytorch_model.bin +3 -0
- generator/small/vocab.txt +0 -0
- runs/base/events.out.tfevents.1665678463.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1665824088.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1665911274.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1666038145.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1666058059.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1666062046.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1666063769.hk-electra-tpuv3 +3 -0
- runs/base/events.out.tfevents.1666064251.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665679361.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665694939.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665722154.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665862500.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665911345.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665965743.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1665993991.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1666062134.hk-electra-tpuv3 +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1666134343.hk-electra-tpuv3b +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1666135134.hk-electra-tpuv3b +3 -0
- runs/large-B96-L1e4-2M/events.out.tfevents.1666135271.hk-electra-tpuv3b +3 -0
- runs/small/events.out.tfevents.1665488622.hk-electra-tpuv3 +3 -0
- tf_checkpoint/base/checkpoint +2 -0
- tf_checkpoint/base/graph.pbtxt +3 -0
- tf_checkpoint/base/model.ckpt-1000000.data-00000-of-00001 +3 -0
- tf_checkpoint/base/model.ckpt-1000000.index +3 -0
- tf_checkpoint/base/model.ckpt-1000000.meta +3 -0
- tf_checkpoint/large-B96-L1e4-2M/checkpoint +2 -0
- tf_checkpoint/large-B96-L1e4-2M/graph.pbtxt +3 -0
- tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.data-00000-of-00001 +3 -0
- tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.index +3 -0
- tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.meta +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
*.tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
*.pbtxt filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
*.index filter=lfs diff=lfs merge=lfs -text
|
| 40 |
+
*.meta filter=lfs diff=lfs merge=lfs -text
|
| 41 |
+
*.data-* filter=lfs diff=lfs merge=lfs -text
|
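The six added patterns route more file types through Git LFS, which is why small files such as `config.json` show up later in this commit as three-line pointer files. A rough sketch of checking which paths these patterns would catch follows. It assumes plain `fnmatch` matching against the file's base name, which only approximates gitattributes matching for slash-free patterns like these, and `is_lfs_tracked` is a hypothetical helper name.

```python
from fnmatch import fnmatchcase
from posixpath import basename

# Patterns added to .gitattributes by this commit.
LFS_PATTERNS = ["*.json", "*.tfevents*", "*.pbtxt", "*.index", "*.meta", "*.data-*"]


def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Return True if the file's base name matches any slash-free pattern."""
    name = basename(path)
    return any(fnmatchcase(name, pattern) for pattern in patterns)


for path in [
    "generator/small/config.json",
    "tf_checkpoint/base/model.ckpt-1000000.index",
    "tf_checkpoint/base/checkpoint",
]:
    print(path, is_lfs_tracked(path))
```

Under this approximation the `checkpoint` state file is not matched by the new patterns, which is consistent with it appearing as plain text further down in the diff.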
README.md
CHANGED
|
@@ -1,3 +1,122 @@
|
|
| 1 |
-
---
|
| 2 |
-
|
| 3 |
-
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- yue
|
| 4 |
+
- zh
|
| 5 |
+
language_details: "yue-Hant-HK; zh-Hant-HK"
|
| 6 |
+
license: cc-by-4.0
|
| 7 |
+
datasets:
|
| 8 |
+
- SolarisCipher/hk_content_corpus
|
| 9 |
+
metrics:
|
| 10 |
+
- accuracy
|
| 11 |
+
- exact_match
|
| 12 |
+
tags:
|
| 13 |
+
- ELECTRA
|
| 14 |
+
- pretrained
|
| 15 |
+
- masked-language-model
|
| 16 |
+
- replaced-token-detection
|
| 17 |
+
- feature-extraction
|
| 18 |
+
library_name: transformers
|
| 19 |
+
---
|
| 20 |
+
|
| 21 |
+
# HKELECTRA - ELECTRA Pretrained Models for Hong Kong Content
|
| 22 |
+
|
| 23 |
+
This repository contains **ELECTRA models pretrained** on Hong Kong Cantonese and Traditional Chinese content, built to study the effects of diglossia on NLP modeling.
|
| 24 |
+
|
| 25 |
+
The repo includes:
|
| 26 |
+
|
| 27 |
+
- `generator/`: **generator** models in Hugging Face Transformers format, for masked token prediction.
|
| 28 |
+
- `discriminator/`: **discriminator** models in Hugging Face Transformers format, for replaced token detection.
|
| 29 |
+
- `tf_checkpoint/`: original **TensorFlow checkpoints** from pretraining (TensorFlow is required to load them).
|
| 30 |
+
- `runs/`: **TensorBoard logs** from pretraining.
|
| 31 |
+
|
| 32 |
+
**Note:** Because this repo contains multiple models with different purposes, there is **no `pipeline_tag`**. Select the model and pipeline that fit your use case. The TensorFlow checkpoints require TensorFlow 2.x or later to load manually.
|
| 33 |
+
|
| 34 |
+
These models are also available on Zenodo: https://doi.org/10.5281/zenodo.16889492
|
| 35 |
+
|
| 36 |
+
## Model Details
|
| 37 |
+
|
| 38 |
+
### Model Description
|
| 39 |
+
|
| 40 |
+
**Architecture:** ELECTRA (small/base/large)
|
| 41 |
+
**Pretraining:** from scratch (no base model)
|
| 42 |
+
**Languages:** Hong Kong Cantonese, Traditional Chinese
|
| 43 |
+
**Intended Use:** Research, feature extraction, masked token prediction
|
| 44 |
+
**License:** cc-by-4.0
|
| 45 |
+
|
| 46 |
+
## Usage Examples
|
| 47 |
+
|
| 48 |
+
### Load Generator (Masked LM)
|
| 49 |
+
|
| 50 |
+
```python
|
| 51 |
+
from transformers import ElectraTokenizer, ElectraForMaskedLM, pipeline
|
| 52 |
+
|
| 53 |
+
tokenizer = ElectraTokenizer.from_pretrained("SolarisCipher/HKELECTRA", subfolder="generator/small")
|
| 54 |
+
model = ElectraForMaskedLM.from_pretrained("SolarisCipher/HKELECTRA", subfolder="generator/small")
|
| 55 |
+
|
| 56 |
+
unmasker = pipeline("fill-mask", model=model, tokenizer=tokenizer)
|
| 57 |
+
unmasker("從中環[MASK]到尖沙咀。")
|
| 58 |
+
```
|
| 59 |
+
|
| 60 |
+
### Load Discriminator (Feature Extraction / Replaced Token Detection)
|
| 61 |
+
|
| 62 |
+
```python
|
| 63 |
+
from transformers import ElectraTokenizer, ElectraForPreTraining
|
| 64 |
+
|
| 65 |
+
tokenizer = ElectraTokenizer.from_pretrained("SolarisCipher/HKELECTRA", subfolder="discriminator/small")
|
| 66 |
+
model = ElectraForPreTraining.from_pretrained("SolarisCipher/HKELECTRA", subfolder="discriminator/small")
|
| 67 |
+
|
| 68 |
+
inputs = tokenizer("從中環坐車到[MASK]。", return_tensors="pt")
|
| 69 |
+
outputs = model(**inputs)  # outputs.logits holds one replaced-token score per input token
|
| 70 |
+
```
|
| 71 |
+
|
| 72 |
+
## Citation
|
| 73 |
+
|
| 74 |
+
If you use this model in your work, please cite our dataset and the original research:
|
| 75 |
+
|
| 76 |
+
Dataset (Upstream SQL Dump)
|
| 77 |
+
```bibtex
|
| 78 |
+
@dataset{yung_2025_16875235,
|
| 79 |
+
author = {Yung, Yiu Cheong},
|
| 80 |
+
title = {HK Web Text Corpus (MySQL Dump, raw version)},
|
| 81 |
+
month = aug,
|
| 82 |
+
year = 2025,
|
| 83 |
+
publisher = {Zenodo},
|
| 84 |
+
doi = {10.5281/zenodo.16875235},
|
| 85 |
+
url = {https://doi.org/10.5281/zenodo.16875235},
|
| 86 |
+
}
|
| 87 |
+
```
|
| 88 |
+
|
| 89 |
+
Dataset (Cleaned Corpus)
|
| 90 |
+
```bibtex
|
| 91 |
+
@dataset{yung_2025_16882351,
|
| 92 |
+
author = {Yung, Yiu Cheong},
|
| 93 |
+
title = {HK Content Corpus (Cantonese \& Traditional Chinese)},
|
| 94 |
+
month = aug,
|
| 95 |
+
year = 2025,
|
| 96 |
+
publisher = {Zenodo},
|
| 97 |
+
doi = {10.5281/zenodo.16882351},
|
| 98 |
+
url = {https://doi.org/10.5281/zenodo.16882351},
|
| 99 |
+
}
|
| 100 |
+
```
|
| 101 |
+
|
| 102 |
+
Research Paper
|
| 103 |
+
```bibtex
|
| 104 |
+
@article{10.1145/3744341,
|
| 105 |
+
author = {Yung, Yiu Cheong and Lin, Ying-Jia and Kao, Hung-Yu},
|
| 106 |
+
title = {Exploring the Effectiveness of Pre-training Language Models with Incorporation of Diglossia for Hong Kong Content},
|
| 107 |
+
year = {2025},
|
| 108 |
+
issue_date = {July 2025},
|
| 109 |
+
publisher = {Association for Computing Machinery},
|
| 110 |
+
address = {New York, NY, USA},
|
| 111 |
+
volume = {24},
|
| 112 |
+
number = {7},
|
| 113 |
+
issn = {2375-4699},
|
| 114 |
+
url = {https://doi.org/10.1145/3744341},
|
| 115 |
+
doi = {10.1145/3744341},
|
| 116 |
+
journal = {ACM Trans. Asian Low-Resour. Lang. Inf. Process.},
|
| 117 |
+
month = jul,
|
| 118 |
+
articleno = {71},
|
| 119 |
+
numpages = {16},
|
| 120 |
+
keywords = {Hong Kong, diglossia, ELECTRA, language modeling}
|
| 121 |
+
}
|
| 122 |
+
```
|
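The discriminator example in the README stops at the raw model output. As a minimal sketch of reading that output, assuming the usual ELECTRA convention that a positive logit (sigmoid above 0.5) marks a token as replaced, the logits can be turned into per-token flags as shown here. The tokens and logit values are made up for illustration, and `flag_replaced` is a hypothetical helper, not part of this repository.

```python
import math


def flag_replaced(tokens, logits, threshold=0.5):
    """Pair each token with True if its sigmoid score exceeds the threshold."""
    flags = []
    for token, logit in zip(tokens, logits):
        probability = 1.0 / (1.0 + math.exp(-logit))  # sigmoid of the raw logit
        flags.append((token, probability > threshold))
    return flags


# Hypothetical tokens and logits, for illustration only (not real model output).
tokens = ["[CLS]", "從", "中", "環", "[SEP]"]
logits = [-4.2, -3.1, 2.7, -2.9, -5.0]
print(flag_replaced(tokens, logits))
```

With a real model, the two lists would likely come from `tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])` and `outputs.logits[0].tolist()`.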
discriminator/base/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:27e25b1740e0f239391ef69c0211697e9c84866e53da8f8484282a4c5eef765c
|
| 3 |
+
size 467
|
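Each three-line file in this commit is a Git LFS pointer rather than the real content: a spec version, a SHA-256 object ID, and the size of the real file in bytes. A small sketch of reading one is given below, using the pointer for `discriminator/base/config.json` shown above; `parse_lfs_pointer` is a hypothetical helper and handles only the three keys that appear in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its key/value lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:27e25b1740e0f239391ef69c0211697e9c84866e53da8f8484282a4c5eef765c
size 467
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], info["size_bytes"])
```

The `size` field is the quickest way to compare the models from the diff alone: for example, the `pytorch_model.bin` pointers list 49470697 bytes for the small discriminator and 1302254445 bytes for the large one.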
discriminator/base/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f719031dc799e49b82f22325d945c88da4fcc54776962fb330d7873824990ef9
|
| 3 |
+
size 409157229
|
discriminator/base/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
discriminator/large-B96-L1e4-2M/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:074daf8b9ddafc0d853b3d940c827333d068f91c512338bf1f6262fcd02a99ba
|
| 3 |
+
size 469
|
discriminator/large-B96-L1e4-2M/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3d35082dcb15c82840c62e5346a13d85ab7b7418b1533e6ed3c64d9c3daa0417
|
| 3 |
+
size 1302254445
|
discriminator/large-B96-L1e4-2M/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
discriminator/small/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:98aa105c2183d4189722d8ac81a84a3e4d1e9152818734caf4546c8acaf5693f
|
| 3 |
+
size 466
|
discriminator/small/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ed3f25aa25a21a5456f11856f3f56a9929e51d4cc56de616436f187631570a52
|
| 3 |
+
size 49470697
|
discriminator/small/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
generator/base/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:141c7eb034490f4265437464c02914475a0583ad27592cdb124805b1b0e14fc9
|
| 3 |
+
size 463
|
generator/base/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f8fd590384b609320d02c17606a79d65b0fd6d6b1d82f3b3723b2826fcf192ed
|
| 3 |
+
size 106151079
|
generator/base/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
generator/large-B96-L1e4-2M/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:53578d9921c142686415b42b934dbc6559b1fd1c50e34f1ace768cd9a916652e
|
| 3 |
+
size 464
|
generator/large-B96-L1e4-2M/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f4ac454800b2036161ff0a824a3e0204fb877bf7bfdf3a4ad1c1bf75018adb36
|
| 3 |
+
size 166827367
|
generator/large-B96-L1e4-2M/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
generator/small/config.json
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d5710f85b07df9f04af744d1a2df051d3073d5cf2e8a80bb3b13cc29c6260ff6
|
| 3 |
+
size 461
|
generator/small/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a9ab3011c178489ba5d18ac347fcef5a4cd38ee005a26a83a46ea9fcf569c51a
|
| 3 |
+
size 13716839
|
generator/small/vocab.txt
ADDED
|
The diff for this file is too large to render.
|
runs/base/events.out.tfevents.1665678463.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e564732c4178b7e7f6e217e8f8d9668b82aef5cb5bdaa752c145fa467f492d2
|
| 3 |
+
size 49916565
|
runs/base/events.out.tfevents.1665824088.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d0c921aa6674b0f94f5106c5a92a7835d39cca2a6b87359fe70f970b97011ab9
|
| 3 |
+
size 49846765
|
runs/base/events.out.tfevents.1665911274.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f5e54937cc53900a23d4b78088e42c4eda5f4b26db8d472b8c58dbfc7d4f6fb9
|
| 3 |
+
size 49902346
|
runs/base/events.out.tfevents.1666038145.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:84b6df06a389b08f3c20828f1041d133eb2cf499433d6112dce111a545b41400
|
| 3 |
+
size 49810190
|
runs/base/events.out.tfevents.1666058059.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1930b98a03e50642bb961ccd309f38755bebf3bf9744dc085073452485f7415e
|
| 3 |
+
size 32343601
|
runs/base/events.out.tfevents.1666062046.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0d5f9d5019a319e4b3cc27c0338960565496a2f8733e01fc021bdf53a5dfe79f
|
| 3 |
+
size 40824673
|
runs/base/events.out.tfevents.1666063769.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f3f90cd477e96f2472bb67c35b67220480b1c975fa25f917feeee787fd6cdd7d
|
| 3 |
+
size 7199003
|
runs/base/events.out.tfevents.1666064251.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:70d246dbecaa675b184798c543f93b79655afeecf5247e1bb5f4c5c188d296dc
|
| 3 |
+
size 32763408
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665679361.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2b96e7481370e16de2da96d5a22e5209e9dd01f373a635cc2bb06a1a9ffc19f9
|
| 3 |
+
size 64106492
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665694939.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b1bd83650afe805616cbe486ee31cf88fe1ef08b560348f360bc86504a71cc38
|
| 3 |
+
size 80915759
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665722154.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9ba072b621eb16ac2c10c52f3682806d9d3bba16e80742aba11e74659c0ae63c
|
| 3 |
+
size 115581525
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665862500.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:745de7733886d5b33223a38ba1af336ca25fb6f186094e55f1d0966420a4e338
|
| 3 |
+
size 80941568
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665911345.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a1c1f7c011354deba627424d6dd76f6b7858f8abdef171e36b6a2cdc0904ea78
|
| 3 |
+
size 80946451
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665965743.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e121a01639e86bc6f4eb2e62519161ebd93ca9acd4b47f64342ddca4a5c004a5
|
| 3 |
+
size 80922987
|
runs/large-B96-L1e4-2M/events.out.tfevents.1665993991.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:280d7c61568c5759d10c7e6c76ff4193b9b842e12a26f4554e776132607b09d0
|
| 3 |
+
size 64145720
|
runs/large-B96-L1e4-2M/events.out.tfevents.1666062134.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e42b79cbb0ab088c11db74994dd5d0633ffed7d281ede0ab94f74ef2c3137dfd
|
| 3 |
+
size 64148675
|
runs/large-B96-L1e4-2M/events.out.tfevents.1666134343.hk-electra-tpuv3b
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:13ab7c99dcec3eeaba09d4e23153c5fa48609ef6f631ac109a925999252dcc9d
|
| 3 |
+
size 14286111
|
runs/large-B96-L1e4-2M/events.out.tfevents.1666135134.hk-electra-tpuv3b
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bfdc9ecce9871107a80849c07e5b4c58b31a08a523b104e6ab4d46a3782468cf
|
| 3 |
+
size 14286114
|
runs/large-B96-L1e4-2M/events.out.tfevents.1666135271.hk-electra-tpuv3b
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e5d7edf2eeed99e34f7be097fe939c8124a4e77a574db122cebfdfec02fbbef
|
| 3 |
+
size 65256032
|
runs/small/events.out.tfevents.1665488622.hk-electra-tpuv3
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:80980ce88819c7845a7ec2e3b75b8f6f56fa37b1b29f14b79bdf1e7b3f562a1c
|
| 3 |
+
size 32978061
|
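The event files under `runs/` follow TensorBoard's naming scheme, `events.out.tfevents.<unix seconds>.<hostname>`, so the file names alone indicate when each logging session started and on which host. The sketch below decodes one name from this commit; it assumes the numeric field is a Unix timestamp in seconds, and `parse_event_filename` is a hypothetical helper.

```python
from datetime import datetime, timezone


def parse_event_filename(name):
    """Split 'events.out.tfevents.<unix_seconds>.<hostname>' into its parts."""
    prefix = "events.out.tfevents."
    if not name.startswith(prefix):
        raise ValueError(f"not a TensorBoard event file name: {name}")
    timestamp, _, hostname = name[len(prefix):].partition(".")
    started = datetime.fromtimestamp(int(timestamp), tz=timezone.utc)
    return started, hostname


started, host = parse_event_filename("events.out.tfevents.1665488622.hk-electra-tpuv3")
print(started.isoformat(), host)
```

Sorting the names by their timestamp field gives the order in which the training sessions for each model size were started.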
tf_checkpoint/base/checkpoint
ADDED
|
@@ -0,0 +1,2 @@
| 1 |
+
model_checkpoint_path: "model.ckpt-1000000"
|
| 2 |
+
all_model_checkpoint_paths: "model.ckpt-1000000"
|
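The `checkpoint` file is a small text file that names the latest checkpoint prefix; TensorFlow's `tf.train.latest_checkpoint` reads it to locate the `.index` and `.data-*` files. A minimal sketch of extracting the prefix and training step without TensorFlow is shown here, assuming the step is the trailing number of the prefix; `read_checkpoint_state` is a hypothetical helper.

```python
import re


def read_checkpoint_state(text):
    """Extract the latest checkpoint prefix and its step from a 'checkpoint' file."""
    match = re.search(r'^model_checkpoint_path:\s*"([^"]+)"', text, flags=re.MULTILINE)
    if match is None:
        raise ValueError("no model_checkpoint_path entry found")
    prefix = match.group(1)
    step_match = re.search(r"-(\d+)$", prefix)
    step = int(step_match.group(1)) if step_match else None
    return prefix, step


state = '''model_checkpoint_path: "model.ckpt-1000000"
all_model_checkpoint_paths: "model.ckpt-1000000"
'''
print(read_checkpoint_state(state))
```

Read this way, the base checkpoint was saved at step 1000000 and the large one, further down, at step 2000000.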
tf_checkpoint/base/graph.pbtxt
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1e6d180eb2e06c4114a75ae7134a67a44a2773ef4f519ee561ea27ada4d32144
|
| 3 |
+
size 20279384
|
tf_checkpoint/base/model.ckpt-1000000.data-00000-of-00001
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0628f6bf5a4067e76649aa69c87d1ce0eed58b519236fab45753a2949fe339a9
|
| 3 |
+
size 1345949300
|
tf_checkpoint/base/model.ckpt-1000000.index
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e925a95f98a85f4b579ba29206ec12517e5bc31b1df57353368489a35455dff7
|
| 3 |
+
size 18063
|
tf_checkpoint/base/model.ckpt-1000000.meta
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d965ae35f21f71f5018e5de76082f72148521473f3e4e36a0f770aae7155e70e
|
| 3 |
+
size 8971897
|
tf_checkpoint/large-B96-L1e4-2M/checkpoint
ADDED
|
@@ -0,0 +1,2 @@
| 1 |
+
model_checkpoint_path: "model.ckpt-2000000"
|
| 2 |
+
all_model_checkpoint_paths: "model.ckpt-2000000"
|
tf_checkpoint/large-B96-L1e4-2M/graph.pbtxt
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:40077e936a82ea74e8024c93870792e70c70530978fa039047b3948626bf8448
|
| 3 |
+
size 39970200
|
tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.data-00000-of-00001
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:38c0f896ad9058c4e5cc51b6e93fe10df45fb1ddf9f8e849f2ec43db2a9ca302
|
| 3 |
+
size 4140317300
|
tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.index
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a925e88b46df901a20b60bca263e50b16bedd34b8881ae9eb7cf503b237e6149
|
| 3 |
+
size 34448
|
tf_checkpoint/large-B96-L1e4-2M/model.ckpt-2000000.meta
ADDED
|
@@ -0,0 +1,3 @@
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a22be54f295fdd13263d265ec243478b267588a26cb62389865d62c2e7cf9e75
|
| 3 |
+
size 17762941
|