Training in progress, step 500
- README.md +199 -0
- config.json +36 -0
- pytorch_model.bin +3 -0
- runs/Sep12_17-18-23_nid007662/events.out.tfevents.1757690318.nid007662.88323.0 +3 -0
- runs/Sep12_17-27-52_nid007662/events.out.tfevents.1757690879.nid007662.93380.0 +3 -0
- runs/Sep12_17-34-57_nid007662/events.out.tfevents.1757691304.nid007662.97075.0 +3 -0
- runs/Sep12_18-14-30_nid006632/events.out.tfevents.1757693676.nid006632.43325.0 +3 -0
- runs/Sep12_23-44-44_nid007360/events.out.tfevents.1757713492.nid007360.272671.0 +3 -0
- runs/Sep12_23-54-39_nid006757/events.out.tfevents.1757714086.nid006757.283834.0 +3 -0
- runs/Sep13_00-08-59_nid006658/events.out.tfevents.1757714947.nid006658.6000.0 +3 -0
- runs/Sep13_02-42-56_nid007114/events.out.tfevents.1757724184.nid007114.256359.0 +3 -0
- runs/Sep13_03-18-33_nid006608/events.out.tfevents.1757726319.nid006608.74531.0 +3 -0
- runs/Sep13_13-04-34_nid006621/events.out.tfevents.1757761486.nid006621.110484.0 +3 -0
- runs/Sep13_16-35-48_nid006748/events.out.tfevents.1757774154.nid006748.195703.0 +3 -0
- runs/Sep14_03-10-51_nid006726/events.out.tfevents.1757812258.nid006726.69600.0 +3 -0
- runs/Sep14_03-19-47_nid006726/events.out.tfevents.1757812794.nid006726.72480.0 +3 -0
- runs/Sep14_03-26-16_nid006726/events.out.tfevents.1757813184.nid006726.75048.0 +3 -0
- runs/Sep14_03-33-05_nid006726/events.out.tfevents.1757813592.nid006726.83525.0 +3 -0
- runs/Sep14_03-39-21_nid006726/events.out.tfevents.1757813968.nid006726.86050.0 +3 -0
- runs/Sep14_03-45-04_nid006726/events.out.tfevents.1757814311.nid006726.88493.0 +3 -0
- runs/Sep14_04-32-21_nid007204/events.out.tfevents.1757817150.nid007204.51886.0 +3 -0
- runs/Sep14_05-12-46_nid007237/events.out.tfevents.1757819574.nid007237.108860.0 +3 -0
- runs/Sep14_13-25-37_nid007424/events.out.tfevents.1757849146.nid007424.136489.0 +3 -0
- runs/Sep14_14-09-17_nid006665/events.out.tfevents.1757851764.nid006665.277341.0 +3 -0
- runs/Sep14_16-05-03_nid007152/events.out.tfevents.1757858711.nid007152.77315.0 +3 -0
- runs/Sep14_18-15-13_nid006958/events.out.tfevents.1757866520.nid006958.104416.0 +3 -0
- runs/Sep14_18-27-59_nid007178/events.out.tfevents.1757867286.nid007178.190529.0 +3 -0
- runs/Sep14_20-40-46_nid007345/events.out.tfevents.1757875253.nid007345.135845.0 +3 -0
- runs/Sep14_20-48-06_nid006878/events.out.tfevents.1757875693.nid006878.13392.0 +3 -0
- runs/Sep14_21-52-24_nid007249/events.out.tfevents.1757879556.nid007249.17581.0 +3 -0
- runs/Sep14_22-00-06_nid007043/events.out.tfevents.1757880019.nid007043.32397.0 +3 -0
- runs/Sep14_22-18-51_nid007230/events.out.tfevents.1757881139.nid007230.6286.0 +3 -0
- training_args.bin +3 -0
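The `runs/` entries above follow the naming that TensorBoard's event writer uses: `events.out.tfevents.<unix time>.<hostname>.<pid>.<counter>`. The directory prefix (for example `Sep12_17-18-23_nid007662`) looks like the local-time run directory that the `transformers` Trainer creates by default. A minimal sketch for pulling those fields out of a filename follows; the helper name `parse_event_filename` is hypothetical, and the pattern assumes the four-part suffix shown in this listing.

```python
import re
from datetime import datetime, timezone

# Assumed layout: events.out.tfevents.<unix time>.<hostname>.<pid>.<counter>
EVENT_RE = re.compile(r"^events\.out\.tfevents\.(\d+)\.(.+)\.(\d+)\.(\d+)$")


def parse_event_filename(name: str) -> dict:
    """Split a TensorBoard event filename into its components (hypothetical helper)."""
    match = EVENT_RE.match(name)
    if match is None:
        raise ValueError(f"not a recognised event filename: {name!r}")
    timestamp, hostname, pid, counter = match.groups()
    return {
        # The embedded timestamp is seconds since the Unix epoch.
        "created_utc": datetime.fromtimestamp(int(timestamp), tz=timezone.utc),
        "hostname": hostname,
        "pid": int(pid),
        "counter": int(counter),
    }


info = parse_event_filename("events.out.tfevents.1757690318.nid007662.88323.0")
print(info["created_utc"].isoformat(), info["hostname"], info["pid"])
```

Sorting the listing by the embedded timestamp gives the order in which the training runs were started, independent of which node they ran on.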
README.md
ADDED
@@ -0,0 +1,199 @@
+---
+library_name: transformers
+tags: []
+---
+
+# Model Card for Model ID
+
+<!-- Provide a quick summary of what the model is/does. -->
+
+
+
+## Model Details
+
+### Model Description
+
+<!-- Provide a longer summary of what this model is. -->
+
+This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+
+### Model Sources [optional]
+
+<!-- Provide the basic links for the model. -->
+
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+
+## Uses
+
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+### Direct Use
+
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+[More Information Needed]
+
+### Downstream Use [optional]
+
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+[More Information Needed]
+
+### Out-of-Scope Use
+
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+[More Information Needed]
+
+## Bias, Risks, and Limitations
+
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+[More Information Needed]
+
+### Recommendations
+
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+## How to Get Started with the Model
+
+Use the code below to get started with the model.
+
+[More Information Needed]
+
+## Training Details
+
+### Training Data
+
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+[More Information Needed]
+
+### Training Procedure
+
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+#### Preprocessing [optional]
+
+[More Information Needed]
+
+
+#### Training Hyperparameters
+
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+#### Speeds, Sizes, Times [optional]
+
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+[More Information Needed]
+
+## Evaluation
+
+<!-- This section describes the evaluation protocols and provides the results. -->
+
+### Testing Data, Factors & Metrics
+
+#### Testing Data
+
+<!-- This should link to a Dataset Card if possible. -->
+
+[More Information Needed]
+
+#### Factors
+
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+[More Information Needed]
+
+#### Metrics
+
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+[More Information Needed]
+
+### Results
+
+[More Information Needed]
+
+#### Summary
+
+
+
+## Model Examination [optional]
+
+<!-- Relevant interpretability work for the model goes here -->
+
+[More Information Needed]
+
+## Environmental Impact
+
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+
+## Technical Specifications [optional]
+
+### Model Architecture and Objective
+
+[More Information Needed]
+
+### Compute Infrastructure
+
+[More Information Needed]
+
+#### Hardware
+
+[More Information Needed]
+
+#### Software
+
+[More Information Needed]
+
+## Citation [optional]
+
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+**BibTeX:**
+
+[More Information Needed]
+
+**APA:**
+
+[More Information Needed]
+
+## Glossary [optional]
+
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+[More Information Needed]
+
+## More Information [optional]
+
+[More Information Needed]
+
+## Model Card Authors [optional]
+
+[More Information Needed]
+
+## Model Card Contact
+
+[More Information Needed]
config.json
ADDED
@@ -0,0 +1,36 @@
+{
+  "adapter_reduction": 16,
+  "architectures": [
+    "DistillationWrapper"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "dtype": "float32",
+  "embedding_size": 128,
+  "expert_intermediate_size": 2624,
+  "group_depth": 4,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 1024,
+  "initializer_range": 0.02,
+  "intermediate_size": 2624,
+  "layer_norm_eps": 1e-06,
+  "load_balancing_loss_coef": 0.2,
+  "lora_alpha": 32,
+  "lora_rank": 16,
+  "max_position_embeddings": 8192,
+  "model_type": "ModernALBERT",
+  "num_attention_heads": 16,
+  "num_expert_modules": 3,
+  "num_experts": 8,
+  "num_hidden_layers": 16,
+  "pad_token_id": 0,
+  "router_jitter_noise": 0.01,
+  "top_k": 2,
+  "torch_dtype": "bfloat16",
+  "transformers_version": "4.51.3",
+  "use_adapter": true,
+  "use_cache": true,
+  "use_moa": true,
+  "vocab_size": 50368
+}
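For orientation, a few quantities can be derived from the values in this config. The sketch below is only an illustration and not the model's own code: `DistillationWrapper` and `ModernALBERT` are custom classes whose implementation is not part of this commit, so the interpretations used here (LoRA scaling as `lora_alpha / lora_rank`, adapter bottleneck as `hidden_size / adapter_reduction`, ALBERT-style factorized embeddings, top-k expert routing) are assumptions based on common conventions.

```python
# Subset of the values from config.json in this commit.
config = {
    "adapter_reduction": 16,
    "embedding_size": 128,
    "hidden_size": 1024,
    "lora_alpha": 32,
    "lora_rank": 16,
    "num_attention_heads": 16,
    "num_experts": 8,
    "top_k": 2,
    "vocab_size": 50368,
}

# Per-head width in standard multi-head attention.
head_dim = config["hidden_size"] // config["num_attention_heads"]

# Conventional LoRA scaling factor (alpha / rank); assumed, not confirmed here.
lora_scaling = config["lora_alpha"] / config["lora_rank"]

# Assumed bottleneck width if the adapter divides hidden_size by the reduction.
adapter_bottleneck = config["hidden_size"] // config["adapter_reduction"]

# Fraction of experts that would be active per token under top-k routing.
active_expert_fraction = config["top_k"] / config["num_experts"]

# Embedding parameters with an ALBERT-style factorization (vocab x E, then E x H)
# versus a direct vocab x H table. Whether this model factorizes is an assumption.
factorized_embedding_params = (
    config["vocab_size"] * config["embedding_size"]
    + config["embedding_size"] * config["hidden_size"]
)
full_embedding_params = config["vocab_size"] * config["hidden_size"]

print(f"head_dim={head_dim}, lora_scaling={lora_scaling}")
print(f"adapter_bottleneck={adapter_bottleneck}, active experts={active_expert_fraction:.0%}")
print(f"embedding params: factorized={factorized_embedding_params:,} full={full_embedding_params:,}")
```

Note that the config lists both `"dtype": "float32"` and `"torch_dtype": "bfloat16"`; which of the two the custom loading code honours cannot be determined from this commit.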
pytorch_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:75a961f490b344a7f1e908e60d9dd20d60e0e34a3f11d3d3773859954b34df52
+size 943411142
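The three added lines are a Git LFS pointer, not the weights themselves: the repository stores the spec version, the SHA-256 of the real file, and its size in bytes, while the binary lives in LFS storage. A minimal sketch of reading such a pointer and checking a downloaded file against it is shown below; the helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, and the parser only handles the three keys that appear in this diff.

```python
import hashlib

POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:75a961f490b344a7f1e908e60d9dd20d60e0e34a3f11d3d3773859954b34df52
size 943411142
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path: str, pointer: dict) -> bool:
    """Check a local file's size and SHA-256 against a parsed pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large checkpoints are not loaded into memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["hash_algo"], pointer["size"], "bytes")
```

The same parser applies to every `events.out.tfevents.*` and `training_args.bin` entry in this commit, since they are all stored as LFS pointers with the same three fields.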
runs/Sep12_17-18-23_nid007662/events.out.tfevents.1757690318.nid007662.88323.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ca3e27c253fc051ad7205d058c51dadbee25eb24746e4d0de4ad1ce4abc2cd66
+size 5402
runs/Sep12_17-27-52_nid007662/events.out.tfevents.1757690879.nid007662.93380.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1370b5d8e513e55284979a3cb125cbc63d2d823bf55cdbd190b502d077404557
+size 5196
runs/Sep12_17-34-57_nid007662/events.out.tfevents.1757691304.nid007662.97075.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ea4d3de72fe2c0ea1b56e19bd3df79574dfcd58253dd50076f3baa445c1bb800
+size 5195
runs/Sep12_18-14-30_nid006632/events.out.tfevents.1757693676.nid006632.43325.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c3433b895ae8217b7862b309ae70b8f763b0fa2fd1a6fef78ddbda6407384344
+size 41227
runs/Sep12_23-44-44_nid007360/events.out.tfevents.1757713492.nid007360.272671.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3039e7a61a7e568311c552e790f6a839febc66686d12b04dbd17c6e62e77cb6c
+size 6639
runs/Sep12_23-54-39_nid006757/events.out.tfevents.1757714086.nid006757.283834.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8cc4cf66df3c80ab971048e457f1c3b54e1cb302a9ad6f1cba6a24784683905a
+size 36581
runs/Sep13_00-08-59_nid006658/events.out.tfevents.1757714947.nid006658.6000.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c99c5d99d2cf4b6e251a65754c96c3c2f7a1fea930460c9081709dabdc6b581b
+size 5604
runs/Sep13_02-42-56_nid007114/events.out.tfevents.1757724184.nid007114.256359.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:587addde6af647c22a4f9654f8868ae2e01c0f30f38c9336447b7a92c252020d
+size 5844
runs/Sep13_03-18-33_nid006608/events.out.tfevents.1757726319.nid006608.74531.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fc839de07771caec2e4355ca65b82e74d17481d69210f0ae91bce84d573d91fc
+size 5637
runs/Sep13_13-04-34_nid006621/events.out.tfevents.1757761486.nid006621.110484.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8af4e14a5c0c956b6ddcd94d8dcbd15836a925ab4270a9053d4a1e6b30eee683
+size 21138
runs/Sep13_16-35-48_nid006748/events.out.tfevents.1757774154.nid006748.195703.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:31b632dd6216b94590cf7c8023d8cd666b4410bd82b0761fb5f5e3c86b9191d2
+size 591817
runs/Sep14_03-10-51_nid006726/events.out.tfevents.1757812258.nid006726.69600.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9f2561cd18393e81371d2242addef351945da0a429b1a36ac034536025a8d42c
+size 5232
runs/Sep14_03-19-47_nid006726/events.out.tfevents.1757812794.nid006726.72480.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:407c96161847eabc3deee61a36ea1fea903b5f24a4e8f0f78fa9b977855fd2aa
+size 5232
runs/Sep14_03-26-16_nid006726/events.out.tfevents.1757813184.nid006726.75048.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:98ed373a716f67d971ca71450d1c103199e0dbc7ab6c3085f41fd4717460b91b
+size 5232
runs/Sep14_03-33-05_nid006726/events.out.tfevents.1757813592.nid006726.83525.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c0e1e0063d56a68bc0d85613597da747d1e86a5a3d2ab3db04a513283adbf106
+size 5232
runs/Sep14_03-39-21_nid006726/events.out.tfevents.1757813968.nid006726.86050.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:79fd7682670c3bd2a012483b7bbd86acf1be551944f1579e356cbe54d607943f
+size 5232
runs/Sep14_03-45-04_nid006726/events.out.tfevents.1757814311.nid006726.88493.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e5556c0271412a34f1d33b8b0898a84dea202aff35f14fdbf8acf2638b10e095
+size 31892
runs/Sep14_04-32-21_nid007204/events.out.tfevents.1757817150.nid007204.51886.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:17381122e7ba8446bde61b662eb16c0d7b4a31bf1e37962e11b404d6b54f671a
+size 26947
runs/Sep14_05-12-46_nid007237/events.out.tfevents.1757819574.nid007237.108860.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fd7e1f3e2d2ff340afecb99135d62c0d767764142c88750fa6e923b9a2c242de
+size 446197
runs/Sep14_13-25-37_nid007424/events.out.tfevents.1757849146.nid007424.136489.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:323e0140634b6128af4fd69ff7ea9dcad87abf5eb15553d475b7cb6d4cb3b151
+size 5228
runs/Sep14_14-09-17_nid006665/events.out.tfevents.1757851764.nid006665.277341.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cdaadd5cf8395bcc3ac550f8d002a5dea84895855cfdd0b7bf149b3df897555b
+size 5228
runs/Sep14_16-05-03_nid007152/events.out.tfevents.1757858711.nid007152.77315.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7b950fd8bdee562d3ce3101cbefa4a7b2f7d0302fb387b07e0a2cb2dcf3d1da1
+size 8353
runs/Sep14_18-15-13_nid006958/events.out.tfevents.1757866520.nid006958.104416.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:96e70e5f5877112dad74b994dfdbd904118cf4d2beffbeda2a5987f48790fe05
+size 5237
runs/Sep14_18-27-59_nid007178/events.out.tfevents.1757867286.nid007178.190529.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b64365e2fd1621ad68af3f0d686918f3e64c9b20ecb69915d643cb02c3854a8b
+size 11307
runs/Sep14_20-40-46_nid007345/events.out.tfevents.1757875253.nid007345.135845.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fde09581a71901371e7d7d66e996ce15aef687b6297bdd051c2c87d412cdb0ab
+size 5236
runs/Sep14_20-48-06_nid006878/events.out.tfevents.1757875693.nid006878.13392.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8f22f24c6243a0a34c044a6e82ef25da227e6ca77bb8944da5c0162385c5b121
+size 7099
runs/Sep14_21-52-24_nid007249/events.out.tfevents.1757879556.nid007249.17581.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8a0e17a3910cf3745230d5c280061e28c199697ade1a38e955a1ff1dea44a466
+size 8563
runs/Sep14_22-00-06_nid007043/events.out.tfevents.1757880019.nid007043.32397.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b328bbbb83331da222a381b6c13d065a4fc470be65b6497855bbab865e6b76ba
+size 5235
runs/Sep14_22-18-51_nid007230/events.out.tfevents.1757881139.nid007230.6286.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f4b81eb5fbf96625017af7e14bb7badf6e374f1c58216db0a1dddcfce344e86c
+size 15737
training_args.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:9540043b69e398857104c563a1ec22c9bcc89baa9d452eba21b4d93a7a7d15b2
+size 5432