Upload folder using huggingface_hub

Browse files

Files changed (10) hide show

prior_attributer_polymarket_4b_2900/README.md +202 -0
prior_attributer_polymarket_4b_2900/adapter_config.json +34 -0
prior_attributer_polymarket_4b_2900/adapter_model.safetensors +3 -0
prior_attributer_polymarket_4b_2900/config.json +6 -0
prior_attributer_polymarket_4b_2900/optimizer.pt +3 -0
prior_attributer_polymarket_4b_2900/regression_head.bin +3 -0
prior_attributer_polymarket_4b_2900/rng_state.pth +3 -0
prior_attributer_polymarket_4b_2900/scheduler.pt +3 -0
prior_attributer_polymarket_4b_2900/trainer_state.json +2111 -0
prior_attributer_polymarket_4b_2900/training_args.bin +3 -0

prior_attributer_polymarket_4b_2900/README.md ADDED Viewed

	@@ -0,0 +1,202 @@

+---
+base_model: Qwen/Qwen3-4B
+library_name: peft
+---
+# Model Card for Model ID
+<!-- Provide a quick summary of what the model is/does. -->
+## Model Details
+### Model Description
+<!-- Provide a longer summary of what this model is. -->
+- **Developed by:** [More Information Needed]
+- **Funded by [optional]:** [More Information Needed]
+- **Shared by [optional]:** [More Information Needed]
+- **Model type:** [More Information Needed]
+- **Language(s) (NLP):** [More Information Needed]
+- **License:** [More Information Needed]
+- **Finetuned from model [optional]:** [More Information Needed]
+### Model Sources [optional]
+<!-- Provide the basic links for the model. -->
+- **Repository:** [More Information Needed]
+- **Paper [optional]:** [More Information Needed]
+- **Demo [optional]:** [More Information Needed]
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+### Direct Use
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+[More Information Needed]
+### Downstream Use [optional]
+<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+[More Information Needed]
+### Out-of-Scope Use
+<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+[More Information Needed]
+## Bias, Risks, and Limitations
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+[More Information Needed]
+### Recommendations
+<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+## How to Get Started with the Model
+Use the code below to get started with the model.
+[More Information Needed]
+## Training Details
+### Training Data
+<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+[More Information Needed]
+### Training Procedure
+<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+#### Preprocessing [optional]
+[More Information Needed]
+#### Training Hyperparameters
+- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+#### Speeds, Sizes, Times [optional]
+<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+[More Information Needed]
+## Evaluation
+<!-- This section describes the evaluation protocols and provides the results. -->
+### Testing Data, Factors & Metrics
+#### Testing Data
+<!-- This should link to a Dataset Card if possible. -->
+[More Information Needed]
+#### Factors
+<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+[More Information Needed]
+#### Metrics
+<!-- These are the evaluation metrics being used, ideally with a description of why. -->
+[More Information Needed]
+### Results
+[More Information Needed]
+#### Summary
+## Model Examination [optional]
+<!-- Relevant interpretability work for the model goes here -->
+[More Information Needed]
+## Environmental Impact
+<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+- **Hardware Type:** [More Information Needed]
+- **Hours used:** [More Information Needed]
+- **Cloud Provider:** [More Information Needed]
+- **Compute Region:** [More Information Needed]
+- **Carbon Emitted:** [More Information Needed]
+## Technical Specifications [optional]
+### Model Architecture and Objective
+[More Information Needed]
+### Compute Infrastructure
+[More Information Needed]
+#### Hardware
+[More Information Needed]
+#### Software
+[More Information Needed]
+## Citation [optional]
+<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+**BibTeX:**
+[More Information Needed]
+**APA:**
+[More Information Needed]
+## Glossary [optional]
+<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+[More Information Needed]
+## More Information [optional]
+[More Information Needed]
+## Model Card Authors [optional]
+[More Information Needed]
+## Model Card Contact
+[More Information Needed]
+### Framework versions
+- PEFT 0.14.0

prior_attributer_polymarket_4b_2900/adapter_config.json ADDED Viewed

	@@ -0,0 +1,34 @@

+{
+  "alpha_pattern": {},
+  "auto_mapping": null,
+  "base_model_name_or_path": "/mnt/tidal-alsh-share2/usr/wangshanyong/models/Qwen/Qwen3-4B",
+  "bias": "none",
+  "eva_config": null,
+  "exclude_modules": null,
+  "fan_in_fan_out": false,
+  "inference_mode": true,
+  "init_lora_weights": true,
+  "layer_replication": null,
+  "layers_pattern": null,
+  "layers_to_transform": null,
+  "loftq_config": {},
+  "lora_alpha": 32,
+  "lora_bias": false,
+  "lora_dropout": 0.1,
+  "megatron_config": null,
+  "megatron_core": "megatron.core",
+  "modules_to_save": null,
+  "peft_type": "LORA",
+  "r": 32,
+  "rank_pattern": {},
+  "revision": null,
+  "target_modules": [
+    "q_proj",
+    "v_proj",
+    "k_proj",
+    "o_proj"
+  ],
+  "task_type": "CAUSAL_LM",
+  "use_dora": false,
+  "use_rslora": false
+}

prior_attributer_polymarket_4b_2900/adapter_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a1c95c58a7689dda030b9276202475a2afbb73e119f3d812310656643dd27cce
+size 94410616

prior_attributer_polymarket_4b_2900/config.json ADDED Viewed

	@@ -0,0 +1,6 @@

+{
+  "base_model_name_or_path": "/mnt/tidal-alsh-share2/usr/wangshanyong/models/Qwen/Qwen3-4B",
+  "max_length": 1024,
+  "model_type": "llm_regressor",
+  "transformers_version": "4.57.6"
+}

prior_attributer_polymarket_4b_2900/optimizer.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e0b93a35e03123d129d72dcdf397070f52b61132a7e654dfa0a8731eef520cda
+size 194410810

prior_attributer_polymarket_4b_2900/regression_head.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:06b6ceed8307b14b48e48f074c3babf963c647c3b2384c2d6869ef8cf77dde8d
+size 2712352

prior_attributer_polymarket_4b_2900/rng_state.pth ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3b7a17cd060e6c3d3ffdf9501d645843cb62c9cdb5d8741f86c19f6e2c0bd74a
+size 14244

prior_attributer_polymarket_4b_2900/scheduler.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:faa8d11a452b0f628fb0fa1a6874024ca1271dc9684e0052fc10457b7bae9b0d
+size 1064

prior_attributer_polymarket_4b_2900/trainer_state.json ADDED Viewed

	@@ -0,0 +1,2111 @@

+{
+  "best_global_step": 2500,
+  "best_metric": 0.15810872614383698,
+  "best_model_checkpoint": "../saves/prior_attributer_polymarket_4b/checkpoint-2500",
+  "epoch": 1.1832092216668366,
+  "eval_steps": 500,
+  "global_step": 2900,
+  "is_hyper_param_search": false,
+  "is_local_process_zero": true,
+  "is_world_process_zero": true,
+  "log_history": [
+    {
+      "epoch": 0.0004080383556054269,
+      "grad_norm": 1.197115421295166,
+      "learning_rate": 2e-05,
+      "loss": 0.115,
+      "step": 1
+    },
+    {
+      "epoch": 0.004080383556054269,
+      "grad_norm": 0.062769815325737,
+      "learning_rate": 1.9992656058751532e-05,
+      "loss": 0.1976,
+      "step": 10
+    },
+    {
+      "epoch": 0.008160767112108539,
+      "grad_norm": 2.5398786067962646,
+      "learning_rate": 1.998449612403101e-05,
+      "loss": 0.1798,
+      "step": 20
+    },
+    {
+      "epoch": 0.012241150668162807,
+      "grad_norm": 1.748403549194336,
+      "learning_rate": 1.9976336189310487e-05,
+      "loss": 0.3736,
+      "step": 30
+    },
+    {
+      "epoch": 0.016321534224217078,
+      "grad_norm": 1.7297178506851196,
+      "learning_rate": 1.9968176254589966e-05,
+      "loss": 0.2225,
+      "step": 40
+    },
+    {
+      "epoch": 0.020401917780271346,
+      "grad_norm": 1.073101282119751,
+      "learning_rate": 1.9960016319869442e-05,
+      "loss": 0.2245,
+      "step": 50
+    },
+    {
+      "epoch": 0.024482301336325615,
+      "grad_norm": 2.884895086288452,
+      "learning_rate": 1.995185638514892e-05,
+      "loss": 0.3386,
+      "step": 60
+    },
+    {
+      "epoch": 0.028562684892379883,
+      "grad_norm": 0.4594326913356781,
+      "learning_rate": 1.9943696450428397e-05,
+      "loss": 0.1972,
+      "step": 70
+    },
+    {
+      "epoch": 0.032643068448434155,
+      "grad_norm": 2.303440809249878,
+      "learning_rate": 1.9935536515707876e-05,
+      "loss": 0.2936,
+      "step": 80
+    },
+    {
+      "epoch": 0.03672345200448842,
+      "grad_norm": 0.6495940089225769,
+      "learning_rate": 1.9927376580987355e-05,
+      "loss": 0.2015,
+      "step": 90
+    },
+    {
+      "epoch": 0.04080383556054269,
+      "grad_norm": 0.44108060002326965,
+      "learning_rate": 1.991921664626683e-05,
+      "loss": 0.1682,
+      "step": 100
+    },
+    {
+      "epoch": 0.04488421911659696,
+      "grad_norm": 0.8983336687088013,
+      "learning_rate": 1.9911056711546307e-05,
+      "loss": 0.2188,
+      "step": 110
+    },
+    {
+      "epoch": 0.04896460267265123,
+      "grad_norm": 0.804106593132019,
+      "learning_rate": 1.9902896776825786e-05,
+      "loss": 0.1468,
+      "step": 120
+    },
+    {
+      "epoch": 0.0530449862287055,
+      "grad_norm": 1.6065974235534668,
+      "learning_rate": 1.9894736842105265e-05,
+      "loss": 0.2299,
+      "step": 130
+    },
+    {
+      "epoch": 0.057125369784759766,
+      "grad_norm": 2.307762622833252,
+      "learning_rate": 1.9886576907384744e-05,
+      "loss": 0.2235,
+      "step": 140
+    },
+    {
+      "epoch": 0.06120575334081404,
+      "grad_norm": 2.0962867736816406,
+      "learning_rate": 1.987841697266422e-05,
+      "loss": 0.1584,
+      "step": 150
+    },
+    {
+      "epoch": 0.06528613689686831,
+      "grad_norm": 1.4618809223175049,
+      "learning_rate": 1.98702570379437e-05,
+      "loss": 0.1578,
+      "step": 160
+    },
+    {
+      "epoch": 0.06936652045292258,
+      "grad_norm": 1.4815113544464111,
+      "learning_rate": 1.9862097103223178e-05,
+      "loss": 0.1921,
+      "step": 170
+    },
+    {
+      "epoch": 0.07344690400897684,
+      "grad_norm": 1.9700915813446045,
+      "learning_rate": 1.9853937168502654e-05,
+      "loss": 0.2035,
+      "step": 180
+    },
+    {
+      "epoch": 0.07752728756503112,
+      "grad_norm": 1.9198737144470215,
+      "learning_rate": 1.984577723378213e-05,
+      "loss": 0.1231,
+      "step": 190
+    },
+    {
+      "epoch": 0.08160767112108538,
+      "grad_norm": 1.500857949256897,
+      "learning_rate": 1.983761729906161e-05,
+      "loss": 0.2042,
+      "step": 200
+    },
+    {
+      "epoch": 0.08568805467713965,
+      "grad_norm": 2.8167471885681152,
+      "learning_rate": 1.9829457364341088e-05,
+      "loss": 0.2765,
+      "step": 210
+    },
+    {
+      "epoch": 0.08976843823319391,
+      "grad_norm": 1.9542330503463745,
+      "learning_rate": 1.9821297429620564e-05,
+      "loss": 0.1445,
+      "step": 220
+    },
+    {
+      "epoch": 0.0938488217892482,
+      "grad_norm": 1.3737446069717407,
+      "learning_rate": 1.9813137494900043e-05,
+      "loss": 0.0898,
+      "step": 230
+    },
+    {
+      "epoch": 0.09792920534530246,
+      "grad_norm": 0.0,
+      "learning_rate": 1.980497756017952e-05,
+      "loss": 0.1711,
+      "step": 240
+    },
+    {
+      "epoch": 0.10200958890135672,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9796817625458998e-05,
+      "loss": 0.1336,
+      "step": 250
+    },
+    {
+      "epoch": 0.106089972457411,
+      "grad_norm": 1.6241754293441772,
+      "learning_rate": 1.9788657690738477e-05,
+      "loss": 0.223,
+      "step": 260
+    },
+    {
+      "epoch": 0.11017035601346527,
+      "grad_norm": 0.8505207300186157,
+      "learning_rate": 1.9780497756017953e-05,
+      "loss": 0.1421,
+      "step": 270
+    },
+    {
+      "epoch": 0.11425073956951953,
+      "grad_norm": 1.3087742328643799,
+      "learning_rate": 1.977233782129743e-05,
+      "loss": 0.2088,
+      "step": 280
+    },
+    {
+      "epoch": 0.1183311231255738,
+      "grad_norm": 1.7539215087890625,
+      "learning_rate": 1.976417788657691e-05,
+      "loss": 0.2385,
+      "step": 290
+    },
+    {
+      "epoch": 0.12241150668162808,
+      "grad_norm": 1.763391137123108,
+      "learning_rate": 1.9756017951856387e-05,
+      "loss": 0.2221,
+      "step": 300
+    },
+    {
+      "epoch": 0.12649189023768234,
+      "grad_norm": 1.7321321964263916,
+      "learning_rate": 1.9747858017135862e-05,
+      "loss": 0.0796,
+      "step": 310
+    },
+    {
+      "epoch": 0.13057227379373662,
+      "grad_norm": 0.850107729434967,
+      "learning_rate": 1.973969808241534e-05,
+      "loss": 0.116,
+      "step": 320
+    },
+    {
+      "epoch": 0.13465265734979087,
+      "grad_norm": 0.14039260149002075,
+      "learning_rate": 1.973153814769482e-05,
+      "loss": 0.1679,
+      "step": 330
+    },
+    {
+      "epoch": 0.13873304090584515,
+      "grad_norm": 0.44985684752464294,
+      "learning_rate": 1.9723378212974296e-05,
+      "loss": 0.0986,
+      "step": 340
+    },
+    {
+      "epoch": 0.14281342446189943,
+      "grad_norm": 1.0865625143051147,
+      "learning_rate": 1.9715218278253775e-05,
+      "loss": 0.2011,
+      "step": 350
+    },
+    {
+      "epoch": 0.14689380801795368,
+      "grad_norm": 2.6993794441223145,
+      "learning_rate": 1.9707058343533255e-05,
+      "loss": 0.2065,
+      "step": 360
+    },
+    {
+      "epoch": 0.15097419157400796,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9698898408812734e-05,
+      "loss": 0.2226,
+      "step": 370
+    },
+    {
+      "epoch": 0.15505457513006224,
+      "grad_norm": 1.9446637630462646,
+      "learning_rate": 1.969073847409221e-05,
+      "loss": 0.1792,
+      "step": 380
+    },
+    {
+      "epoch": 0.1591349586861165,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9682578539371685e-05,
+      "loss": 0.1589,
+      "step": 390
+    },
+    {
+      "epoch": 0.16321534224217077,
+      "grad_norm": 3.4847676753997803,
+      "learning_rate": 1.9674418604651164e-05,
+      "loss": 0.2231,
+      "step": 400
+    },
+    {
+      "epoch": 0.16729572579822502,
+      "grad_norm": 2.1196365356445312,
+      "learning_rate": 1.9666258669930644e-05,
+      "loss": 0.2105,
+      "step": 410
+    },
+    {
+      "epoch": 0.1713761093542793,
+      "grad_norm": 2.080935478210449,
+      "learning_rate": 1.965809873521012e-05,
+      "loss": 0.1893,
+      "step": 420
+    },
+    {
+      "epoch": 0.17545649291033358,
+      "grad_norm": 0.0,
+      "learning_rate": 1.96499388004896e-05,
+      "loss": 0.1518,
+      "step": 430
+    },
+    {
+      "epoch": 0.17953687646638783,
+      "grad_norm": 1.2432128190994263,
+      "learning_rate": 1.9641778865769074e-05,
+      "loss": 0.1305,
+      "step": 440
+    },
+    {
+      "epoch": 0.1836172600224421,
+      "grad_norm": 1.5672948360443115,
+      "learning_rate": 1.9633618931048553e-05,
+      "loss": 0.1938,
+      "step": 450
+    },
+    {
+      "epoch": 0.1876976435784964,
+      "grad_norm": 0.0,
+      "learning_rate": 1.962545899632803e-05,
+      "loss": 0.1269,
+      "step": 460
+    },
+    {
+      "epoch": 0.19177802713455064,
+      "grad_norm": 0.7028200626373291,
+      "learning_rate": 1.9617299061607508e-05,
+      "loss": 0.1078,
+      "step": 470
+    },
+    {
+      "epoch": 0.19585841069060492,
+      "grad_norm": 0.5177062749862671,
+      "learning_rate": 1.9609139126886987e-05,
+      "loss": 0.1291,
+      "step": 480
+    },
+    {
+      "epoch": 0.1999387942466592,
+      "grad_norm": 1.3655359745025635,
+      "learning_rate": 1.9600979192166466e-05,
+      "loss": 0.2066,
+      "step": 490
+    },
+    {
+      "epoch": 0.20401917780271345,
+      "grad_norm": 1.7469329833984375,
+      "learning_rate": 1.9592819257445942e-05,
+      "loss": 0.1059,
+      "step": 500
+    },
+    {
+      "epoch": 0.20401917780271345,
+      "eval_loss": 0.1709977239370346,
+      "eval_runtime": 5717.6231,
+      "eval_samples_per_second": 0.646,
+      "eval_steps_per_second": 0.323,
+      "step": 500
+    },
+    {
+      "epoch": 0.20809956135876773,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9584659322725418e-05,
+      "loss": 0.1062,
+      "step": 510
+    },
+    {
+      "epoch": 0.212179944914822,
+      "grad_norm": 1.9649909734725952,
+      "learning_rate": 1.9576499388004897e-05,
+      "loss": 0.2056,
+      "step": 520
+    },
+    {
+      "epoch": 0.21626032847087626,
+      "grad_norm": 0.9991185069084167,
+      "learning_rate": 1.9568339453284376e-05,
+      "loss": 0.1863,
+      "step": 530
+    },
+    {
+      "epoch": 0.22034071202693054,
+      "grad_norm": 1.955867052078247,
+      "learning_rate": 1.9560179518563852e-05,
+      "loss": 0.1804,
+      "step": 540
+    },
+    {
+      "epoch": 0.22442109558298481,
+      "grad_norm": 1.5744529962539673,
+      "learning_rate": 1.955201958384333e-05,
+      "loss": 0.1147,
+      "step": 550
+    },
+    {
+      "epoch": 0.22850147913903907,
+      "grad_norm": 0.8454556465148926,
+      "learning_rate": 1.954385964912281e-05,
+      "loss": 0.1995,
+      "step": 560
+    },
+    {
+      "epoch": 0.23258186269509334,
+      "grad_norm": 1.5379407405853271,
+      "learning_rate": 1.9535699714402286e-05,
+      "loss": 0.2056,
+      "step": 570
+    },
+    {
+      "epoch": 0.2366622462511476,
+      "grad_norm": 1.6168546676635742,
+      "learning_rate": 1.9527539779681762e-05,
+      "loss": 0.1991,
+      "step": 580
+    },
+    {
+      "epoch": 0.24074262980720187,
+      "grad_norm": 0.9494819045066833,
+      "learning_rate": 1.951937984496124e-05,
+      "loss": 0.1712,
+      "step": 590
+    },
+    {
+      "epoch": 0.24482301336325615,
+      "grad_norm": 1.0418027639389038,
+      "learning_rate": 1.951121991024072e-05,
+      "loss": 0.1218,
+      "step": 600
+    },
+    {
+      "epoch": 0.2489033969193104,
+      "grad_norm": 1.579040288925171,
+      "learning_rate": 1.95030599755202e-05,
+      "loss": 0.1568,
+      "step": 610
+    },
+    {
+      "epoch": 0.2529837804753647,
+      "grad_norm": 1.290714144706726,
+      "learning_rate": 1.9494900040799675e-05,
+      "loss": 0.2278,
+      "step": 620
+    },
+    {
+      "epoch": 0.25706416403141896,
+      "grad_norm": 0.5078051686286926,
+      "learning_rate": 1.9486740106079154e-05,
+      "loss": 0.167,
+      "step": 630
+    },
+    {
+      "epoch": 0.26114454758747324,
+      "grad_norm": 0.5044044256210327,
+      "learning_rate": 1.947858017135863e-05,
+      "loss": 0.1019,
+      "step": 640
+    },
+    {
+      "epoch": 0.26522493114352746,
+      "grad_norm": 0.38573765754699707,
+      "learning_rate": 1.947042023663811e-05,
+      "loss": 0.1564,
+      "step": 650
+    },
+    {
+      "epoch": 0.26930531469958174,
+      "grad_norm": 1.4927674531936646,
+      "learning_rate": 1.9462260301917585e-05,
+      "loss": 0.2518,
+      "step": 660
+    },
+    {
+      "epoch": 0.273385698255636,
+      "grad_norm": 1.397609829902649,
+      "learning_rate": 1.9454100367197064e-05,
+      "loss": 0.1117,
+      "step": 670
+    },
+    {
+      "epoch": 0.2774660818116903,
+      "grad_norm": 1.1881047487258911,
+      "learning_rate": 1.9445940432476543e-05,
+      "loss": 0.0951,
+      "step": 680
+    },
+    {
+      "epoch": 0.2815464653677446,
+      "grad_norm": 1.1354044675827026,
+      "learning_rate": 1.943778049775602e-05,
+      "loss": 0.0996,
+      "step": 690
+    },
+    {
+      "epoch": 0.28562684892379886,
+      "grad_norm": 0.7019430994987488,
+      "learning_rate": 1.9429620563035498e-05,
+      "loss": 0.1364,
+      "step": 700
+    },
+    {
+      "epoch": 0.2897072324798531,
+      "grad_norm": 1.2556483745574951,
+      "learning_rate": 1.9421460628314974e-05,
+      "loss": 0.1707,
+      "step": 710
+    },
+    {
+      "epoch": 0.29378761603590736,
+      "grad_norm": 0.5789899230003357,
+      "learning_rate": 1.9413300693594453e-05,
+      "loss": 0.0999,
+      "step": 720
+    },
+    {
+      "epoch": 0.29786799959196164,
+      "grad_norm": 1.1356525421142578,
+      "learning_rate": 1.9405140758873932e-05,
+      "loss": 0.1767,
+      "step": 730
+    },
+    {
+      "epoch": 0.3019483831480159,
+      "grad_norm": 1.6264972686767578,
+      "learning_rate": 1.9396980824153408e-05,
+      "loss": 0.1597,
+      "step": 740
+    },
+    {
+      "epoch": 0.3060287667040702,
+      "grad_norm": 3.7391366958618164,
+      "learning_rate": 1.9388820889432887e-05,
+      "loss": 0.2732,
+      "step": 750
+    },
+    {
+      "epoch": 0.3101091502601245,
+      "grad_norm": 1.7408511638641357,
+      "learning_rate": 1.9380660954712366e-05,
+      "loss": 0.1474,
+      "step": 760
+    },
+    {
+      "epoch": 0.3141895338161787,
+      "grad_norm": 1.4884685277938843,
+      "learning_rate": 1.9372501019991842e-05,
+      "loss": 0.1737,
+      "step": 770
+    },
+    {
+      "epoch": 0.318269917372233,
+      "grad_norm": 0.45126017928123474,
+      "learning_rate": 1.9364341085271317e-05,
+      "loss": 0.179,
+      "step": 780
+    },
+    {
+      "epoch": 0.32235030092828726,
+      "grad_norm": 0.9976311326026917,
+      "learning_rate": 1.9356181150550797e-05,
+      "loss": 0.1212,
+      "step": 790
+    },
+    {
+      "epoch": 0.32643068448434154,
+      "grad_norm": 1.568253517150879,
+      "learning_rate": 1.9348021215830276e-05,
+      "loss": 0.1096,
+      "step": 800
+    },
+    {
+      "epoch": 0.3305110680403958,
+      "grad_norm": 0.0,
+      "learning_rate": 1.933986128110975e-05,
+      "loss": 0.1105,
+      "step": 810
+    },
+    {
+      "epoch": 0.33459145159645004,
+      "grad_norm": 3.553971290588379,
+      "learning_rate": 1.933170134638923e-05,
+      "loss": 0.1685,
+      "step": 820
+    },
+    {
+      "epoch": 0.3386718351525043,
+      "grad_norm": 0.37956804037094116,
+      "learning_rate": 1.932354141166871e-05,
+      "loss": 0.0832,
+      "step": 830
+    },
+    {
+      "epoch": 0.3427522187085586,
+      "grad_norm": 0.9464099407196045,
+      "learning_rate": 1.9315381476948186e-05,
+      "loss": 0.1784,
+      "step": 840
+    },
+    {
+      "epoch": 0.3468326022646129,
+      "grad_norm": 2.282661199569702,
+      "learning_rate": 1.9307221542227665e-05,
+      "loss": 0.1712,
+      "step": 850
+    },
+    {
+      "epoch": 0.35091298582066716,
+      "grad_norm": 1.1713647842407227,
+      "learning_rate": 1.929906160750714e-05,
+      "loss": 0.1147,
+      "step": 860
+    },
+    {
+      "epoch": 0.35499336937672143,
+      "grad_norm": 1.503341555595398,
+      "learning_rate": 1.929090167278662e-05,
+      "loss": 0.1052,
+      "step": 870
+    },
+    {
+      "epoch": 0.35907375293277566,
+      "grad_norm": 0.6751163005828857,
+      "learning_rate": 1.92827417380661e-05,
+      "loss": 0.2266,
+      "step": 880
+    },
+    {
+      "epoch": 0.36315413648882994,
+      "grad_norm": 0.8894637823104858,
+      "learning_rate": 1.9274581803345574e-05,
+      "loss": 0.1487,
+      "step": 890
+    },
+    {
+      "epoch": 0.3672345200448842,
+      "grad_norm": 1.1354976892471313,
+      "learning_rate": 1.926642186862505e-05,
+      "loss": 0.2175,
+      "step": 900
+    },
+    {
+      "epoch": 0.3713149036009385,
+      "grad_norm": 1.0130964517593384,
+      "learning_rate": 1.925826193390453e-05,
+      "loss": 0.2418,
+      "step": 910
+    },
+    {
+      "epoch": 0.3753952871569928,
+      "grad_norm": 0.5899848341941833,
+      "learning_rate": 1.925010199918401e-05,
+      "loss": 0.1668,
+      "step": 920
+    },
+    {
+      "epoch": 0.37947567071304705,
+      "grad_norm": 0.2949761748313904,
+      "learning_rate": 1.9241942064463484e-05,
+      "loss": 0.1446,
+      "step": 930
+    },
+    {
+      "epoch": 0.3835560542691013,
+      "grad_norm": 1.4031494855880737,
+      "learning_rate": 1.9233782129742963e-05,
+      "loss": 0.0917,
+      "step": 940
+    },
+    {
+      "epoch": 0.38763643782515556,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9225622195022442e-05,
+      "loss": 0.0876,
+      "step": 950
+    },
+    {
+      "epoch": 0.39171682138120983,
+      "grad_norm": 1.1788252592086792,
+      "learning_rate": 1.921746226030192e-05,
+      "loss": 0.1019,
+      "step": 960
+    },
+    {
+      "epoch": 0.3957972049372641,
+      "grad_norm": 1.3211113214492798,
+      "learning_rate": 1.9209302325581397e-05,
+      "loss": 0.1199,
+      "step": 970
+    },
+    {
+      "epoch": 0.3998775884933184,
+      "grad_norm": 2.4007322788238525,
+      "learning_rate": 1.9201142390860873e-05,
+      "loss": 0.2123,
+      "step": 980
+    },
+    {
+      "epoch": 0.4039579720493726,
+      "grad_norm": 0.7072489857673645,
+      "learning_rate": 1.9192982456140352e-05,
+      "loss": 0.2355,
+      "step": 990
+    },
+    {
+      "epoch": 0.4080383556054269,
+      "grad_norm": 0.9034311771392822,
+      "learning_rate": 1.918482252141983e-05,
+      "loss": 0.2086,
+      "step": 1000
+    },
+    {
+      "epoch": 0.4080383556054269,
+      "eval_loss": 0.16534732282161713,
+      "eval_runtime": 5717.6716,
+      "eval_samples_per_second": 0.646,
+      "eval_steps_per_second": 0.323,
+      "step": 1000
+    },
+    {
+      "epoch": 0.4121187391614812,
+      "grad_norm": 0.7085575461387634,
+      "learning_rate": 1.9176662586699307e-05,
+      "loss": 0.136,
+      "step": 1010
+    },
+    {
+      "epoch": 0.41619912271753545,
+      "grad_norm": 0.7955109477043152,
+      "learning_rate": 1.9168502651978786e-05,
+      "loss": 0.0829,
+      "step": 1020
+    },
+    {
+      "epoch": 0.42027950627358973,
+      "grad_norm": 0.4781329929828644,
+      "learning_rate": 1.9160342717258265e-05,
+      "loss": 0.2112,
+      "step": 1030
+    },
+    {
+      "epoch": 0.424359889829644,
+      "grad_norm": 0.0,
+      "learning_rate": 1.915218278253774e-05,
+      "loss": 0.1665,
+      "step": 1040
+    },
+    {
+      "epoch": 0.42844027338569823,
+      "grad_norm": 0.8854587078094482,
+      "learning_rate": 1.9144022847817217e-05,
+      "loss": 0.0929,
+      "step": 1050
+    },
+    {
+      "epoch": 0.4325206569417525,
+      "grad_norm": 0.7086293697357178,
+      "learning_rate": 1.9135862913096696e-05,
+      "loss": 0.1342,
+      "step": 1060
+    },
+    {
+      "epoch": 0.4366010404978068,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9127702978376175e-05,
+      "loss": 0.1739,
+      "step": 1070
+    },
+    {
+      "epoch": 0.44068142405386107,
+      "grad_norm": 1.0532270669937134,
+      "learning_rate": 1.9119543043655654e-05,
+      "loss": 0.1093,
+      "step": 1080
+    },
+    {
+      "epoch": 0.44476180760991535,
+      "grad_norm": 0.0,
+      "learning_rate": 1.911138310893513e-05,
+      "loss": 0.1425,
+      "step": 1090
+    },
+    {
+      "epoch": 0.44884219116596963,
+      "grad_norm": 0.38894256949424744,
+      "learning_rate": 1.9103223174214606e-05,
+      "loss": 0.0964,
+      "step": 1100
+    },
+    {
+      "epoch": 0.45292257472202385,
+      "grad_norm": 0.0,
+      "learning_rate": 1.9095063239494085e-05,
+      "loss": 0.0769,
+      "step": 1110
+    },
+    {
+      "epoch": 0.45700295827807813,
+      "grad_norm": 2.1243956089019775,
+      "learning_rate": 1.9086903304773564e-05,
+      "loss": 0.1549,
+      "step": 1120
+    },
+    {
+      "epoch": 0.4610833418341324,
+      "grad_norm": 0.37033727765083313,
+      "learning_rate": 1.907874337005304e-05,
+      "loss": 0.1836,
+      "step": 1130
+    },
+    {
+      "epoch": 0.4651637253901867,
+      "grad_norm": 0.05899756774306297,
+      "learning_rate": 1.907058343533252e-05,
+      "loss": 0.1451,
+      "step": 1140
+    },
+    {
+      "epoch": 0.46924410894624097,
+      "grad_norm": 1.3318108320236206,
+      "learning_rate": 1.9062423500611998e-05,
+      "loss": 0.1424,
+      "step": 1150
+    },
+    {
+      "epoch": 0.4733244925022952,
+      "grad_norm": 0.5499687790870667,
+      "learning_rate": 1.9054263565891474e-05,
+      "loss": 0.1343,
+      "step": 1160
+    },
+    {
+      "epoch": 0.47740487605834947,
+      "grad_norm": 0.7460530996322632,
+      "learning_rate": 1.904610363117095e-05,
+      "loss": 0.1318,
+      "step": 1170
+    },
+    {
+      "epoch": 0.48148525961440375,
+      "grad_norm": 0.18631018698215485,
+      "learning_rate": 1.903794369645043e-05,
+      "loss": 0.1335,
+      "step": 1180
+    },
+    {
+      "epoch": 0.48556564317045803,
+      "grad_norm": 1.2286078929901123,
+      "learning_rate": 1.9029783761729908e-05,
+      "loss": 0.1599,
+      "step": 1190
+    },
+    {
+      "epoch": 0.4896460267265123,
+      "grad_norm": 0.456785649061203,
+      "learning_rate": 1.9021623827009387e-05,
+      "loss": 0.0534,
+      "step": 1200
+    },
+    {
+      "epoch": 0.4937264102825666,
+      "grad_norm": 1.5648113489151,
+      "learning_rate": 1.9013463892288863e-05,
+      "loss": 0.1173,
+      "step": 1210
+    },
+    {
+      "epoch": 0.4978067938386208,
+      "grad_norm": 1.4287240505218506,
+      "learning_rate": 1.9005303957568342e-05,
+      "loss": 0.1784,
+      "step": 1220
+    },
+    {
+      "epoch": 0.5018871773946751,
+      "grad_norm": 0.4237353205680847,
+      "learning_rate": 1.8997144022847818e-05,
+      "loss": 0.0898,
+      "step": 1230
+    },
+    {
+      "epoch": 0.5059675609507294,
+      "grad_norm": 1.7313411235809326,
+      "learning_rate": 1.8988984088127297e-05,
+      "loss": 0.1186,
+      "step": 1240
+    },
+    {
+      "epoch": 0.5100479445067836,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8980824153406773e-05,
+      "loss": 0.2331,
+      "step": 1250
+    },
+    {
+      "epoch": 0.5141283280628379,
+      "grad_norm": 1.2604000568389893,
+      "learning_rate": 1.8972664218686252e-05,
+      "loss": 0.1924,
+      "step": 1260
+    },
+    {
+      "epoch": 0.5182087116188921,
+      "grad_norm": 0.24087439477443695,
+      "learning_rate": 1.896450428396573e-05,
+      "loss": 0.0633,
+      "step": 1270
+    },
+    {
+      "epoch": 0.5222890951749465,
+      "grad_norm": 0.14195257425308228,
+      "learning_rate": 1.8956344349245207e-05,
+      "loss": 0.0914,
+      "step": 1280
+    },
+    {
+      "epoch": 0.5263694787310007,
+      "grad_norm": 1.0728325843811035,
+      "learning_rate": 1.8948184414524686e-05,
+      "loss": 0.1142,
+      "step": 1290
+    },
+    {
+      "epoch": 0.5304498622870549,
+      "grad_norm": 2.187006711959839,
+      "learning_rate": 1.894002447980416e-05,
+      "loss": 0.162,
+      "step": 1300
+    },
+    {
+      "epoch": 0.5345302458431093,
+      "grad_norm": 2.7703022956848145,
+      "learning_rate": 1.893186454508364e-05,
+      "loss": 0.2188,
+      "step": 1310
+    },
+    {
+      "epoch": 0.5386106293991635,
+      "grad_norm": 1.2225255966186523,
+      "learning_rate": 1.892370461036312e-05,
+      "loss": 0.1318,
+      "step": 1320
+    },
+    {
+      "epoch": 0.5426910129552178,
+      "grad_norm": 1.5350176095962524,
+      "learning_rate": 1.8915544675642596e-05,
+      "loss": 0.1786,
+      "step": 1330
+    },
+    {
+      "epoch": 0.546771396511272,
+      "grad_norm": 1.9391083717346191,
+      "learning_rate": 1.8907384740922075e-05,
+      "loss": 0.1432,
+      "step": 1340
+    },
+    {
+      "epoch": 0.5508517800673264,
+      "grad_norm": 0.9705495834350586,
+      "learning_rate": 1.8899224806201554e-05,
+      "loss": 0.1274,
+      "step": 1350
+    },
+    {
+      "epoch": 0.5549321636233806,
+      "grad_norm": 0.8046002984046936,
+      "learning_rate": 1.889106487148103e-05,
+      "loss": 0.1206,
+      "step": 1360
+    },
+    {
+      "epoch": 0.5590125471794348,
+      "grad_norm": 1.8676600456237793,
+      "learning_rate": 1.8882904936760505e-05,
+      "loss": 0.2011,
+      "step": 1370
+    },
+    {
+      "epoch": 0.5630929307354892,
+      "grad_norm": 2.2367093563079834,
+      "learning_rate": 1.8874745002039984e-05,
+      "loss": 0.1716,
+      "step": 1380
+    },
+    {
+      "epoch": 0.5671733142915434,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8866585067319464e-05,
+      "loss": 0.155,
+      "step": 1390
+    },
+    {
+      "epoch": 0.5712536978475977,
+      "grad_norm": 0.0836629867553711,
+      "learning_rate": 1.885842513259894e-05,
+      "loss": 0.0718,
+      "step": 1400
+    },
+    {
+      "epoch": 0.5753340814036519,
+      "grad_norm": 2.3889925479888916,
+      "learning_rate": 1.885026519787842e-05,
+      "loss": 0.2362,
+      "step": 1410
+    },
+    {
+      "epoch": 0.5794144649597062,
+      "grad_norm": 2.2786920070648193,
+      "learning_rate": 1.8842105263157898e-05,
+      "loss": 0.1976,
+      "step": 1420
+    },
+    {
+      "epoch": 0.5834948485157605,
+      "grad_norm": 1.596222996711731,
+      "learning_rate": 1.8833945328437373e-05,
+      "loss": 0.1621,
+      "step": 1430
+    },
+    {
+      "epoch": 0.5875752320718147,
+      "grad_norm": 1.5035650730133057,
+      "learning_rate": 1.8825785393716853e-05,
+      "loss": 0.1539,
+      "step": 1440
+    },
+    {
+      "epoch": 0.5916556156278691,
+      "grad_norm": 1.4773615598678589,
+      "learning_rate": 1.8817625458996328e-05,
+      "loss": 0.1193,
+      "step": 1450
+    },
+    {
+      "epoch": 0.5957359991839233,
+      "grad_norm": 3.1149489879608154,
+      "learning_rate": 1.8809465524275807e-05,
+      "loss": 0.163,
+      "step": 1460
+    },
+    {
+      "epoch": 0.5998163827399775,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8801305589555287e-05,
+      "loss": 0.1259,
+      "step": 1470
+    },
+    {
+      "epoch": 0.6038967662960318,
+      "grad_norm": 1.1338975429534912,
+      "learning_rate": 1.8793145654834762e-05,
+      "loss": 0.1926,
+      "step": 1480
+    },
+    {
+      "epoch": 0.6079771498520861,
+      "grad_norm": 2.367356300354004,
+      "learning_rate": 1.878498572011424e-05,
+      "loss": 0.087,
+      "step": 1490
+    },
+    {
+      "epoch": 0.6120575334081404,
+      "grad_norm": 0.7989845275878906,
+      "learning_rate": 1.8776825785393717e-05,
+      "loss": 0.164,
+      "step": 1500
+    },
+    {
+      "epoch": 0.6120575334081404,
+      "eval_loss": 0.1599983274936676,
+      "eval_runtime": 5718.9834,
+      "eval_samples_per_second": 0.646,
+      "eval_steps_per_second": 0.323,
+      "step": 1500
+    },
+    {
+      "epoch": 0.6161379169641946,
+      "grad_norm": 0.7103864550590515,
+      "learning_rate": 1.8768665850673196e-05,
+      "loss": 0.1737,
+      "step": 1510
+    },
+    {
+      "epoch": 0.620218300520249,
+      "grad_norm": 1.410314679145813,
+      "learning_rate": 1.8760505915952672e-05,
+      "loss": 0.1538,
+      "step": 1520
+    },
+    {
+      "epoch": 0.6242986840763032,
+      "grad_norm": 2.189002275466919,
+      "learning_rate": 1.875234598123215e-05,
+      "loss": 0.1662,
+      "step": 1530
+    },
+    {
+      "epoch": 0.6283790676323574,
+      "grad_norm": 2.2174313068389893,
+      "learning_rate": 1.874418604651163e-05,
+      "loss": 0.1765,
+      "step": 1540
+    },
+    {
+      "epoch": 0.6324594511884117,
+      "grad_norm": 0.0,
+      "learning_rate": 1.873602611179111e-05,
+      "loss": 0.1565,
+      "step": 1550
+    },
+    {
+      "epoch": 0.636539834744466,
+      "grad_norm": 2.8284146785736084,
+      "learning_rate": 1.8727866177070585e-05,
+      "loss": 0.2672,
+      "step": 1560
+    },
+    {
+      "epoch": 0.6406202183005203,
+      "grad_norm": 0.0,
+      "learning_rate": 1.871970624235006e-05,
+      "loss": 0.0889,
+      "step": 1570
+    },
+    {
+      "epoch": 0.6447006018565745,
+      "grad_norm": 1.36979341506958,
+      "learning_rate": 1.871154630762954e-05,
+      "loss": 0.0965,
+      "step": 1580
+    },
+    {
+      "epoch": 0.6487809854126287,
+      "grad_norm": 2.8368990421295166,
+      "learning_rate": 1.870338637290902e-05,
+      "loss": 0.1882,
+      "step": 1590
+    },
+    {
+      "epoch": 0.6528613689686831,
+      "grad_norm": 1.439751148223877,
+      "learning_rate": 1.8695226438188495e-05,
+      "loss": 0.1345,
+      "step": 1600
+    },
+    {
+      "epoch": 0.6569417525247373,
+      "grad_norm": 1.7981996536254883,
+      "learning_rate": 1.8687066503467974e-05,
+      "loss": 0.2021,
+      "step": 1610
+    },
+    {
+      "epoch": 0.6610221360807916,
+      "grad_norm": 2.474820375442505,
+      "learning_rate": 1.8678906568747453e-05,
+      "loss": 0.2709,
+      "step": 1620
+    },
+    {
+      "epoch": 0.6651025196368459,
+      "grad_norm": 0.5656393766403198,
+      "learning_rate": 1.867074663402693e-05,
+      "loss": 0.1114,
+      "step": 1630
+    },
+    {
+      "epoch": 0.6691829031929001,
+      "grad_norm": 2.1718122959136963,
+      "learning_rate": 1.8662586699306405e-05,
+      "loss": 0.1922,
+      "step": 1640
+    },
+    {
+      "epoch": 0.6732632867489544,
+      "grad_norm": 1.9503840208053589,
+      "learning_rate": 1.8654426764585884e-05,
+      "loss": 0.1791,
+      "step": 1650
+    },
+    {
+      "epoch": 0.6773436703050086,
+      "grad_norm": 0.5494780540466309,
+      "learning_rate": 1.8646266829865363e-05,
+      "loss": 0.1891,
+      "step": 1660
+    },
+    {
+      "epoch": 0.681424053861063,
+      "grad_norm": 2.120107650756836,
+      "learning_rate": 1.8638106895144842e-05,
+      "loss": 0.1791,
+      "step": 1670
+    },
+    {
+      "epoch": 0.6855044374171172,
+      "grad_norm": 1.1772249937057495,
+      "learning_rate": 1.8629946960424318e-05,
+      "loss": 0.1071,
+      "step": 1680
+    },
+    {
+      "epoch": 0.6895848209731715,
+      "grad_norm": 0.5621774792671204,
+      "learning_rate": 1.8621787025703797e-05,
+      "loss": 0.1136,
+      "step": 1690
+    },
+    {
+      "epoch": 0.6936652045292258,
+      "grad_norm": 1.0332114696502686,
+      "learning_rate": 1.8613627090983273e-05,
+      "loss": 0.2104,
+      "step": 1700
+    },
+    {
+      "epoch": 0.69774558808528,
+      "grad_norm": 0.27499428391456604,
+      "learning_rate": 1.8605467156262752e-05,
+      "loss": 0.169,
+      "step": 1710
+    },
+    {
+      "epoch": 0.7018259716413343,
+      "grad_norm": 2.5944690704345703,
+      "learning_rate": 1.8597307221542228e-05,
+      "loss": 0.1507,
+      "step": 1720
+    },
+    {
+      "epoch": 0.7059063551973885,
+      "grad_norm": 1.3817540407180786,
+      "learning_rate": 1.8589147286821707e-05,
+      "loss": 0.1391,
+      "step": 1730
+    },
+    {
+      "epoch": 0.7099867387534429,
+      "grad_norm": 1.1489579677581787,
+      "learning_rate": 1.8580987352101186e-05,
+      "loss": 0.0891,
+      "step": 1740
+    },
+    {
+      "epoch": 0.7140671223094971,
+      "grad_norm": 1.933108925819397,
+      "learning_rate": 1.8572827417380662e-05,
+      "loss": 0.2514,
+      "step": 1750
+    },
+    {
+      "epoch": 0.7181475058655513,
+      "grad_norm": 1.160273551940918,
+      "learning_rate": 1.8564667482660138e-05,
+      "loss": 0.1934,
+      "step": 1760
+    },
+    {
+      "epoch": 0.7222278894216057,
+      "grad_norm": 0.7756880521774292,
+      "learning_rate": 1.8556507547939617e-05,
+      "loss": 0.1565,
+      "step": 1770
+    },
+    {
+      "epoch": 0.7263082729776599,
+      "grad_norm": 0.7927201390266418,
+      "learning_rate": 1.8548347613219096e-05,
+      "loss": 0.1851,
+      "step": 1780
+    },
+    {
+      "epoch": 0.7303886565337142,
+      "grad_norm": 1.8288319110870361,
+      "learning_rate": 1.8540187678498575e-05,
+      "loss": 0.1792,
+      "step": 1790
+    },
+    {
+      "epoch": 0.7344690400897684,
+      "grad_norm": 0.6128740310668945,
+      "learning_rate": 1.853202774377805e-05,
+      "loss": 0.1863,
+      "step": 1800
+    },
+    {
+      "epoch": 0.7385494236458227,
+      "grad_norm": 0.41025951504707336,
+      "learning_rate": 1.852386780905753e-05,
+      "loss": 0.1634,
+      "step": 1810
+    },
+    {
+      "epoch": 0.742629807201877,
+      "grad_norm": 0.1404421627521515,
+      "learning_rate": 1.851570787433701e-05,
+      "loss": 0.1592,
+      "step": 1820
+    },
+    {
+      "epoch": 0.7467101907579312,
+      "grad_norm": 0.4112943410873413,
+      "learning_rate": 1.8507547939616485e-05,
+      "loss": 0.1009,
+      "step": 1830
+    },
+    {
+      "epoch": 0.7507905743139855,
+      "grad_norm": 1.8249828815460205,
+      "learning_rate": 1.849938800489596e-05,
+      "loss": 0.142,
+      "step": 1840
+    },
+    {
+      "epoch": 0.7548709578700398,
+      "grad_norm": 1.3493293523788452,
+      "learning_rate": 1.849122807017544e-05,
+      "loss": 0.1564,
+      "step": 1850
+    },
+    {
+      "epoch": 0.7589513414260941,
+      "grad_norm": 1.402621865272522,
+      "learning_rate": 1.848306813545492e-05,
+      "loss": 0.1766,
+      "step": 1860
+    },
+    {
+      "epoch": 0.7630317249821483,
+      "grad_norm": 0.939300000667572,
+      "learning_rate": 1.8474908200734395e-05,
+      "loss": 0.1089,
+      "step": 1870
+    },
+    {
+      "epoch": 0.7671121085382026,
+      "grad_norm": 1.1979037523269653,
+      "learning_rate": 1.8466748266013874e-05,
+      "loss": 0.1941,
+      "step": 1880
+    },
+    {
+      "epoch": 0.7711924920942569,
+      "grad_norm": 0.6373042464256287,
+      "learning_rate": 1.8458588331293353e-05,
+      "loss": 0.1585,
+      "step": 1890
+    },
+    {
+      "epoch": 0.7752728756503111,
+      "grad_norm": 0.9635875225067139,
+      "learning_rate": 1.845042839657283e-05,
+      "loss": 0.1157,
+      "step": 1900
+    },
+    {
+      "epoch": 0.7793532592063654,
+      "grad_norm": 1.4881830215454102,
+      "learning_rate": 1.8442268461852308e-05,
+      "loss": 0.1192,
+      "step": 1910
+    },
+    {
+      "epoch": 0.7834336427624197,
+      "grad_norm": 0.6800144910812378,
+      "learning_rate": 1.8434108527131783e-05,
+      "loss": 0.0905,
+      "step": 1920
+    },
+    {
+      "epoch": 0.7875140263184739,
+      "grad_norm": 0.3224925398826599,
+      "learning_rate": 1.8425948592411263e-05,
+      "loss": 0.1117,
+      "step": 1930
+    },
+    {
+      "epoch": 0.7915944098745282,
+      "grad_norm": 2.013815402984619,
+      "learning_rate": 1.8417788657690742e-05,
+      "loss": 0.1974,
+      "step": 1940
+    },
+    {
+      "epoch": 0.7956747934305824,
+      "grad_norm": 2.8408901691436768,
+      "learning_rate": 1.8409628722970217e-05,
+      "loss": 0.1484,
+      "step": 1950
+    },
+    {
+      "epoch": 0.7997551769866368,
+      "grad_norm": 1.5153237581253052,
+      "learning_rate": 1.8401468788249693e-05,
+      "loss": 0.2739,
+      "step": 1960
+    },
+    {
+      "epoch": 0.803835560542691,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8393308853529172e-05,
+      "loss": 0.093,
+      "step": 1970
+    },
+    {
+      "epoch": 0.8079159440987452,
+      "grad_norm": 3.2609169483184814,
+      "learning_rate": 1.838514891880865e-05,
+      "loss": 0.1349,
+      "step": 1980
+    },
+    {
+      "epoch": 0.8119963276547996,
+      "grad_norm": 1.1488144397735596,
+      "learning_rate": 1.8376988984088127e-05,
+      "loss": 0.1121,
+      "step": 1990
+    },
+    {
+      "epoch": 0.8160767112108538,
+      "grad_norm": 0.609579861164093,
+      "learning_rate": 1.8368829049367606e-05,
+      "loss": 0.1625,
+      "step": 2000
+    },
+    {
+      "epoch": 0.8160767112108538,
+      "eval_loss": 0.16013823449611664,
+      "eval_runtime": 5718.8002,
+      "eval_samples_per_second": 0.646,
+      "eval_steps_per_second": 0.323,
+      "step": 2000
+    },
+    {
+      "epoch": 0.8201570947669081,
+      "grad_norm": 0.7778828144073486,
+      "learning_rate": 1.8360669114647086e-05,
+      "loss": 0.1858,
+      "step": 2010
+    },
+    {
+      "epoch": 0.8242374783229623,
+      "grad_norm": 1.5061280727386475,
+      "learning_rate": 1.8352509179926565e-05,
+      "loss": 0.1961,
+      "step": 2020
+    },
+    {
+      "epoch": 0.8283178618790167,
+      "grad_norm": 1.2525917291641235,
+      "learning_rate": 1.834434924520604e-05,
+      "loss": 0.2311,
+      "step": 2030
+    },
+    {
+      "epoch": 0.8323982454350709,
+      "grad_norm": 0.5317162871360779,
+      "learning_rate": 1.8336189310485516e-05,
+      "loss": 0.0839,
+      "step": 2040
+    },
+    {
+      "epoch": 0.8364786289911251,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8328029375764995e-05,
+      "loss": 0.2016,
+      "step": 2050
+    },
+    {
+      "epoch": 0.8405590125471795,
+      "grad_norm": 0.5220463275909424,
+      "learning_rate": 1.8319869441044474e-05,
+      "loss": 0.1013,
+      "step": 2060
+    },
+    {
+      "epoch": 0.8446393961032337,
+      "grad_norm": 0.755527913570404,
+      "learning_rate": 1.831170950632395e-05,
+      "loss": 0.2339,
+      "step": 2070
+    },
+    {
+      "epoch": 0.848719779659288,
+      "grad_norm": 1.3820732831954956,
+      "learning_rate": 1.830354957160343e-05,
+      "loss": 0.1481,
+      "step": 2080
+    },
+    {
+      "epoch": 0.8528001632153422,
+      "grad_norm": 1.9282928705215454,
+      "learning_rate": 1.8295389636882905e-05,
+      "loss": 0.1154,
+      "step": 2090
+    },
+    {
+      "epoch": 0.8568805467713965,
+      "grad_norm": 1.7541260719299316,
+      "learning_rate": 1.8287229702162384e-05,
+      "loss": 0.1828,
+      "step": 2100
+    },
+    {
+      "epoch": 0.8609609303274508,
+      "grad_norm": 1.4934720993041992,
+      "learning_rate": 1.827906976744186e-05,
+      "loss": 0.1316,
+      "step": 2110
+    },
+    {
+      "epoch": 0.865041313883505,
+      "grad_norm": 0.8395527601242065,
+      "learning_rate": 1.827090983272134e-05,
+      "loss": 0.1496,
+      "step": 2120
+    },
+    {
+      "epoch": 0.8691216974395594,
+      "grad_norm": 3.2863991260528564,
+      "learning_rate": 1.8262749898000818e-05,
+      "loss": 0.1944,
+      "step": 2130
+    },
+    {
+      "epoch": 0.8732020809956136,
+      "grad_norm": 0.6508553624153137,
+      "learning_rate": 1.8254589963280297e-05,
+      "loss": 0.1392,
+      "step": 2140
+    },
+    {
+      "epoch": 0.8772824645516678,
+      "grad_norm": 0.0,
+      "learning_rate": 1.8246430028559773e-05,
+      "loss": 0.1707,
+      "step": 2150
+    },
+    {
+      "epoch": 0.8813628481077221,
+      "grad_norm": 1.3363920450210571,
+      "learning_rate": 1.823827009383925e-05,
+      "loss": 0.1975,
+      "step": 2160
+    },
+    {
+      "epoch": 0.8854432316637764,
+      "grad_norm": 0.38948386907577515,
+      "learning_rate": 1.8230110159118728e-05,
+      "loss": 0.1374,
+      "step": 2170
+    },
+    {
+      "epoch": 0.8895236152198307,
+      "grad_norm": 1.5190832614898682,
+      "learning_rate": 1.8221950224398207e-05,
+      "loss": 0.153,
+      "step": 2180
+    },
+    {
+      "epoch": 0.8936039987758849,
+      "grad_norm": 1.8394863605499268,
+      "learning_rate": 1.8213790289677683e-05,
+      "loss": 0.207,
+      "step": 2190
+    },
+    {
+      "epoch": 0.8976843823319393,
+      "grad_norm": 1.5787442922592163,
+      "learning_rate": 1.8205630354957162e-05,
+      "loss": 0.1856,
+      "step": 2200
+    },
+    {
+      "epoch": 0.9017647658879935,
+      "grad_norm": 0.7729844450950623,
+      "learning_rate": 1.819747042023664e-05,
+      "loss": 0.1998,
+      "step": 2210
+    },
+    {
+      "epoch": 0.9058451494440477,
+      "grad_norm": 1.661285400390625,
+      "learning_rate": 1.8189310485516117e-05,
+      "loss": 0.1606,
+      "step": 2220
+    },
+    {
+      "epoch": 0.909925533000102,
+      "grad_norm": 3.445115804672241,
+      "learning_rate": 1.8181150550795593e-05,
+      "loss": 0.1219,
+      "step": 2230
+    },
+    {
+      "epoch": 0.9140059165561563,
+      "grad_norm": 1.5525267124176025,
+      "learning_rate": 1.8172990616075072e-05,
+      "loss": 0.1664,
+      "step": 2240
+    },
+    {
+      "epoch": 0.9180863001122106,
+      "grad_norm": 0.8300206661224365,
+      "learning_rate": 1.816483068135455e-05,
+      "loss": 0.1576,
+      "step": 2250
+    },
+    {
+      "epoch": 0.9221666836682648,
+      "grad_norm": 1.2452751398086548,
+      "learning_rate": 1.815667074663403e-05,
+      "loss": 0.1384,
+      "step": 2260
+    },
+    {
+      "epoch": 0.926247067224319,
+      "grad_norm": 1.6318614482879639,
+      "learning_rate": 1.8148510811913506e-05,
+      "loss": 0.1932,
+      "step": 2270
+    },
+    {
+      "epoch": 0.9303274507803734,
+      "grad_norm": 1.384819507598877,
+      "learning_rate": 1.8140350877192985e-05,
+      "loss": 0.1398,
+      "step": 2280
+    },
+    {
+      "epoch": 0.9344078343364276,
+      "grad_norm": 0.0,
+      "learning_rate": 1.813219094247246e-05,
+      "loss": 0.1721,
+      "step": 2290
+    },
+    {
+      "epoch": 0.9384882178924819,
+      "grad_norm": 0.0,
+      "learning_rate": 1.812403100775194e-05,
+      "loss": 0.1654,
+      "step": 2300
+    },
+    {
+      "epoch": 0.9425686014485362,
+      "grad_norm": 0.7114019393920898,
+      "learning_rate": 1.8115871073031416e-05,
+      "loss": 0.1158,
+      "step": 2310
+    },
+    {
+      "epoch": 0.9466489850045904,
+      "grad_norm": 2.6222026348114014,
+      "learning_rate": 1.8107711138310895e-05,
+      "loss": 0.1539,
+      "step": 2320
+    },
+    {
+      "epoch": 0.9507293685606447,
+      "grad_norm": 1.1143105030059814,
+      "learning_rate": 1.8099551203590374e-05,
+      "loss": 0.1768,
+      "step": 2330
+    },
+    {
+      "epoch": 0.9548097521166989,
+      "grad_norm": 0.3291604518890381,
+      "learning_rate": 1.809139126886985e-05,
+      "loss": 0.0838,
+      "step": 2340
+    },
+    {
+      "epoch": 0.9588901356727533,
+      "grad_norm": 0.5898478031158447,
+      "learning_rate": 1.808323133414933e-05,
+      "loss": 0.1646,
+      "step": 2350
+    },
+    {
+      "epoch": 0.9629705192288075,
+      "grad_norm": 1.2440063953399658,
+      "learning_rate": 1.8075071399428805e-05,
+      "loss": 0.1116,
+      "step": 2360
+    },
+    {
+      "epoch": 0.9670509027848618,
+      "grad_norm": 1.5655691623687744,
+      "learning_rate": 1.8066911464708284e-05,
+      "loss": 0.1588,
+      "step": 2370
+    },
+    {
+      "epoch": 0.9711312863409161,
+      "grad_norm": 0.8769559264183044,
+      "learning_rate": 1.8058751529987763e-05,
+      "loss": 0.1125,
+      "step": 2380
+    },
+    {
+      "epoch": 0.9752116698969703,
+      "grad_norm": 0.835755467414856,
+      "learning_rate": 1.805059159526724e-05,
+      "loss": 0.1997,
+      "step": 2390
+    },
+    {
+      "epoch": 0.9792920534530246,
+      "grad_norm": 2.1245999336242676,
+      "learning_rate": 1.8042431660546718e-05,
+      "loss": 0.1586,
+      "step": 2400
+    },
+    {
+      "epoch": 0.9833724370090788,
+      "grad_norm": 0.8960317373275757,
+      "learning_rate": 1.8034271725826197e-05,
+      "loss": 0.1208,
+      "step": 2410
+    },
+    {
+      "epoch": 0.9874528205651332,
+      "grad_norm": 0.6613627076148987,
+      "learning_rate": 1.8026111791105673e-05,
+      "loss": 0.2237,
+      "step": 2420
+    },
+    {
+      "epoch": 0.9915332041211874,
+      "grad_norm": 2.5867104530334473,
+      "learning_rate": 1.801795185638515e-05,
+      "loss": 0.1689,
+      "step": 2430
+    },
+    {
+      "epoch": 0.9956135876772416,
+      "grad_norm": 1.074992060661316,
+      "learning_rate": 1.8009791921664628e-05,
+      "loss": 0.1577,
+      "step": 2440
+    },
+    {
+      "epoch": 0.999693971233296,
+      "grad_norm": 1.3898792266845703,
+      "learning_rate": 1.8001631986944107e-05,
+      "loss": 0.169,
+      "step": 2450
+    },
+    {
+      "epoch": 1.0036723452004488,
+      "grad_norm": 1.4916174411773682,
+      "learning_rate": 1.7993472052223582e-05,
+      "loss": 0.1197,
+      "step": 2460
+    },
+    {
+      "epoch": 1.0077527287565031,
+      "grad_norm": 0.7944161295890808,
+      "learning_rate": 1.798531211750306e-05,
+      "loss": 0.1644,
+      "step": 2470
+    },
+    {
+      "epoch": 1.0118331123125575,
+      "grad_norm": 1.5463371276855469,
+      "learning_rate": 1.797715218278254e-05,
+      "loss": 0.2344,
+      "step": 2480
+    },
+    {
+      "epoch": 1.0159134958686116,
+      "grad_norm": 2.5019683837890625,
+      "learning_rate": 1.7968992248062016e-05,
+      "loss": 0.1641,
+      "step": 2490
+    },
+    {
+      "epoch": 1.019993879424666,
+      "grad_norm": 0.33804982900619507,
+      "learning_rate": 1.7960832313341496e-05,
+      "loss": 0.105,
+      "step": 2500
+    },
+    {
+      "epoch": 1.019993879424666,
+      "eval_loss": 0.15810872614383698,
+      "eval_runtime": 5718.7353,
+      "eval_samples_per_second": 0.646,
+      "eval_steps_per_second": 0.323,
+      "step": 2500
+    },
+    {
+      "epoch": 1.0240742629807202,
+      "grad_norm": 0.44688037037849426,
+      "learning_rate": 1.795267237862097e-05,
+      "loss": 0.0821,
+      "step": 2510
+    },
+    {
+      "epoch": 1.0281546465367744,
+      "grad_norm": 1.251035451889038,
+      "learning_rate": 1.794451244390045e-05,
+      "loss": 0.1784,
+      "step": 2520
+    },
+    {
+      "epoch": 1.0322350300928287,
+      "grad_norm": 2.9573662281036377,
+      "learning_rate": 1.793635250917993e-05,
+      "loss": 0.2249,
+      "step": 2530
+    },
+    {
+      "epoch": 1.036315413648883,
+      "grad_norm": 1.3778438568115234,
+      "learning_rate": 1.7928192574459405e-05,
+      "loss": 0.1034,
+      "step": 2540
+    },
+    {
+      "epoch": 1.0403957972049374,
+      "grad_norm": 1.9515317678451538,
+      "learning_rate": 1.7920032639738884e-05,
+      "loss": 0.1395,
+      "step": 2550
+    },
+    {
+      "epoch": 1.0444761807609915,
+      "grad_norm": 0.7084790468215942,
+      "learning_rate": 1.791187270501836e-05,
+      "loss": 0.1052,
+      "step": 2560
+    },
+    {
+      "epoch": 1.0485565643170458,
+      "grad_norm": 1.4658987522125244,
+      "learning_rate": 1.790371277029784e-05,
+      "loss": 0.1926,
+      "step": 2570
+    },
+    {
+      "epoch": 1.0526369478731001,
+      "grad_norm": 1.8518738746643066,
+      "learning_rate": 1.7895552835577315e-05,
+      "loss": 0.1359,
+      "step": 2580
+    },
+    {
+      "epoch": 1.0567173314291542,
+      "grad_norm": 0.0,
+      "learning_rate": 1.7887392900856794e-05,
+      "loss": 0.2138,
+      "step": 2590
+    },
+    {
+      "epoch": 1.0607977149852086,
+      "grad_norm": 3.8889942169189453,
+      "learning_rate": 1.7879232966136273e-05,
+      "loss": 0.1807,
+      "step": 2600
+    },
+    {
+      "epoch": 1.064878098541263,
+      "grad_norm": 0.2436802089214325,
+      "learning_rate": 1.7871073031415753e-05,
+      "loss": 0.1309,
+      "step": 2610
+    },
+    {
+      "epoch": 1.068958482097317,
+      "grad_norm": 1.300466775894165,
+      "learning_rate": 1.786291309669523e-05,
+      "loss": 0.1192,
+      "step": 2620
+    },
+    {
+      "epoch": 1.0730388656533714,
+      "grad_norm": 1.0126774311065674,
+      "learning_rate": 1.7854753161974704e-05,
+      "loss": 0.1204,
+      "step": 2630
+    },
+    {
+      "epoch": 1.0771192492094257,
+      "grad_norm": 1.8178058862686157,
+      "learning_rate": 1.7846593227254183e-05,
+      "loss": 0.2804,
+      "step": 2640
+    },
+    {
+      "epoch": 1.08119963276548,
+      "grad_norm": 3.756274938583374,
+      "learning_rate": 1.7838433292533662e-05,
+      "loss": 0.2423,
+      "step": 2650
+    },
+    {
+      "epoch": 1.0852800163215341,
+      "grad_norm": 1.1179070472717285,
+      "learning_rate": 1.7830273357813138e-05,
+      "loss": 0.0965,
+      "step": 2660
+    },
+    {
+      "epoch": 1.0893603998775885,
+      "grad_norm": 0.0,
+      "learning_rate": 1.7822113423092617e-05,
+      "loss": 0.1685,
+      "step": 2670
+    },
+    {
+      "epoch": 1.0934407834336428,
+      "grad_norm": 1.443263292312622,
+      "learning_rate": 1.7813953488372096e-05,
+      "loss": 0.1253,
+      "step": 2680
+    },
+    {
+      "epoch": 1.097521166989697,
+      "grad_norm": 0.5109815001487732,
+      "learning_rate": 1.7805793553651572e-05,
+      "loss": 0.1199,
+      "step": 2690
+    },
+    {
+      "epoch": 1.1016015505457513,
+      "grad_norm": 0.22729843854904175,
+      "learning_rate": 1.7797633618931048e-05,
+      "loss": 0.0929,
+      "step": 2700
+    },
+    {
+      "epoch": 1.1056819341018056,
+      "grad_norm": 1.8977311849594116,
+      "learning_rate": 1.7789473684210527e-05,
+      "loss": 0.1941,
+      "step": 2710
+    },
+    {
+      "epoch": 1.10976231765786,
+      "grad_norm": 0.0,
+      "learning_rate": 1.7781313749490006e-05,
+      "loss": 0.1466,
+      "step": 2720
+    },
+    {
+      "epoch": 1.113842701213914,
+      "grad_norm": 0.0,
+      "learning_rate": 1.7773153814769485e-05,
+      "loss": 0.2937,
+      "step": 2730
+    },
+    {
+      "epoch": 1.1179230847699684,
+      "grad_norm": 1.0497016906738281,
+      "learning_rate": 1.776499388004896e-05,
+      "loss": 0.2416,
+      "step": 2740
+    },
+    {
+      "epoch": 1.1220034683260227,
+      "grad_norm": 0.0,
+      "learning_rate": 1.7756833945328437e-05,
+      "loss": 0.2409,
+      "step": 2750
+    },
+    {
+      "epoch": 1.1260838518820768,
+      "grad_norm": 1.1163699626922607,
+      "learning_rate": 1.7748674010607916e-05,
+      "loss": 0.1572,
+      "step": 2760
+    },
+    {
+      "epoch": 1.1301642354381312,
+      "grad_norm": 1.966738224029541,
+      "learning_rate": 1.7740514075887395e-05,
+      "loss": 0.1724,
+      "step": 2770
+    },
+    {
+      "epoch": 1.1342446189941855,
+      "grad_norm": 1.8599481582641602,
+      "learning_rate": 1.773235414116687e-05,
+      "loss": 0.1455,
+      "step": 2780
+    },
+    {
+      "epoch": 1.1383250025502396,
+      "grad_norm": 0.1753341406583786,
+      "learning_rate": 1.772419420644635e-05,
+      "loss": 0.1785,
+      "step": 2790
+    },
+    {
+      "epoch": 1.142405386106294,
+      "grad_norm": 1.0875040292739868,
+      "learning_rate": 1.771603427172583e-05,
+      "loss": 0.1644,
+      "step": 2800
+    },
+    {
+      "epoch": 1.1464857696623483,
+      "grad_norm": 2.753896713256836,
+      "learning_rate": 1.7707874337005305e-05,
+      "loss": 0.1258,
+      "step": 2810
+    },
+    {
+      "epoch": 1.1505661532184026,
+      "grad_norm": 0.49547603726387024,
+      "learning_rate": 1.769971440228478e-05,
+      "loss": 0.0947,
+      "step": 2820
+    },
+    {
+      "epoch": 1.1546465367744567,
+      "grad_norm": 0.0,
+      "learning_rate": 1.769155446756426e-05,
+      "loss": 0.1007,
+      "step": 2830
+    },
+    {
+      "epoch": 1.158726920330511,
+      "grad_norm": 0.2678370773792267,
+      "learning_rate": 1.768339453284374e-05,
+      "loss": 0.0747,
+      "step": 2840
+    },
+    {
+      "epoch": 1.1628073038865654,
+      "grad_norm": 0.7357677817344666,
+      "learning_rate": 1.7675234598123218e-05,
+      "loss": 0.1259,
+      "step": 2850
+    },
+    {
+      "epoch": 1.1668876874426197,
+      "grad_norm": 1.2190055847167969,
+      "learning_rate": 1.7667074663402694e-05,
+      "loss": 0.1064,
+      "step": 2860
+    },
+    {
+      "epoch": 1.1709680709986738,
+      "grad_norm": 1.6289119720458984,
+      "learning_rate": 1.7658914728682173e-05,
+      "loss": 0.1711,
+      "step": 2870
+    },
+    {
+      "epoch": 1.1750484545547282,
+      "grad_norm": 0.5327509641647339,
+      "learning_rate": 1.7650754793961652e-05,
+      "loss": 0.0959,
+      "step": 2880
+    },
+    {
+      "epoch": 1.1791288381107825,
+      "grad_norm": 1.7999294996261597,
+      "learning_rate": 1.7642594859241128e-05,
+      "loss": 0.1594,
+      "step": 2890
+    },
+    {
+      "epoch": 1.1832092216668366,
+      "grad_norm": 1.19373619556427,
+      "learning_rate": 1.7634434924520604e-05,
+      "loss": 0.1787,
+      "step": 2900
+    }
+  ],
+  "logging_steps": 10,
+  "max_steps": 24510,
+  "num_input_tokens_seen": 0,
+  "num_train_epochs": 10,
+  "save_steps": 50,
+  "stateful_callbacks": {
+    "TrainerControl": {
+      "args": {
+        "should_epoch_stop": false,
+        "should_evaluate": false,
+        "should_log": false,
+        "should_save": true,
+        "should_training_stop": false
+      },
+      "attributes": {}
+    }
+  },
+  "total_flos": 1.415936759096148e+18,
+  "train_batch_size": 1,
+  "trial_name": null,
+  "trial_params": null
+}

prior_attributer_polymarket_4b_2900/training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f46457bf3c59baf106bc05ea602d8e6a8d87e1840bac5cf73b3e22b6eab825a9
+size 5496