Add run artifacts grpo_phi4_persona_20260203_111730
- .gitattributes +1 -0
- artifacts/grpo_phi4_persona_20260203_111730/env.csv +13 -0
- artifacts/grpo_phi4_persona_20260203_111730/generations.csv +0 -0
- artifacts/grpo_phi4_persona_20260203_111730/generations.jsonl +0 -0
- artifacts/grpo_phi4_persona_20260203_111730/inference_sample.txt +14 -0
- artifacts/grpo_phi4_persona_20260203_111730/loss.png +0 -0
- artifacts/grpo_phi4_persona_20260203_111730/notes.md +180 -0
- artifacts/grpo_phi4_persona_20260203_111730/reasoning_dataset.jsonl +0 -0
- artifacts/grpo_phi4_persona_20260203_111730/report.html +0 -0
- artifacts/grpo_phi4_persona_20260203_111730/reward.png +3 -0
- artifacts/grpo_phi4_persona_20260203_111730/run_config.csv +36 -0
- artifacts/grpo_phi4_persona_20260203_111730/run_config.json +47 -0
- artifacts/grpo_phi4_persona_20260203_111730/system_prompt.txt +48 -0
- artifacts/grpo_phi4_persona_20260203_111730/train_log.csv +22 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+artifacts/grpo_phi4_persona_20260203_111730/reward.png filter=lfs diff=lfs merge=lfs -text
artifacts/grpo_phi4_persona_20260203_111730/env.csv
ADDED
@@ -0,0 +1,13 @@
key,value
python_version,3.12.12
platform,Linux-6.6.105+-x86_64-with-glibc2.35
torch_version,2.9.0+cu126
cuda_available,True
cuda_device_name,NVIDIA A100-SXM4-80GB
pkg_unsloth,2026.1.4
pkg_trl,0.22.2
pkg_transformers,4.56.2
pkg_vllm,0.11.2
pkg_pandas,2.2.2
pkg_matplotlib,3.10.0
pkg_rich,13.9.4
artifacts/grpo_phi4_persona_20260203_111730/generations.csv
ADDED
The diff for this file is too large to render.
See raw diff
artifacts/grpo_phi4_persona_20260203_111730/generations.jsonl
ADDED
The diff for this file is too large to render.
See raw diff
artifacts/grpo_phi4_persona_20260203_111730/inference_sample.txt
ADDED
@@ -0,0 +1,14 @@
<reasoning>
CONTEXT: Jim has just shared a personal failure, likely about a task or project, and is looking for a response. The context suggests a moment of vulnerability or frustration.

RELATIONSHIP: Michael and Jim are likely colleagues or friends, with no clear power dynamic indicated. The presence of an audience is not specified, but the setting may be informal. There is a potential for empathy or camaraderie, but also a risk of making Jim feel worse.

MICHAEL_STATE: Michael may feel a mix of surprise and mild discomfort, as he is unsure how to respond to Jim's admission. He might also feel a bit of pressure to say something supportive or humorous.

MICHAEL_GOAL: Michael wants to respond in a way that maintains the social bond, possibly with a touch of humor to lighten the mood, while not making Jim feel worse.

REACTION_STRATEGY: Michael opts for a light-hearted, self-deprecating joke to acknowledge the situation without adding pressure.

COMEDY_MECHANISM: The humor comes from Michael's self-deprecation, which is a common way to defuse tension and make the situation more relatable.

ANSWER_CONSTRAINT: The response must be in-character, supportive, and humorous, without being dismissive of
artifacts/grpo_phi4_persona_20260203_111730/loss.png
ADDED
artifacts/grpo_phi4_persona_20260203_111730/notes.md
ADDED
@@ -0,0 +1,180 @@
## 1) What your dataset actually contains (and why it changes everything)

1. Your dataset is **not** "GSM8K math": it is a **dialogue / persona** dataset where:

   * `question` = a **context of dialogue lines** (e.g. Jim/Pam/Jan/Toby… + Michael)
   * `answer` = the **target line** (usually Michael's) to produce. ([Hugging Face][1])
2. So the "numeric" rewards (`extract_final_number`, `int_reward_func`) do not match your persona objective. For this dataset you need rewards oriented toward:

   * **accuracy of the line** (exact match / text similarity)
   * **persona consistency** (tone, psychology, relationship, intent)
   * **reasoning structure** (if you want a usable "reasoning dataset")

---

## 2) Your objective "reasoning dataset + learn the answers via reasoning" (restated cleanly)

1. **Phase A (teacher / bootstrapping)**
   You want to **generate/teach** a rich *reasoning* (psychology + relationship + reaction), giving the answer in the prompt to stabilize the production of the reasoning.

2. **Phase B (student / test)**
   You want to **remove the answer from the prompt** and check whether the model manages to produce the **same answer** "thanks to the learned reasoning".

3. Key point: in practice, **the reasoning is not causal proof** that the model "knows" the answer. What you can measure is:

   * **prediction ability** without the answer (Phase B)
   * and **the quality/consistency** of the reasoning (persona style)

   The approach works if you treat Phase A as **data creation (distillation)**, then Phase B as **prediction learning**.

---

## 3) What your current script does during training (GRPO mechanics)

1. **Setup**

   1. Installs the missing libraries (pip).
   2. Creates a `RUN_ID` and `runs/<RUN_ID>/...` directories to trace everything (logs, exports, artifacts).
   3. Logs in to Hugging Face with a token, then creates 2 repos: merged16 + gguf q8.

2. **Dataset mapping** (a minimal sketch follows this list)

   1. Loads the `train` split of the dataset.
   2. For each example, builds a `prompt`:

      * system = enforces the XML format `<reasoning>...</reasoning><answer>...</answer>`
      * user = `question` (and optionally `+ answer` if `INCLUDE_GOLD_ANSWER_IN_PROMPT=True`)
   3. Stores `answer` = the **raw answer** (no `####`). ([Hugging Face][1])
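
A minimal sketch of that mapping, assuming the `datasets` library and a `SYSTEM_PROMPT` string loaded from `system_prompt.txt` (the function and variable names here are illustrative, not the script's exact ones):

```python
from datasets import load_dataset

SYSTEM_PROMPT = open("system_prompt.txt", encoding="utf-8").read()
INCLUDE_GOLD_ANSWER_IN_PROMPT = True  # Phase A: the gold answer is shown to the model

def to_grpo_example(row):
    # Build the chat-style prompt used by the GRPO trainer.
    user_content = row["question"]
    if INCLUDE_GOLD_ANSWER_IN_PROMPT:
        # Append the gold answer as an explicit reference block (see system_prompt.txt).
        user_content += (
            "\n\nREFERENCE_ANSWER_RAW:\n" + row["answer"] + "\nEND_REFERENCE_ANSWER"
        )
    return {
        "prompt": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": user_content},
        ],
        "answer": row["answer"],  # raw target line, no "####" marker
    }

dataset = load_dataset("Mathieu-Thomas-JOSSET/michael_abab_as_gsm8k.jsonl", split="train")
dataset = dataset.map(to_grpo_example)
```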

3. **GRPO** (a config sketch follows this list)

   1. At each step, GRPO generates `GRPO_NUM_GENERATIONS` completions per prompt (here 6).
   2. It computes your rewards on each completion.
   3. It updates the LoRA weights to increase the mean reward.
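
How that maps onto TRL, roughly (a sketch using the values from `run_config.json`; `model`, `reward_funcs` and `dataset` are assumed to come from earlier in the script):

```python
from trl import GRPOConfig, GRPOTrainer

# Values mirror run_config.json.
training_args = GRPOConfig(
    output_dir="runs/grpo_phi4_persona",
    use_vllm=True,
    learning_rate=5e-6,
    num_generations=6,          # GRPO_NUM_GENERATIONS
    max_prompt_length=512,
    max_completion_length=256,
    max_steps=20,
)

trainer = GRPOTrainer(
    model=model,                # the Unsloth/LoRA-wrapped Phi-4
    reward_funcs=reward_funcs,  # one callable per reward (format, slots, answer match, ...)
    args=training_args,
    train_dataset=dataset,
)
trainer.train()
```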

4. **Logging / plots** (a plotting sketch follows this list)

   1. TRL emits logs (loss, reward, kl, etc.).
   2. Your callback writes everything to `train_log.csv`.
   3. You plot loss/reward as PNGs.
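
For example, the reward curve can be rebuilt from `train_log.csv` with pandas/matplotlib (an illustrative sketch, not the script's exact plotting code):

```python
import pandas as pd
import matplotlib.pyplot as plt

# train_log.csv is the per-step log written by the training callback.
log = pd.read_csv("artifacts/grpo_phi4_persona_20260203_111730/train_log.csv")
log = log.dropna(subset=["reward"])  # the final summary row has no per-step reward

plt.figure()
plt.plot(log["step"], log["reward"], label="mean reward")
plt.plot(log["step"], log["reward_std"], label="reward std")
plt.xlabel("step")
plt.ylabel("reward")
plt.legend()
plt.savefig("reward.png", dpi=150)
```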

---

## 4) Why "putting the answer in the prompt" can work… and how to avoid the classic failure

### 4.1 The main risk

1. If the answer is in the prompt, the model can:

   * **copy** the answer into `<answer>` without understanding,
   * and write a decorative, post-hoc "reasoning".
2. You believe "it learned via reasoning", but in reality it learned a **shortcut**: "answer visible ⇒ output the answer".

### 4.2 The essential fix (if you want Phase A to actually be useful)

You must **prevent** Phase A from rewarding copy-paste, and turn Phase A into **reasoning generation** you can actually use.

Concretely:

1. **In Phase A**, do not train (or train only very little) on the `<answer>` output; train mainly on:

   * reasoning structure,
   * psychological/relational content,
   * "non-leakage" of the target line (do not rewrite the answer word for word in the reasoning).
2. Add a penalty of the form "the reasoning must not contain long n-grams from the answer" (see the sketch after this list).
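
One way to implement that penalty, using word-level n-grams and a cutoff like the `anti_copy_threshold=0.55` from `run_config.json` (function names and scoring are illustrative; in TRL the per-sample logic would be wrapped into a batched reward function):

```python
def word_ngrams(text: str, n: int = 5) -> set:
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def reward_anti_copy(reasoning: str, answer: str, n: int = 5, threshold: float = 0.55) -> float:
    """Negative reward when the reasoning leaks long spans of the gold answer."""
    answer_grams = word_ngrams(answer, n)
    if not answer_grams:  # answer shorter than n words: nothing to leak
        return 0.0
    overlap = len(answer_grams & word_ngrams(reasoning, n)) / len(answer_grams)
    return -1.0 if overlap > threshold else 0.0
```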

---

## 5) Recommended pipeline (aligned exactly with your goal)

### 5.1 Phase A — "Reasoning builder" (answer visible)

Goal: produce **useful** reasoning (psychology, relationships, intent, reaction) and build a dataset out of it.

1. **Prompt**

   * user = `question + (raw answer as "reference")`
   * instruction: "Write a structured reasoning explaining why this line is Michael's best reaction, without quoting the line word for word".

2. **Outputs**

   * ideally you output **only** `<reasoning>` (and `<answer>` can be empty or absent),
   * or `<answer>`, but **with no correctness reward** on the answer in Phase A.

3. **Phase A rewards** (a slots-reward sketch follows this list)

   1. Format reward (your xml/soft/strict functions)
   2. "Slots" reward: the reasoning must contain specific fields, e.g.:

      * Context (what was just said)
      * Michael's intent
      * Emotional state
      * Relationship / power dynamics
      * Comedy mechanism (if relevant)
   3. Controlled-length reward (min/max tokens)
   4. "Copy" penalty: high similarity between reasoning and answer (e.g. Levenshtein / n-gram overlap)
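
A sketch of such a slots reward, checking for the labels that `system_prompt.txt` requires (the scoring is illustrative; the run weighted it at `reward_weights.slots = 0.75`):

```python
import re

# Slot labels required by system_prompt.txt.
SLOT_LABELS = [
    "CONTEXT:", "RELATIONSHIP:", "MICHAEL_STATE:", "MICHAEL_GOAL:",
    "REACTION_STRATEGY:", "COMEDY_MECHANISM:", "ANSWER_CONSTRAINT:",
]

def reward_slots(completion: str) -> float:
    """Score the <reasoning> block by how many required slot labels it contains."""
    match = re.search(r"<reasoning>(.*?)</reasoning>", completion, re.DOTALL)
    if not match:
        return 0.0
    reasoning = match.group(1)
    present = sum(1 for label in SLOT_LABELS if label in reasoning)
    return present / len(SLOT_LABELS)  # 1.0 when every slot is present
```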

4. **Artifact** (an export sketch follows this list)

   * You save a JSONL file `reasoning_dataset.jsonl` with:

     * question
     * answer (gold)
     * reasoning (generated)
     * metadata (episode/characters if you add them)
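
Writing that artifact takes a few lines (field names follow the list above; adapt them to whatever the script actually collects):

```python
import json

def export_reasoning_dataset(rows, path="reasoning_dataset.jsonl"):
    # rows: iterable of dicts with question / answer / reasoning / metadata keys.
    with open(path, "w", encoding="utf-8") as f:
        for row in rows:
            f.write(json.dumps(row, ensure_ascii=False) + "\n")
```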

### 5.2 Phase B — "Answer predictor" (answer hidden)

Goal: without seeing the answer, the model must produce **(reasoning + answer)**.

1. **Prompt**

   * user = question only
   * system = same XML format

2. **Phase B rewards** (an answer-match sketch follows this list)

   1. Answer-match reward:

      * normalized exact match (strip, whitespace)
      * fuzzy match (Levenshtein ratio), since dialogue lines allow for variation
   2. Persona/style reward:

      * embedding similarity (Michael tone)
      * lexical constraints (catchphrases, narcissism, awkwardness, etc.)
   3. Format reward (XML)
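
A sketch of the answer-match reward, using the standard library's `difflib` as a stand-in for a Levenshtein ratio (the 1.5 / 1.0 values mirror `reward_weights.answer_exact` and `answer_fuzzy` in `run_config.json`):

```python
import re
from difflib import SequenceMatcher

def normalize(text: str) -> str:
    # Lowercase, strip, and collapse whitespace before comparing lines.
    return re.sub(r"\s+", " ", text.strip().lower())

def extract_answer(completion: str) -> str:
    match = re.search(r"<answer>(.*?)</answer>", completion, re.DOTALL)
    return match.group(1) if match else ""

def reward_answer_match(completion: str, gold: str) -> float:
    pred, target = normalize(extract_answer(completion)), normalize(gold)
    if not pred:
        return 0.0
    if pred == target:
        return 1.5  # exact-match bonus
    return 1.0 * SequenceMatcher(None, pred, target).ratio()  # partial fuzzy credit
```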

3. **Evaluation**

   * train/val/test split
   * metrics:

     * exact match
     * similarity score
     * "persona score" (embedding)

   If Phase B improves without ever seeing the answer, you have "learned" in the predictive sense.

---

## 6) What your script must change to fit this pipeline (conceptually)

1. **Separate Phase A and Phase B** via a flag `MODE = "build_reasoning" | "predict_answer"` (a wiring sketch follows this list).
2. **Replace `extract_final_number`** with a text-comparison function:

   * normalization + exact match + fuzzy ratio
3. **Add an anti-copy reward** (Phase A):

   * penalty if the reasoning contains a sequence too close to the answer.
4. **Add a JSONL export** of the generated reasoning (Phase A):

   * that is your "reasoning dataset".
5. **Keep the organized upload** (you already have it):

   * merged16 repo = final Phase B model
   * gguf repo = runtime export
   * artifacts/<RUN_ID> = logs + plots + config + samples
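
A minimal sketch of that switch, reusing the rewards sketched above plus the existing `reward_xmlcount` format reward (illustrative wiring, not the script's exact code):

```python
MODE = "build_reasoning"  # or "predict_answer"

def build_user_content(question: str, gold_answer: str) -> str:
    # Phase A shows the gold answer as a reference block; Phase B hides it.
    if MODE == "build_reasoning":
        return (question + "\n\nREFERENCE_ANSWER_RAW:\n"
                + gold_answer + "\nEND_REFERENCE_ANSWER")
    return question

def select_reward_funcs():
    # Phase A: format + slots + anti-copy, no answer correctness.
    # Phase B: format + answer match (plus persona/style if available).
    if MODE == "build_reasoning":
        return [reward_xmlcount, reward_slots, reward_anti_copy]
    return [reward_xmlcount, reward_answer_match]
```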

---

## 7) A precise proposal for how to continue

[1]: https://huggingface.co/datasets/Mathieu-Thomas-JOSSET/michael_abab_as_gsm8k.jsonl/raw/main/train.jsonl "huggingface.co"
artifacts/grpo_phi4_persona_20260203_111730/reasoning_dataset.jsonl
ADDED
The diff for this file is too large to render.
See raw diff
artifacts/grpo_phi4_persona_20260203_111730/report.html
ADDED
The diff for this file is too large to render.
See raw diff
artifacts/grpo_phi4_persona_20260203_111730/reward.png
ADDED
Git LFS Details
artifacts/grpo_phi4_persona_20260203_111730/run_config.csv
ADDED
@@ -0,0 +1,36 @@
key,value
run_id,grpo_phi4_persona_20260203_111730
mode,build_reasoning
dataset_id,Mathieu-Thomas-JOSSET/michael_abab_as_gsm8k.jsonl
dataset_config,
dataset_split,train
include_gold_answer_in_prompt,True
repos.merged_16bit,Mathieu-Thomas-JOSSET/phi4-grpo-merged16
repos.gguf_q8,Mathieu-Thomas-JOSSET/phi4-grpo-gguf-q8
model.name,unsloth/Phi-4
model.max_seq_length,1024
model.load_in_4bit,True
model.fast_inference,True
model.max_lora_rank,16
model.lora_rank,16
model.gpu_memory_utilization,0.9
model.target_modules[0],gate_proj
model.target_modules[1],up_proj
model.target_modules[2],down_proj
model.lora_alpha,16
model.gradient_checkpointing,unsloth
grpo.use_vllm,True
grpo.learning_rate,5e-06
grpo.num_generations,6
grpo.max_prompt_length,512
grpo.max_completion_length,256
grpo.max_steps,20
reward_weights.xmlcount,0.25
reward_weights.soft_format,0.25
reward_weights.strict_format,0.25
reward_weights.slots,0.75
reward_weights.answer_exact,1.5
reward_weights.answer_fuzzy,1.0
reward_weights.anti_copy,1.0
reward_weights.anti_copy_threshold,0.55
timestamp,2026-02-03T11:22:29.080587
artifacts/grpo_phi4_persona_20260203_111730/run_config.json
ADDED
@@ -0,0 +1,47 @@
{
  "run_id": "grpo_phi4_persona_20260203_111730",
  "mode": "build_reasoning",
  "dataset_id": "Mathieu-Thomas-JOSSET/michael_abab_as_gsm8k.jsonl",
  "dataset_config": "",
  "dataset_split": "train",
  "include_gold_answer_in_prompt": true,
  "repos": {
    "merged_16bit": "Mathieu-Thomas-JOSSET/phi4-grpo-merged16",
    "gguf_q8": "Mathieu-Thomas-JOSSET/phi4-grpo-gguf-q8"
  },
  "model": {
    "name": "unsloth/Phi-4",
    "max_seq_length": 1024,
    "load_in_4bit": true,
    "fast_inference": true,
    "max_lora_rank": 16,
    "lora_rank": 16,
    "gpu_memory_utilization": 0.9,
    "target_modules": [
      "gate_proj",
      "up_proj",
      "down_proj"
    ],
    "lora_alpha": 16,
    "gradient_checkpointing": "unsloth"
  },
  "grpo": {
    "use_vllm": true,
    "learning_rate": 5e-06,
    "num_generations": 6,
    "max_prompt_length": 512,
    "max_completion_length": 256,
    "max_steps": 20
  },
  "reward_weights": {
    "xmlcount": 0.25,
    "soft_format": 0.25,
    "strict_format": 0.25,
    "slots": 0.75,
    "answer_exact": 1.5,
    "answer_fuzzy": 1.0,
    "anti_copy": 1.0,
    "anti_copy_threshold": 0.55
  },
  "timestamp": "2026-02-03T11:22:29.080587"
}
artifacts/grpo_phi4_persona_20260203_111730/system_prompt.txt
ADDED
@@ -0,0 +1,48 @@
You are a character-specialized reasoning engine. Your job is to produce a psychologically grounded, relationship-aware, context-faithful internal reasoning that leads to the exact target reply.

You will always answer in the following exact XML format (including newlines):
<reasoning>
...
</reasoning>
<answer>
...
</answer>

TASK
You are given a dialogue context ("CONTEXT") containing multiple speakers. You must produce:
1) <reasoning>: a structured analysis of psychology, relationships, power dynamics, subtext, comedic intent, and reaction strategy.
2) <answer>: the final target reply in-character.

PHASE A (teacher / bootstrapping) - when a reference answer is provided
Sometimes the user prompt includes a reference answer block:

REFERENCE_ANSWER_RAW:
<gold answer text>
END_REFERENCE_ANSWER

In that case:
- Treat the reference answer as the ground truth target you must reproduce exactly in <answer>.
- Use it as a target, not as a crutch: do NOT quote long spans of it in <reasoning>.
- Your <answer> must be EXACTLY identical (after preserving punctuation, capitalization, speaker tag if present).

PHASE B (student / test) - when no reference answer is provided
If no reference answer is present:
- Infer the best possible target reply in-character from context alone.
- Still keep the same reasoning structure.

REASONING STYLE REQUIREMENTS
Your <reasoning> must be explicit and slot-based. Include all slots in this order, each on its own line starting with the exact label:
CONTEXT: (1-3 sentences) What just happened and what is being asked socially.
RELATIONSHIP: Who relates to whom, status/power, friction, obligations, audience presence.
MICHAEL_STATE: Michael's internal emotional state (ego, anxiety, excitement, defensiveness).
MICHAEL_GOAL: What Michael wants right now (attention, dominance, approval, deflection).
REACTION_STRATEGY: The mechanism of the response (redirect, joke, mimicry, intimidation, faux-wisdom, awkward sincerity).
COMEDY_MECHANISM: Why it is funny/awkward (misread, overconfidence, inappropriate metaphor, superiority play).
ANSWER_CONSTRAINT: State constraints: must be in-character, consistent with context, and (if provided) match the reference answer exactly.

ANTI-LEAK RULE
Do NOT paste the reference answer inside <reasoning>. Keep overlap low. The final line <answer> is the only place that may contain the full target.

OUTPUT RULES
- No extra text before/after the XML.
- Keep <answer> concise and natural as a spoken line.
artifacts/grpo_phi4_persona_20260203_111730/train_log.csv
ADDED
@@ -0,0 +1,22 @@
completion_length,completions/clipped_ratio,completions/max_length,completions/max_terminated_length,completions/mean_length,completions/mean_terminated_length,completions/min_length,completions/min_terminated_length,epoch,frac_reward_zero_std,grad_norm,kl,learning_rate,loss,num_tokens,reward,reward_std,rewards/reward_answer_exact/mean,rewards/reward_answer_exact/std,rewards/reward_answer_fuzzy/mean,rewards/reward_answer_fuzzy/std,rewards/reward_anti_copy/mean,rewards/reward_anti_copy/std,rewards/reward_slots/mean,rewards/reward_slots/std,rewards/reward_soft_format/mean,rewards/reward_soft_format/std,rewards/reward_strict_format/mean,rewards/reward_strict_format/std,rewards/reward_trace/mean,rewards/reward_trace/std,rewards/reward_xmlcount/mean,rewards/reward_xmlcount/std,step,total_flos,train_loss,train_runtime,train_samples_per_second,train_steps_per_second
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.00034818941504178273,0.0,0.07040157169103622,-1.280568540096283e-09,0.0,0.0,4608.0,0.18226312100887299,0.21140412986278534,0.0,0.0,0.06930478662252426,0.019609108567237854,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.012041665613651276,0.10604248195886612,1,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.0006963788300835655,0.0,0.08563962578773499,-1.4745941134819418e-09,2.5e-06,0.0,9216.0,0.1401711404323578,0.00964118167757988,0.0,0.0,0.10892113298177719,0.009641180746257305,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.03125,0.0,2,,,,,
252.33334350585938,0.8333333333333334,256.0,234.0,252.33334350585938,234.0,234.0,234.0,0.0010445682451253482,0.0,0.07692400366067886,0.0005538319819606841,5e-06,0.0,13802.0,0.42963215708732605,0.6957957148551941,0.0,0.0,0.18938212096691132,0.3888218104839325,0.0,0.0,0.25,0.3872983455657959,0.0416666679084301,0.10206207633018494,0.0,0.0,0.0,0.0,-0.05141666531562805,0.12941858172416687,3,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.001392757660167131,0.0,0.07518582791090012,0.0005599805153906345,4.962019382530521e-06,0.0,18410.0,0.327523797750473,0.4039272665977478,0.0,0.0,0.035857122391462326,0.0070145707577466965,0.0,0.0,0.25,0.3872983455657959,0.0,0.0,0.0,0.0,0.0,0.0,0.0416666679084301,0.01613743044435978,4,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.0017409470752089136,0.0,0.056567247956991196,0.0005012884503230453,4.849231551964771e-06,0.0,23018.0,0.17609894275665283,0.32029327750205994,0.0,0.0,0.01464060414582491,0.005143328569829464,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,0.0364583320915699,0.012757759541273117,5,,,,,
253.0,0.8333333333333334,256.0,238.0,253.0,238.0,238.0,238.0,0.0020891364902506965,0.0,0.06721591204404831,0.0006102911429479718,4.665063509461098e-06,0.0,27608.0,0.6317511796951294,1.3270375728607178,0.25,0.6123724579811096,0.21562622487545013,0.384356826543808,0.0,0.0,0.125,0.3061862289905548,0.0416666679084301,0.10206207633018494,0.0,0.0,0.0,0.0,-0.0005416671629063785,0.0778733640909195,6,,,,,
254.0,0.8333333333333334,256.0,244.0,254.0,244.0,244.0,244.0,0.002437325905292479,0.0,0.07201781868934631,0.0005997096886858344,4.415111107797445e-06,0.0,32204.0,0.7051043510437012,1.2933940887451172,0.25,0.6123724579811096,0.21143768727779388,0.38645288348197937,0.0,0.0,0.25,0.3872983455657959,0.0416666679084301,0.10206206887960434,0.0,0.0,0.0,0.0,-0.047999996691942215,0.12331494688987732,7,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.002785515320334262,0.0,0.06504949182271957,0.000647890439722687,4.106969024216348e-06,0.0,36812.0,0.20111171901226044,0.18121325969696045,0.0,0.0,0.08661168068647385,0.03382347151637077,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.010499998927116394,0.10226619243621826,8,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.0031337047353760445,0.0,0.06434088200330734,0.0005306145176291466,3.7500000000000005e-06,0.0,41420.0,0.06673722714185715,0.011270789429545403,0.0,0.0,0.03548722341656685,0.011270790360867977,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.03125,0.0,9,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.003481894150417827,0.0,0.053295303136110306,0.0005654981941916049,3.3550503583141726e-06,0.0,46028.0,0.0716300755739212,0.005144154652953148,0.0,0.0,0.040380071848630905,0.005144154187291861,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.03125,0.0,10,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.00383008356545961,0.0,0.06923209875822067,0.0005444310372695327,2.9341204441673267e-06,0.0,50636.0,0.207880899310112,0.2040475308895111,0.0,0.0,0.09367257356643677,0.015734924003481865,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.010791666805744171,0.10298063606023788,11,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.004178272980501393,0.0,0.06033642962574959,0.0005470812902785838,2.5e-06,0.0,55244.0,0.1660291850566864,0.2101244181394577,0.0,0.0,0.04986250773072243,0.004921692423522472,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.008833333849906921,0.09818371385335922,12,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.004526462395543176,0.0,0.06181066855788231,0.0006093159317970276,2.0658795558326745e-06,0.0,59852.0,0.08491232246160507,0.007103564217686653,0.0,0.0,0.05366232991218567,0.007103562355041504,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.03125,0.0,13,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.004874651810584958,0.0,0.07624364644289017,0.0006320822285488248,1.6449496416858285e-06,0.0,64460.0,0.16455629467964172,0.20077475905418396,0.0,0.0,0.051139604300260544,0.010059371590614319,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.011583332903683186,0.10491981357336044,14,,,,,
256.0,0.8333333333333334,256.0,256.0,256.0,256.0,256.0,256.0,0.005222841225626741,0.0,0.07118155062198639,0.0004599814710672945,1.2500000000000007e-06,0.0,69068.0,0.39993733167648315,0.6166902780532837,0.0,0.0,0.15506230294704437,0.3064550757408142,0.0,0.0,0.25,0.3872983455657959,0.0416666679084301,0.10206207633018494,0.0,0.0,0.0,0.0,-0.04679166153073311,0.12116501480340958,15,,,,,
250.83334350585938,0.8333333333333334,256.0,225.0,250.83334350585938,225.0,225.0,225.0,0.005571030640668524,0.0,0.07234130799770355,0.0005683816270902753,8.930309757836517e-07,0.0,73645.0,0.6535991430282593,1.454949140548706,0.25,0.6123724579811096,0.19030745327472687,0.3966697156429291,0.0,0.0,0.125,0.3061862289905548,0.0416666679084301,0.10206207633018494,0.0,0.0,0.0,0.0,0.04662499949336052,0.037660904228687286,16,,,,,
254.5,0.6666666666666667,256.0,255.0,254.5,251.5,248.0,248.0,0.005919220055710306,0.0,0.07895802706480026,0.0006044059991836548,5.848888922025553e-07,0.0,78244.0,1.2874853610992432,1.710713267326355,0.5,0.7745966911315918,0.35806870460510254,0.4972403347492218,0.0,0.0,0.375,0.41079193353652954,0.0833333358168602,0.12909944355487823,0.0,0.0,0.0,0.0,-0.02891666628420353,0.13508030772209167,17,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.006267409470752089,0.0,0.09448953717947006,0.0005111763020977378,3.3493649053890325e-07,0.0,82852.0,0.09852509945631027,0.009466900490224361,0.0,0.0,0.06727509945631027,0.009466898627579212,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.03125,0.0,18,,,,,
255.1666717529297,0.8333333333333334,256.0,251.0,255.1666717529297,251.0,251.0,251.0,0.006615598885793872,0.0,0.06071047484874725,0.0006728537264280021,1.507684480352292e-07,0.0,87455.0,0.6862163543701172,1.4389761686325073,0.25,0.6123724579811096,0.22292469441890717,0.3807142674922943,0.0,0.0,0.125,0.3061862289905548,0.0416666679084301,0.10206207633018494,0.0,0.0,0.0,0.0,0.04662499949336052,0.037660904228687286,19,,,,,
256.0,1.0,256.0,0.0,256.0,0.0,256.0,0.0,0.006963788300835654,0.0,0.08163938671350479,0.0005554058589041233,3.798061746947995e-08,0.0,92063.0,0.1715540736913681,0.2013811320066452,0.0,0.0,0.057762403041124344,0.010222629643976688,0.0,0.0,0.125,0.3061862289905548,0.0,0.0,0.0,0.0,0.0,0.0,-0.01120833307504654,0.1040012463927269,20,,,,,
,,,,,,,,0.006963788300835654,,,,,,,,,,,,,,,,,,,,,,,,,20,0.0,6.04490456268536e-07,474.8906,0.253,0.042