darkolorin committed · Commit 68df8f3 · verified · 1 Parent(s): 099c1a7

Upload three-tier cascaded coding router v3

README.md ADDED
@@ -0,0 +1,67 @@
+ ---
+ license: apache-2.0
+ tags:
+ - routing
+ - code
+ - mlx
+ - pid
+ - cascade
+ library_name: mlx
+ ---
+
+ # Vibe Coding Router
+
+ A three-tier cascaded router for coding tasks that routes prompts between:
+
+ - **Local**: Qwen3-Coder-Next (80B/3B active MoE, on-device via MLX)
+ - **Sonnet**: Claude Sonnet 4.6 (medium-complexity cloud)
+ - **Opus**: Claude Opus 4.6 (max-capability cloud)
+
+ ## Architecture
+
+ Two cascaded binary MLP routers trained with **Privileged Information Distillation (PID)**:
+
+ - **Router A** (local vs cloud): 70-dim input -> [128, 64] -> 1, dropout=0.2
+ - **Router B** (sonnet vs opus): 70-dim input -> [64, 32] -> 1, dropout=0.2
+
+ Features: 38 handcrafted code features + 32 PCA-reduced sentence embeddings (all-MiniLM-L6-v2).
+
+ ## Training
+
+ - **Router A**: 100 samples with real (local, sonnet, opus) quality scores
+ - **Router B**: 1,729 samples (100 main + 1,644 cloud-only with sonnet+opus scores)
+ - **Judge**: GPT-5.4 scoring correctness, completeness, code quality, explanation
+ - **Loss**: PID (reward-weighted CE + KL divergence)
+ - **Label smoothing**: epsilon=0.05, cost-aware margin for Router B (cost_premium=0.03)
+ - **HP sweep**: 108 configurations, 3-way split (train/val/test)
+
+ ## Routing Distribution
+
+ | Tier | Rate | Use Case |
+ |------|------|----------|
+ | Local | 46.7% | Simple tasks, explanations, basic code gen |
+ | Sonnet | 20.0% | Medium complexity, standard debugging |
+ | Opus | 33.3% | Architecture, complex multi-file tasks |
+
+ ## Thresholds
+
+ - Router A: 0.526 (p(cloud) >= threshold -> route to cloud)
+ - Router B: 0.474 (p(opus) >= threshold -> route to Opus, else Sonnet)
+
+ ## Files
+
+ - `router_a.safetensors` - Router A weights (128x64 MLP)
+ - `router_b.safetensors` - Router B weights (64x32 MLP)
+ - `config.json` - Model config, thresholds, training results
+ - `scaler.pkl` - StandardScaler for feature normalization
+ - `embedding_extractor.pkl` - PCA-reduced sentence-transformers extractor
+
+ ## Usage
+
+ ```python
+ from router.three_tier_inference import ThreeTierRouter
+
+ router = ThreeTierRouter("models/three_tier_v3")
+ tier, probs = router.route("Write a Python function to sort a list")
+ # tier: "local", "sonnet", or "opus"
+ ```
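The two thresholds in the README suggest a simple decision rule, which the following sketch illustrates. It is not the code of `ThreeTierRouter`: the helper `cascade_route` and its arguments are hypothetical, and it assumes Router B's output only matters once Router A has chosen the cloud, which is how "cascaded" is read here. The threshold constants are the rounded README values.

```python
from typing import Literal

Tier = Literal["local", "sonnet", "opus"]

# Rounded thresholds from the README; config.json stores full-precision values.
THRESHOLD_A = 0.526  # p(cloud) >= THRESHOLD_A -> route to cloud
THRESHOLD_B = 0.474  # p(opus) >= THRESHOLD_B -> route to Opus, else Sonnet


def cascade_route(p_cloud: float, p_opus: float) -> Tier:
    """Hypothetical helper: map the two router probabilities to a tier."""
    # Stage 1: Router A decides between local and cloud.
    if p_cloud < THRESHOLD_A:
        return "local"
    # Stage 2: Router B is consulted only for cloud-bound prompts (assumption).
    if p_opus >= THRESHOLD_B:
        return "opus"
    return "sonnet"
```

Under this reading, a prompt with `p_cloud` below 0.526 stays local regardless of what Router B would have predicted.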
config.json ADDED
@@ -0,0 +1,53 @@
+ {
+   "input_dim": 70,
+   "hidden_dims_a": [
+     128,
+     64
+   ],
+   "hidden_dims_b": [
+     64,
+     32
+   ],
+   "dropout_a": 0.2,
+   "dropout_b": 0.2,
+   "threshold_a": 0.5257894736842105,
+   "threshold_b": 0.47421052631578947,
+   "loss_a": "PID",
+   "loss_b": "PID",
+   "hp_a": {
+     "hidden_dims": [
+       128,
+       64
+     ],
+     "lr": 0.0003,
+     "beta_kl": 0.1,
+     "dropout": 0.2,
+     "weight_decay": 0.0001,
+     "use_pid_loss": true
+   },
+   "hp_b": {
+     "hidden_dims": [
+       64,
+       32
+     ],
+     "lr": 0.003,
+     "beta_kl": 0.02,
+     "dropout": 0.2,
+     "weight_decay": 0.0001,
+     "use_pid_loss": true
+   },
+   "local_model": "Qwen/Qwen3-Coder-Next",
+   "sonnet_model": "claude-sonnet-4-6",
+   "opus_model": "claude-opus-4-6",
+   "n_train": 69,
+   "n_val": 16,
+   "n_test": 15,
+   "test_results": {
+     "local_rate": 0.4666666666666667,
+     "sonnet_rate": 0.2,
+     "opus_rate": 0.3333333333333333,
+     "utility": 0.45333337783813477,
+     "oracle_utility": 0.530666708946228,
+     "regret": 0.07733333110809326
+   }
+ }
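The numbers in `config.json` can be cross-checked against each other. The sketch below copies the relevant values into a dict (rather than reading the file) and uses a hypothetical helper, `summarize_config`, to recover the split size, the per-tier counts on the test set, and the gap between oracle and achieved utility. Treating `regret` as `oracle_utility - utility` is an assumption inferred from the stored values, not something the commit states.

```python
import math

# Values copied from config.json in this commit.
config = {
    "n_train": 69,
    "n_val": 16,
    "n_test": 15,
    "test_results": {
        "local_rate": 0.4666666666666667,
        "sonnet_rate": 0.2,
        "opus_rate": 0.3333333333333333,
        "utility": 0.45333337783813477,
        "oracle_utility": 0.530666708946228,
        "regret": 0.07733333110809326,
    },
}


def summarize_config(cfg: dict) -> dict:
    """Hypothetical helper: derive sanity-check quantities from the config."""
    res = cfg["test_results"]
    n_test = cfg["n_test"]
    return {
        # Total number of samples across the three splits.
        "n_total": cfg["n_train"] + cfg["n_val"] + n_test,
        # Routing rates converted back to counts on the test split.
        "test_counts": {
            tier: round(res[f"{tier}_rate"] * n_test)
            for tier in ("local", "sonnet", "opus")
        },
        # Assumed definition: regret = oracle utility minus achieved utility.
        "utility_gap": res["oracle_utility"] - res["utility"],
    }


summary = summarize_config(config)
gap_matches = math.isclose(
    summary["utility_gap"], config["test_results"]["regret"], abs_tol=1e-9
)
```

The split sizes add up to the 100 samples the README gives for Router A, and the routing rates correspond to whole-number counts on a 15-sample test set, so the reported percentages rest on very few examples.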
embedding_extractor.pkl ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:90f00e4c7a8e4fda09388709a4e8b5bf5d86c0e054763ee968bb84646becefed
+ size 51814
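The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. As a small illustration, the sketch below parses such a pointer into a dict. The function name `parse_lfs_pointer` is hypothetical, and the input is the pointer text without the leading `+` diff markers.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Hypothetical helper: parse a Git LFS pointer file into its fields."""
    # Each non-empty line has the form "<key> <value>".
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    # The oid is written as "<hash algorithm>:<hex digest>".
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer for embedding_extractor.pkl, copied from this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:90f00e4c7a8e4fda09388709a4e8b5bf5d86c0e054763ee968bb84646becefed
size 51814
"""

pointer = parse_lfs_pointer(pointer_text)
```

The `size` field is what allows comparing the stored artifacts against expectations without downloading them.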
router_a.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f666061f45051592c16d68488b381a0ee48b9b6a2f42d3982c126b4d6207f221
+ size 71981
router_b.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f6cc33c2741c9935720d3a4a3ffaccf728e445ea3bba866236330e522fba0a82
+ size 28194
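The two weight files are small, and their sizes can be compared with what the README's layer shapes imply. The sketch below counts the parameters of a plain fully connected stack (70 inputs, the listed hidden sizes, one output) and converts that to bytes. It assumes `Linear` layers with biases stored as float32, which the commit does not confirm. The helper `mlp_param_count` is hypothetical.

```python
def mlp_param_count(input_dim: int, hidden_dims: list[int], output_dim: int = 1) -> int:
    """Hypothetical helper: weights plus biases of a plain dense MLP."""
    dims = [input_dim, *hidden_dims, output_dim]
    # Each layer contributes d_in * d_out weights and d_out biases.
    return sum(d_in * d_out + d_out for d_in, d_out in zip(dims, dims[1:]))


BYTES_PER_FLOAT32 = 4  # assumption: weights are stored as float32

params_a = mlp_param_count(70, [128, 64])  # Router A: 17409 parameters
params_b = mlp_param_count(70, [64, 32])   # Router B: 6657 parameters

bytes_a = params_a * BYTES_PER_FLOAT32  # 69636, versus a 71981-byte file
bytes_b = params_b * BYTES_PER_FLOAT32  # 26628, versus a 28194-byte file
```

Both estimates fall a little under the sizes in the LFS pointers. The remainder would be the safetensors header plus any tensors the README does not describe, such as normalization layers. This sketch cannot tell which.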
scaler.pkl ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:28e2921c4df74bdb930506fae642a85dc21ba07c014abba7bfdeb023cd1849eb
+ size 2088
sweep_results.json ADDED
@@ -0,0 +1,3516 @@
1
+ {
2
+ "router_a": [
3
+ {
4
+ "hp": {
5
+ "hidden_dims": [
6
+ 32,
7
+ 16
8
+ ],
9
+ "lr": 0.003,
10
+ "beta_kl": 0.02,
11
+ "dropout": 0.0,
12
+ "weight_decay": 0.0001,
13
+ "use_pid_loss": true
14
+ },
15
+ "val_loss": 0.09526831656694412,
16
+ "stopped_epoch": 103,
17
+ "time_s": 0.3511964589706622
18
+ },
19
+ {
20
+ "hp": {
21
+ "hidden_dims": [
22
+ 32,
23
+ 16
24
+ ],
25
+ "lr": 0.003,
26
+ "beta_kl": 0.02,
27
+ "dropout": 0.1,
28
+ "weight_decay": 0.0001,
29
+ "use_pid_loss": true
30
+ },
31
+ "val_loss": 0.09706660360097885,
32
+ "stopped_epoch": 125,
33
+ "time_s": 0.3766482920036651
34
+ },
35
+ {
36
+ "hp": {
37
+ "hidden_dims": [
38
+ 32,
39
+ 16
40
+ ],
41
+ "lr": 0.003,
42
+ "beta_kl": 0.02,
43
+ "dropout": 0.2,
44
+ "weight_decay": 0.0001,
45
+ "use_pid_loss": true
46
+ },
47
+ "val_loss": 0.10793537646532059,
48
+ "stopped_epoch": 107,
49
+ "time_s": 0.3281740410020575
50
+ },
51
+ {
52
+ "hp": {
53
+ "hidden_dims": [
54
+ 32,
55
+ 16
56
+ ],
57
+ "lr": 0.003,
58
+ "beta_kl": 0.05,
59
+ "dropout": 0.0,
60
+ "weight_decay": 0.0001,
61
+ "use_pid_loss": true
62
+ },
63
+ "val_loss": 0.10472729802131653,
64
+ "stopped_epoch": 149,
65
+ "time_s": 0.4166310830041766
66
+ },
67
+ {
68
+ "hp": {
69
+ "hidden_dims": [
70
+ 32,
71
+ 16
72
+ ],
73
+ "lr": 0.003,
74
+ "beta_kl": 0.05,
75
+ "dropout": 0.1,
76
+ "weight_decay": 0.0001,
77
+ "use_pid_loss": true
78
+ },
79
+ "val_loss": 0.10306401550769806,
80
+ "stopped_epoch": 103,
81
+ "time_s": 0.3018436249694787
82
+ },
83
+ {
84
+ "hp": {
85
+ "hidden_dims": [
86
+ 32,
87
+ 16
88
+ ],
89
+ "lr": 0.003,
90
+ "beta_kl": 0.05,
91
+ "dropout": 0.2,
92
+ "weight_decay": 0.0001,
93
+ "use_pid_loss": true
94
+ },
95
+ "val_loss": 0.10205923020839691,
96
+ "stopped_epoch": 105,
97
+ "time_s": 0.3144488339894451
98
+ },
99
+ {
100
+ "hp": {
101
+ "hidden_dims": [
102
+ 32,
103
+ 16
104
+ ],
105
+ "lr": 0.003,
106
+ "beta_kl": 0.1,
107
+ "dropout": 0.0,
108
+ "weight_decay": 0.0001,
109
+ "use_pid_loss": true
110
+ },
111
+ "val_loss": 0.09813252091407776,
112
+ "stopped_epoch": 105,
113
+ "time_s": 0.3057361250394024
114
+ },
115
+ {
116
+ "hp": {
117
+ "hidden_dims": [
118
+ 32,
119
+ 16
120
+ ],
121
+ "lr": 0.003,
122
+ "beta_kl": 0.1,
123
+ "dropout": 0.1,
124
+ "weight_decay": 0.0001,
125
+ "use_pid_loss": true
126
+ },
127
+ "val_loss": 0.10203106701374054,
128
+ "stopped_epoch": 105,
129
+ "time_s": 0.3247596659930423
130
+ },
131
+ {
132
+ "hp": {
133
+ "hidden_dims": [
134
+ 32,
135
+ 16
136
+ ],
137
+ "lr": 0.003,
138
+ "beta_kl": 0.1,
139
+ "dropout": 0.2,
140
+ "weight_decay": 0.0001,
141
+ "use_pid_loss": true
142
+ },
143
+ "val_loss": 0.09873931109905243,
144
+ "stopped_epoch": 111,
145
+ "time_s": 0.3362587080337107
146
+ },
147
+ {
148
+ "hp": {
149
+ "hidden_dims": [
150
+ 32,
151
+ 16
152
+ ],
153
+ "lr": 0.001,
154
+ "beta_kl": 0.02,
155
+ "dropout": 0.0,
156
+ "weight_decay": 0.0001,
157
+ "use_pid_loss": true
158
+ },
159
+ "val_loss": 0.09603578597307205,
160
+ "stopped_epoch": 180,
161
+ "time_s": 0.5150102499756031
162
+ },
163
+ {
164
+ "hp": {
165
+ "hidden_dims": [
166
+ 32,
167
+ 16
168
+ ],
169
+ "lr": 0.001,
170
+ "beta_kl": 0.02,
171
+ "dropout": 0.1,
172
+ "weight_decay": 0.0001,
173
+ "use_pid_loss": true
174
+ },
175
+ "val_loss": 0.10240816324949265,
176
+ "stopped_epoch": 135,
177
+ "time_s": 0.3959702500142157
178
+ },
179
+ {
180
+ "hp": {
181
+ "hidden_dims": [
182
+ 32,
183
+ 16
184
+ ],
185
+ "lr": 0.001,
186
+ "beta_kl": 0.02,
187
+ "dropout": 0.2,
188
+ "weight_decay": 0.0001,
189
+ "use_pid_loss": true
190
+ },
191
+ "val_loss": 0.10106248408555984,
192
+ "stopped_epoch": 139,
193
+ "time_s": 0.42972395895048976
194
+ },
195
+ {
196
+ "hp": {
197
+ "hidden_dims": [
198
+ 32,
199
+ 16
200
+ ],
201
+ "lr": 0.001,
202
+ "beta_kl": 0.05,
203
+ "dropout": 0.0,
204
+ "weight_decay": 0.0001,
205
+ "use_pid_loss": true
206
+ },
207
+ "val_loss": 0.10415155440568924,
208
+ "stopped_epoch": 150,
209
+ "time_s": 0.463614292035345
210
+ },
211
+ {
212
+ "hp": {
213
+ "hidden_dims": [
214
+ 32,
215
+ 16
216
+ ],
217
+ "lr": 0.001,
218
+ "beta_kl": 0.05,
219
+ "dropout": 0.1,
220
+ "weight_decay": 0.0001,
221
+ "use_pid_loss": true
222
+ },
223
+ "val_loss": 0.1088128313422203,
224
+ "stopped_epoch": 107,
225
+ "time_s": 0.3379857499967329
226
+ },
227
+ {
228
+ "hp": {
229
+ "hidden_dims": [
230
+ 32,
231
+ 16
232
+ ],
233
+ "lr": 0.001,
234
+ "beta_kl": 0.05,
235
+ "dropout": 0.2,
236
+ "weight_decay": 0.0001,
237
+ "use_pid_loss": true
238
+ },
239
+ "val_loss": 0.10398281365633011,
240
+ "stopped_epoch": 108,
241
+ "time_s": 0.3434776670183055
242
+ },
243
+ {
244
+ "hp": {
245
+ "hidden_dims": [
246
+ 32,
247
+ 16
248
+ ],
249
+ "lr": 0.001,
250
+ "beta_kl": 0.1,
251
+ "dropout": 0.0,
252
+ "weight_decay": 0.0001,
253
+ "use_pid_loss": true
254
+ },
255
+ "val_loss": 0.10706456750631332,
256
+ "stopped_epoch": 110,
257
+ "time_s": 0.3095027080271393
258
+ },
259
+ {
260
+ "hp": {
261
+ "hidden_dims": [
262
+ 32,
263
+ 16
264
+ ],
265
+ "lr": 0.001,
266
+ "beta_kl": 0.1,
267
+ "dropout": 0.1,
268
+ "weight_decay": 0.0001,
269
+ "use_pid_loss": true
270
+ },
271
+ "val_loss": 0.10526217520236969,
272
+ "stopped_epoch": 113,
273
+ "time_s": 0.330791209009476
274
+ },
275
+ {
276
+ "hp": {
277
+ "hidden_dims": [
278
+ 32,
279
+ 16
280
+ ],
281
+ "lr": 0.001,
282
+ "beta_kl": 0.1,
283
+ "dropout": 0.2,
284
+ "weight_decay": 0.0001,
285
+ "use_pid_loss": true
286
+ },
287
+ "val_loss": 0.10749977827072144,
288
+ "stopped_epoch": 112,
289
+ "time_s": 0.32965670799603686
290
+ },
291
+ {
292
+ "hp": {
293
+ "hidden_dims": [
294
+ 32,
295
+ 16
296
+ ],
297
+ "lr": 0.0003,
298
+ "beta_kl": 0.02,
299
+ "dropout": 0.0,
300
+ "weight_decay": 0.0001,
301
+ "use_pid_loss": true
302
+ },
303
+ "val_loss": 0.10034146904945374,
304
+ "stopped_epoch": 115,
305
+ "time_s": 0.3288282090215944
306
+ },
307
+ {
308
+ "hp": {
309
+ "hidden_dims": [
310
+ 32,
311
+ 16
312
+ ],
313
+ "lr": 0.0003,
314
+ "beta_kl": 0.02,
315
+ "dropout": 0.1,
316
+ "weight_decay": 0.0001,
317
+ "use_pid_loss": true
318
+ },
319
+ "val_loss": 0.09520266205072403,
320
+ "stopped_epoch": 211,
321
+ "time_s": 0.6575975840096362
322
+ },
323
+ {
324
+ "hp": {
325
+ "hidden_dims": [
326
+ 32,
327
+ 16
328
+ ],
329
+ "lr": 0.0003,
330
+ "beta_kl": 0.02,
331
+ "dropout": 0.2,
332
+ "weight_decay": 0.0001,
333
+ "use_pid_loss": true
334
+ },
335
+ "val_loss": 0.10360293835401535,
336
+ "stopped_epoch": 245,
337
+ "time_s": 0.7808697080472484
338
+ },
339
+ {
340
+ "hp": {
341
+ "hidden_dims": [
342
+ 32,
343
+ 16
344
+ ],
345
+ "lr": 0.0003,
346
+ "beta_kl": 0.05,
347
+ "dropout": 0.0,
348
+ "weight_decay": 0.0001,
349
+ "use_pid_loss": true
350
+ },
351
+ "val_loss": 0.09755263477563858,
352
+ "stopped_epoch": 246,
353
+ "time_s": 0.7056415829574689
354
+ },
355
+ {
356
+ "hp": {
357
+ "hidden_dims": [
358
+ 32,
359
+ 16
360
+ ],
361
+ "lr": 0.0003,
362
+ "beta_kl": 0.05,
363
+ "dropout": 0.1,
364
+ "weight_decay": 0.0001,
365
+ "use_pid_loss": true
366
+ },
367
+ "val_loss": 0.09705237299203873,
368
+ "stopped_epoch": 195,
369
+ "time_s": 0.6036518330220133
370
+ },
371
+ {
372
+ "hp": {
373
+ "hidden_dims": [
374
+ 32,
375
+ 16
376
+ ],
377
+ "lr": 0.0003,
378
+ "beta_kl": 0.05,
379
+ "dropout": 0.2,
380
+ "weight_decay": 0.0001,
381
+ "use_pid_loss": true
382
+ },
383
+ "val_loss": 0.10187888890504837,
384
+ "stopped_epoch": 226,
385
+ "time_s": 0.7090968340053223
386
+ },
387
+ {
388
+ "hp": {
389
+ "hidden_dims": [
390
+ 32,
391
+ 16
392
+ ],
393
+ "lr": 0.0003,
394
+ "beta_kl": 0.1,
395
+ "dropout": 0.0,
396
+ "weight_decay": 0.0001,
397
+ "use_pid_loss": true
398
+ },
399
+ "val_loss": 0.10171081870794296,
400
+ "stopped_epoch": 156,
401
+ "time_s": 0.4607050420017913
402
+ },
403
+ {
404
+ "hp": {
405
+ "hidden_dims": [
406
+ 32,
407
+ 16
408
+ ],
409
+ "lr": 0.0003,
410
+ "beta_kl": 0.1,
411
+ "dropout": 0.1,
412
+ "weight_decay": 0.0001,
413
+ "use_pid_loss": true
414
+ },
415
+ "val_loss": 0.10760975629091263,
416
+ "stopped_epoch": 169,
417
+ "time_s": 0.5401734170154668
418
+ },
419
+ {
420
+ "hp": {
421
+ "hidden_dims": [
422
+ 32,
423
+ 16
424
+ ],
425
+ "lr": 0.0003,
426
+ "beta_kl": 0.1,
427
+ "dropout": 0.2,
428
+ "weight_decay": 0.0001,
429
+ "use_pid_loss": true
430
+ },
431
+ "val_loss": 0.10248646885156631,
432
+ "stopped_epoch": 127,
433
+ "time_s": 0.3923001250368543
434
+ },
435
+ {
436
+ "hp": {
437
+ "hidden_dims": [
438
+ 64,
439
+ 32
440
+ ],
441
+ "lr": 0.003,
442
+ "beta_kl": 0.02,
443
+ "dropout": 0.0,
444
+ "weight_decay": 0.0001,
445
+ "use_pid_loss": true
446
+ },
447
+ "val_loss": 0.09845619648694992,
448
+ "stopped_epoch": 103,
449
+ "time_s": 0.30979183298768476
450
+ },
451
+ {
452
+ "hp": {
453
+ "hidden_dims": [
454
+ 64,
455
+ 32
456
+ ],
457
+ "lr": 0.003,
458
+ "beta_kl": 0.02,
459
+ "dropout": 0.1,
460
+ "weight_decay": 0.0001,
461
+ "use_pid_loss": true
462
+ },
463
+ "val_loss": 0.0941203236579895,
464
+ "stopped_epoch": 119,
465
+ "time_s": 0.3695387089974247
466
+ },
467
+ {
468
+ "hp": {
469
+ "hidden_dims": [
470
+ 64,
471
+ 32
472
+ ],
473
+ "lr": 0.003,
474
+ "beta_kl": 0.02,
475
+ "dropout": 0.2,
476
+ "weight_decay": 0.0001,
477
+ "use_pid_loss": true
478
+ },
479
+ "val_loss": 0.09730832278728485,
480
+ "stopped_epoch": 108,
481
+ "time_s": 0.3159803749877028
482
+ },
483
+ {
484
+ "hp": {
485
+ "hidden_dims": [
486
+ 64,
487
+ 32
488
+ ],
489
+ "lr": 0.003,
490
+ "beta_kl": 0.05,
491
+ "dropout": 0.0,
492
+ "weight_decay": 0.0001,
493
+ "use_pid_loss": true
494
+ },
495
+ "val_loss": 0.10986710339784622,
496
+ "stopped_epoch": 141,
497
+ "time_s": 0.3859832090092823
498
+ },
499
+ {
500
+ "hp": {
501
+ "hidden_dims": [
502
+ 64,
503
+ 32
504
+ ],
505
+ "lr": 0.003,
506
+ "beta_kl": 0.05,
507
+ "dropout": 0.1,
508
+ "weight_decay": 0.0001,
509
+ "use_pid_loss": true
510
+ },
511
+ "val_loss": 0.10231650620698929,
512
+ "stopped_epoch": 103,
513
+ "time_s": 0.3125516250147484
514
+ },
515
+ {
516
+ "hp": {
517
+ "hidden_dims": [
518
+ 64,
519
+ 32
520
+ ],
521
+ "lr": 0.003,
522
+ "beta_kl": 0.05,
523
+ "dropout": 0.2,
524
+ "weight_decay": 0.0001,
525
+ "use_pid_loss": true
526
+ },
527
+ "val_loss": 0.0952429249882698,
528
+ "stopped_epoch": 123,
529
+ "time_s": 0.386852000025101
530
+ },
531
+ {
532
+ "hp": {
533
+ "hidden_dims": [
534
+ 64,
535
+ 32
536
+ ],
537
+ "lr": 0.003,
538
+ "beta_kl": 0.1,
539
+ "dropout": 0.0,
540
+ "weight_decay": 0.0001,
541
+ "use_pid_loss": true
542
+ },
543
+ "val_loss": 0.10523795336484909,
544
+ "stopped_epoch": 102,
545
+ "time_s": 0.2852378750103526
546
+ },
547
+ {
548
+ "hp": {
549
+ "hidden_dims": [
550
+ 64,
551
+ 32
552
+ ],
553
+ "lr": 0.003,
554
+ "beta_kl": 0.1,
555
+ "dropout": 0.1,
556
+ "weight_decay": 0.0001,
557
+ "use_pid_loss": true
558
+ },
559
+ "val_loss": 0.10593602061271667,
560
+ "stopped_epoch": 194,
561
+ "time_s": 0.5935489159892313
562
+ },
563
+ {
564
+ "hp": {
565
+ "hidden_dims": [
566
+ 64,
567
+ 32
568
+ ],
569
+ "lr": 0.003,
570
+ "beta_kl": 0.1,
571
+ "dropout": 0.2,
572
+ "weight_decay": 0.0001,
573
+ "use_pid_loss": true
574
+ },
575
+ "val_loss": 0.10924716293811798,
576
+ "stopped_epoch": 103,
577
+ "time_s": 0.30400612502126023
578
+ },
579
+ {
580
+ "hp": {
581
+ "hidden_dims": [
582
+ 64,
583
+ 32
584
+ ],
585
+ "lr": 0.001,
586
+ "beta_kl": 0.02,
587
+ "dropout": 0.0,
588
+ "weight_decay": 0.0001,
589
+ "use_pid_loss": true
590
+ },
591
+ "val_loss": 0.10035652667284012,
592
+ "stopped_epoch": 104,
593
+ "time_s": 0.2876757920021191
594
+ },
595
+ {
596
+ "hp": {
597
+ "hidden_dims": [
598
+ 64,
599
+ 32
600
+ ],
601
+ "lr": 0.001,
602
+ "beta_kl": 0.02,
603
+ "dropout": 0.1,
604
+ "weight_decay": 0.0001,
605
+ "use_pid_loss": true
606
+ },
607
+ "val_loss": 0.09972376376390457,
608
+ "stopped_epoch": 139,
609
+ "time_s": 0.4184161250013858
610
+ },
611
+ {
612
+ "hp": {
613
+ "hidden_dims": [
614
+ 64,
615
+ 32
616
+ ],
617
+ "lr": 0.001,
618
+ "beta_kl": 0.02,
619
+ "dropout": 0.2,
620
+ "weight_decay": 0.0001,
621
+ "use_pid_loss": true
622
+ },
623
+ "val_loss": 0.1002410501241684,
624
+ "stopped_epoch": 154,
625
+ "time_s": 0.47230020799906924
626
+ },
627
+ {
628
+ "hp": {
629
+ "hidden_dims": [
630
+ 64,
631
+ 32
632
+ ],
633
+ "lr": 0.001,
634
+ "beta_kl": 0.05,
635
+ "dropout": 0.0,
636
+ "weight_decay": 0.0001,
637
+ "use_pid_loss": true
638
+ },
639
+ "val_loss": 0.09540876001119614,
640
+ "stopped_epoch": 104,
641
+ "time_s": 0.2981030840310268
642
+ },
643
+ {
644
+ "hp": {
645
+ "hidden_dims": [
646
+ 64,
647
+ 32
648
+ ],
649
+ "lr": 0.001,
650
+ "beta_kl": 0.05,
651
+ "dropout": 0.1,
652
+ "weight_decay": 0.0001,
653
+ "use_pid_loss": true
654
+ },
655
+ "val_loss": 0.10364223271608353,
656
+ "stopped_epoch": 220,
657
+ "time_s": 0.7037542500183918
658
+ },
659
+ {
660
+ "hp": {
661
+ "hidden_dims": [
662
+ 64,
663
+ 32
664
+ ],
665
+ "lr": 0.001,
666
+ "beta_kl": 0.05,
667
+ "dropout": 0.2,
668
+ "weight_decay": 0.0001,
669
+ "use_pid_loss": true
670
+ },
671
+ "val_loss": 0.10783598572015762,
672
+ "stopped_epoch": 101,
673
+ "time_s": 0.29406745795859024
674
+ },
675
+ {
676
+ "hp": {
677
+ "hidden_dims": [
678
+ 64,
679
+ 32
680
+ ],
681
+ "lr": 0.001,
682
+ "beta_kl": 0.1,
683
+ "dropout": 0.0,
684
+ "weight_decay": 0.0001,
685
+ "use_pid_loss": true
686
+ },
687
+ "val_loss": 0.10237938910722733,
688
+ "stopped_epoch": 157,
689
+ "time_s": 0.4418069999665022
690
+ },
691
+ {
692
+ "hp": {
693
+ "hidden_dims": [
694
+ 64,
695
+ 32
696
+ ],
697
+ "lr": 0.001,
698
+ "beta_kl": 0.1,
699
+ "dropout": 0.1,
700
+ "weight_decay": 0.0001,
701
+ "use_pid_loss": true
702
+ },
703
+ "val_loss": 0.10463026165962219,
704
+ "stopped_epoch": 108,
705
+ "time_s": 0.33412266697268933
706
+ },
707
+ {
708
+ "hp": {
709
+ "hidden_dims": [
710
+ 64,
711
+ 32
712
+ ],
713
+ "lr": 0.001,
714
+ "beta_kl": 0.1,
715
+ "dropout": 0.2,
716
+ "weight_decay": 0.0001,
717
+ "use_pid_loss": true
718
+ },
719
+ "val_loss": 0.10396554321050644,
720
+ "stopped_epoch": 104,
721
+ "time_s": 0.3171117919846438
722
+ },
723
+ {
724
+ "hp": {
725
+ "hidden_dims": [
726
+ 64,
727
+ 32
728
+ ],
729
+ "lr": 0.0003,
730
+ "beta_kl": 0.02,
731
+ "dropout": 0.0,
732
+ "weight_decay": 0.0001,
733
+ "use_pid_loss": true
734
+ },
735
+ "val_loss": 0.0980963185429573,
736
+ "stopped_epoch": 118,
737
+ "time_s": 0.34339095902396366
738
+ },
739
+ {
740
+ "hp": {
741
+ "hidden_dims": [
742
+ 64,
743
+ 32
744
+ ],
745
+ "lr": 0.0003,
746
+ "beta_kl": 0.02,
747
+ "dropout": 0.1,
748
+ "weight_decay": 0.0001,
749
+ "use_pid_loss": true
750
+ },
751
+ "val_loss": 0.09119636565446854,
752
+ "stopped_epoch": 221,
753
+ "time_s": 0.6611024999874644
754
+ },
755
+ {
756
+ "hp": {
757
+ "hidden_dims": [
758
+ 64,
759
+ 32
760
+ ],
761
+ "lr": 0.0003,
762
+ "beta_kl": 0.02,
763
+ "dropout": 0.2,
764
+ "weight_decay": 0.0001,
765
+ "use_pid_loss": true
766
+ },
767
+ "val_loss": 0.10583359748125076,
768
+ "stopped_epoch": 204,
769
+ "time_s": 0.6189236660138704
770
+ },
771
+ {
772
+ "hp": {
773
+ "hidden_dims": [
774
+ 64,
775
+ 32
776
+ ],
777
+ "lr": 0.0003,
778
+ "beta_kl": 0.05,
779
+ "dropout": 0.0,
780
+ "weight_decay": 0.0001,
781
+ "use_pid_loss": true
782
+ },
783
+ "val_loss": 0.10601699352264404,
784
+ "stopped_epoch": 106,
785
+ "time_s": 0.302524084050674
786
+ },
787
+ {
788
+ "hp": {
789
+ "hidden_dims": [
790
+ 64,
791
+ 32
792
+ ],
793
+ "lr": 0.0003,
794
+ "beta_kl": 0.05,
795
+ "dropout": 0.1,
796
+ "weight_decay": 0.0001,
797
+ "use_pid_loss": true
798
+ },
799
+ "val_loss": 0.09647030383348465,
800
+ "stopped_epoch": 113,
801
+ "time_s": 0.3519312500138767
802
+ },
803
+ {
804
+ "hp": {
805
+ "hidden_dims": [
806
+ 64,
807
+ 32
808
+ ],
809
+ "lr": 0.0003,
810
+ "beta_kl": 0.05,
811
+ "dropout": 0.2,
812
+ "weight_decay": 0.0001,
813
+ "use_pid_loss": true
814
+ },
815
+ "val_loss": 0.10571961849927902,
816
+ "stopped_epoch": 110,
817
+ "time_s": 0.34017366700572893
818
+ },
819
+ {
820
+ "hp": {
821
+ "hidden_dims": [
822
+ 64,
823
+ 32
824
+ ],
825
+ "lr": 0.0003,
826
+ "beta_kl": 0.1,
827
+ "dropout": 0.0,
828
+ "weight_decay": 0.0001,
829
+ "use_pid_loss": true
830
+ },
831
+ "val_loss": 0.1020527184009552,
832
+ "stopped_epoch": 109,
833
+ "time_s": 0.31051195797044784
834
+ },
835
+ {
836
+ "hp": {
837
+ "hidden_dims": [
838
+ 64,
839
+ 32
840
+ ],
841
+ "lr": 0.0003,
842
+ "beta_kl": 0.1,
843
+ "dropout": 0.1,
844
+ "weight_decay": 0.0001,
845
+ "use_pid_loss": true
846
+ },
847
+ "val_loss": 0.1047804057598114,
848
+ "stopped_epoch": 136,
849
+ "time_s": 0.40434908302268013
850
+ },
851
+ {
852
+ "hp": {
853
+ "hidden_dims": [
854
+ 64,
855
+ 32
856
+ ],
857
+ "lr": 0.0003,
858
+ "beta_kl": 0.1,
859
+ "dropout": 0.2,
860
+ "weight_decay": 0.0001,
861
+ "use_pid_loss": true
862
+ },
863
+ "val_loss": 0.09742459654808044,
864
+ "stopped_epoch": 126,
865
+ "time_s": 0.3808767079608515
866
+ },
867
+ {
868
+ "hp": {
869
+ "hidden_dims": [
870
+ 128,
871
+ 64
872
+ ],
873
+ "lr": 0.003,
874
+ "beta_kl": 0.02,
875
+ "dropout": 0.0,
876
+ "weight_decay": 0.0001,
877
+ "use_pid_loss": true
878
+ },
879
+ "val_loss": 0.09734103828668594,
880
+ "stopped_epoch": 143,
881
+ "time_s": 0.45690974994795397
882
+ },
883
+ {
884
+ "hp": {
885
+ "hidden_dims": [
886
+ 128,
887
+ 64
888
+ ],
889
+ "lr": 0.003,
890
+ "beta_kl": 0.02,
891
+ "dropout": 0.1,
892
+ "weight_decay": 0.0001,
893
+ "use_pid_loss": true
894
+ },
895
+ "val_loss": 0.09987301379442215,
896
+ "stopped_epoch": 135,
897
+ "time_s": 0.46669295796891674
898
+ },
899
+ {
900
+ "hp": {
901
+ "hidden_dims": [
902
+ 128,
903
+ 64
904
+ ],
905
+ "lr": 0.003,
906
+ "beta_kl": 0.02,
907
+ "dropout": 0.2,
908
+ "weight_decay": 0.0001,
909
+ "use_pid_loss": true
910
+ },
911
+ "val_loss": 0.09803911298513412,
912
+ "stopped_epoch": 125,
913
+ "time_s": 0.40080379199935123
914
+ },
915
+ {
916
+ "hp": {
917
+ "hidden_dims": [
918
+ 128,
919
+ 64
920
+ ],
921
+ "lr": 0.003,
922
+ "beta_kl": 0.05,
923
+ "dropout": 0.0,
924
+ "weight_decay": 0.0001,
925
+ "use_pid_loss": true
926
+ },
927
+ "val_loss": 0.10084632784128189,
928
+ "stopped_epoch": 128,
929
+ "time_s": 0.36044895800296217
930
+ },
931
+ {
932
+ "hp": {
933
+ "hidden_dims": [
934
+ 128,
935
+ 64
936
+ ],
937
+ "lr": 0.003,
938
+ "beta_kl": 0.05,
939
+ "dropout": 0.1,
940
+ "weight_decay": 0.0001,
941
+ "use_pid_loss": true
942
+ },
943
+ "val_loss": 0.10756398737430573,
944
+ "stopped_epoch": 120,
945
+ "time_s": 0.35246904200175777
946
+ },
947
+ {
948
+ "hp": {
949
+ "hidden_dims": [
950
+ 128,
951
+ 64
952
+ ],
953
+ "lr": 0.003,
954
+ "beta_kl": 0.05,
955
+ "dropout": 0.2,
956
+ "weight_decay": 0.0001,
957
+ "use_pid_loss": true
958
+ },
959
+ "val_loss": 0.10436253249645233,
960
+ "stopped_epoch": 104,
961
+ "time_s": 0.3163909580325708
962
+ },
963
+ {
964
+ "hp": {
965
+ "hidden_dims": [
966
+ 128,
967
+ 64
968
+ ],
969
+ "lr": 0.003,
970
+ "beta_kl": 0.1,
971
+ "dropout": 0.0,
972
+ "weight_decay": 0.0001,
973
+ "use_pid_loss": true
974
+ },
975
+ "val_loss": 0.10651297867298126,
976
+ "stopped_epoch": 131,
977
+ "time_s": 0.402843791001942
978
+ },
979
+ {
980
+ "hp": {
981
+ "hidden_dims": [
982
+ 128,
983
+ 64
984
+ ],
985
+ "lr": 0.003,
986
+ "beta_kl": 0.1,
987
+ "dropout": 0.1,
988
+ "weight_decay": 0.0001,
989
+ "use_pid_loss": true
990
+ },
991
+ "val_loss": 0.1076364517211914,
992
+ "stopped_epoch": 106,
993
+ "time_s": 0.33409766695695
994
+ },
995
+ {
996
+ "hp": {
997
+ "hidden_dims": [
998
+ 128,
999
+ 64
1000
+ ],
1001
+ "lr": 0.003,
1002
+ "beta_kl": 0.1,
1003
+ "dropout": 0.2,
1004
+ "weight_decay": 0.0001,
1005
+ "use_pid_loss": true
1006
+ },
1007
+ "val_loss": 0.10318459570407867,
1008
+ "stopped_epoch": 106,
1009
+ "time_s": 0.37188029097160324
1010
+ },
1011
+ {
1012
+ "hp": {
1013
+ "hidden_dims": [
1014
+ 128,
1015
+ 64
1016
+ ],
1017
+ "lr": 0.001,
1018
+ "beta_kl": 0.02,
1019
+ "dropout": 0.0,
1020
+ "weight_decay": 0.0001,
1021
+ "use_pid_loss": true
1022
+ },
1023
+ "val_loss": 0.09828519821166992,
1024
+ "stopped_epoch": 104,
1025
+ "time_s": 0.30446800001664087
1026
+ },
1027
+ {
1028
+ "hp": {
1029
+ "hidden_dims": [
1030
+ 128,
1031
+ 64
1032
+ ],
1033
+ "lr": 0.001,
1034
+ "beta_kl": 0.02,
1035
+ "dropout": 0.1,
1036
+ "weight_decay": 0.0001,
1037
+ "use_pid_loss": true
1038
+ },
1039
+ "val_loss": 0.10254894196987152,
1040
+ "stopped_epoch": 206,
1041
+ "time_s": 0.6062651670072228
1042
+ },
1043
+ {
1044
+ "hp": {
1045
+ "hidden_dims": [
1046
+ 128,
1047
+ 64
1048
+ ],
1049
+ "lr": 0.001,
1050
+ "beta_kl": 0.02,
1051
+ "dropout": 0.2,
1052
+ "weight_decay": 0.0001,
1053
+ "use_pid_loss": true
1054
+ },
1055
+ "val_loss": 0.09948968887329102,
1056
+ "stopped_epoch": 112,
1057
+ "time_s": 0.3444743750151247
1058
+ },
1059
+ {
1060
+ "hp": {
1061
+ "hidden_dims": [
1062
+ 128,
1063
+ 64
1064
+ ],
1065
+ "lr": 0.001,
1066
+ "beta_kl": 0.05,
1067
+ "dropout": 0.0,
1068
+ "weight_decay": 0.0001,
1069
+ "use_pid_loss": true
1070
+ },
1071
+ "val_loss": 0.1088625118136406,
1072
+ "stopped_epoch": 115,
1073
+ "time_s": 0.33527904201764613
1074
+ },
1075
+ {
1076
+ "hp": {
1077
+ "hidden_dims": [
1078
+ 128,
1079
+ 64
1080
+ ],
1081
+ "lr": 0.001,
1082
+ "beta_kl": 0.05,
1083
+ "dropout": 0.1,
1084
+ "weight_decay": 0.0001,
1085
+ "use_pid_loss": true
1086
+ },
1087
+ "val_loss": 0.10180449485778809,
1088
+ "stopped_epoch": 139,
1089
+ "time_s": 0.4372234169859439
1090
+ },
1091
+ {
1092
+ "hp": {
1093
+ "hidden_dims": [
1094
+ 128,
1095
+ 64
1096
+ ],
1097
+ "lr": 0.001,
1098
+ "beta_kl": 0.05,
1099
+ "dropout": 0.2,
1100
+ "weight_decay": 0.0001,
1101
+ "use_pid_loss": true
1102
+ },
1103
+ "val_loss": 0.098965123295784,
1104
+ "stopped_epoch": 103,
1105
+ "time_s": 0.3150777089758776
1106
+ },
1107
+ {
1108
+ "hp": {
1109
+ "hidden_dims": [
1110
+ 128,
1111
+ 64
1112
+ ],
1113
+ "lr": 0.001,
1114
+ "beta_kl": 0.1,
1115
+ "dropout": 0.0,
1116
+ "weight_decay": 0.0001,
1117
+ "use_pid_loss": true
1118
+ },
1119
+ "val_loss": 0.10939312726259232,
1120
+ "stopped_epoch": 102,
1121
+ "time_s": 0.2914257920347154
1122
+ },
1123
+ {
1124
+ "hp": {
1125
+ "hidden_dims": [
1126
+ 128,
1127
+ 64
1128
+ ],
1129
+ "lr": 0.001,
1130
+ "beta_kl": 0.1,
1131
+ "dropout": 0.1,
1132
+ "weight_decay": 0.0001,
1133
+ "use_pid_loss": true
1134
+ },
1135
+ "val_loss": 0.10926490277051926,
1136
+ "stopped_epoch": 148,
1137
+ "time_s": 0.4702196249854751
1138
+ },
1139
+ {
1140
+ "hp": {
1141
+ "hidden_dims": [
1142
+ 128,
1143
+ 64
1144
+ ],
1145
+ "lr": 0.001,
1146
+ "beta_kl": 0.1,
1147
+ "dropout": 0.2,
1148
+ "weight_decay": 0.0001,
1149
+ "use_pid_loss": true
1150
+ },
1151
+ "val_loss": 0.10315917432308197,
1152
+ "stopped_epoch": 146,
1153
+ "time_s": 0.4897199170081876
1154
+ },
1155
+ {
1156
+ "hp": {
1157
+ "hidden_dims": [
1158
+ 128,
1159
+ 64
1160
+ ],
1161
+ "lr": 0.0003,
1162
+ "beta_kl": 0.02,
1163
+ "dropout": 0.0,
1164
+ "weight_decay": 0.0001,
1165
+ "use_pid_loss": true
1166
+ },
1167
+ "val_loss": 0.09730847179889679,
1168
+ "stopped_epoch": 106,
1169
+ "time_s": 0.3132300000288524
1170
+ },
1171
+ {
1172
+ "hp": {
1173
+ "hidden_dims": [
1174
+ 128,
1175
+ 64
1176
+ ],
1177
+ "lr": 0.0003,
1178
+ "beta_kl": 0.02,
1179
+ "dropout": 0.1,
1180
+ "weight_decay": 0.0001,
1181
+ "use_pid_loss": true
1182
+ },
1183
+ "val_loss": 0.1053319051861763,
1184
+ "stopped_epoch": 264,
1185
+ "time_s": 0.818234707985539
1186
+ },
1187
+ {
1188
+ "hp": {
1189
+ "hidden_dims": [
1190
+ 128,
1191
+ 64
1192
+ ],
1193
+ "lr": 0.0003,
1194
+ "beta_kl": 0.02,
1195
+ "dropout": 0.2,
1196
+ "weight_decay": 0.0001,
1197
+ "use_pid_loss": true
1198
+ },
1199
+ "val_loss": 0.10112594813108444,
1200
+ "stopped_epoch": 157,
1201
+ "time_s": 0.5580500829964876
1202
+ },
1203
+ {
1204
+ "hp": {
1205
+ "hidden_dims": [
1206
+ 128,
1207
+ 64
1208
+ ],
1209
+ "lr": 0.0003,
1210
+ "beta_kl": 0.05,
1211
+ "dropout": 0.0,
1212
+ "weight_decay": 0.0001,
1213
+ "use_pid_loss": true
1214
+ },
1215
+ "val_loss": 0.10136955976486206,
1216
+ "stopped_epoch": 134,
1217
+ "time_s": 0.40941287501482293
1218
+ },
1219
+ {
1220
+ "hp": {
1221
+ "hidden_dims": [
1222
+ 128,
1223
+ 64
1224
+ ],
1225
+ "lr": 0.0003,
1226
+ "beta_kl": 0.05,
1227
+ "dropout": 0.1,
1228
+ "weight_decay": 0.0001,
1229
+ "use_pid_loss": true
1230
+ },
1231
+ "val_loss": 0.09758520126342773,
1232
+ "stopped_epoch": 109,
1233
+ "time_s": 0.39445870800409466
1234
+ },
1235
+ {
1236
+ "hp": {
1237
+ "hidden_dims": [
1238
+ 128,
1239
+ 64
1240
+ ],
1241
+ "lr": 0.0003,
1242
+ "beta_kl": 0.05,
1243
+ "dropout": 0.2,
1244
+ "weight_decay": 0.0001,
1245
+ "use_pid_loss": true
1246
+ },
1247
+ "val_loss": 0.09949986636638641,
1248
+ "stopped_epoch": 107,
1249
+ "time_s": 0.3892017080215737
1250
+ },
1251
+ {
1252
+ "hp": {
1253
+ "hidden_dims": [
1254
+ 128,
1255
+ 64
1256
+ ],
1257
+ "lr": 0.0003,
1258
+ "beta_kl": 0.1,
1259
+ "dropout": 0.0,
1260
+ "weight_decay": 0.0001,
1261
+ "use_pid_loss": true
1262
+ },
1263
+ "val_loss": 0.1017155796289444,
1264
+ "stopped_epoch": 104,
1265
+ "time_s": 0.34829125000396743
1266
+ },
1267
+ {
1268
+ "hp": {
1269
+ "hidden_dims": [
1270
+ 128,
1271
+ 64
1272
+ ],
1273
+ "lr": 0.0003,
1274
+ "beta_kl": 0.1,
1275
+ "dropout": 0.1,
1276
+ "weight_decay": 0.0001,
1277
+ "use_pid_loss": true
1278
+ },
1279
+ "val_loss": 0.10721778124570847,
1280
+ "stopped_epoch": 219,
1281
+ "time_s": 0.7467506669927388
1282
+ },
1283
+ {
1284
+ "hp": {
1285
+ "hidden_dims": [
1286
+ 128,
1287
+ 64
1288
+ ],
1289
+ "lr": 0.0003,
1290
+ "beta_kl": 0.1,
1291
+ "dropout": 0.2,
1292
+ "weight_decay": 0.0001,
1293
+ "use_pid_loss": true
1294
+ },
1295
+ "val_loss": 0.09094587713479996,
1296
+ "stopped_epoch": 148,
1297
+ "time_s": 0.5177526250481606
1298
+ },
1299
+ {
1300
+ "hp": {
1301
+ "hidden_dims": [
1302
+ 128,
1303
+ 64,
1304
+ 32
1305
+ ],
1306
+ "lr": 0.003,
1307
+ "beta_kl": 0.02,
1308
+ "dropout": 0.0,
1309
+ "weight_decay": 0.0001,
1310
+ "use_pid_loss": true
1311
+ },
1312
+ "val_loss": 0.1061350554227829,
1313
+ "stopped_epoch": 103,
1314
+ "time_s": 0.4550556250032969
1315
+ },
1316
+ {
1317
+ "hp": {
1318
+ "hidden_dims": [
1319
+ 128,
1320
+ 64,
1321
+ 32
1322
+ ],
1323
+ "lr": 0.003,
1324
+ "beta_kl": 0.02,
1325
+ "dropout": 0.1,
1326
+ "weight_decay": 0.0001,
1327
+ "use_pid_loss": true
1328
+ },
1329
+ "val_loss": 0.09498627483844757,
1330
+ "stopped_epoch": 184,
1331
+ "time_s": 0.8283787079853937
1332
+ },
1333
+ {
1334
+ "hp": {
1335
+ "hidden_dims": [
1336
+ 128,
1337
+ 64,
1338
+ 32
1339
+ ],
1340
+ "lr": 0.003,
1341
+ "beta_kl": 0.02,
1342
+ "dropout": 0.2,
1343
+ "weight_decay": 0.0001,
1344
+ "use_pid_loss": true
1345
+ },
1346
+ "val_loss": 0.10696770250797272,
1347
+ "stopped_epoch": 123,
1348
+ "time_s": 0.47497366700554267
1349
+ },
1350
+ {
1351
+ "hp": {
1352
+ "hidden_dims": [
1353
+ 128,
1354
+ 64,
1355
+ 32
1356
+ ],
1357
+ "lr": 0.003,
1358
+ "beta_kl": 0.05,
1359
+ "dropout": 0.0,
1360
+ "weight_decay": 0.0001,
1361
+ "use_pid_loss": true
1362
+ },
1363
+ "val_loss": 0.10211507976055145,
1364
+ "stopped_epoch": 136,
1365
+ "time_s": 0.4954894579714164
1366
+ },
1367
+ {
1368
+ "hp": {
1369
+ "hidden_dims": [
1370
+ 128,
1371
+ 64,
1372
+ 32
1373
+ ],
1374
+ "lr": 0.003,
1375
+ "beta_kl": 0.05,
1376
+ "dropout": 0.1,
1377
+ "weight_decay": 0.0001,
1378
+ "use_pid_loss": true
1379
+ },
1380
+ "val_loss": 0.1070987656712532,
1381
+ "stopped_epoch": 102,
1382
+ "time_s": 0.4064530420000665
1383
+ },
1384
+ {
1385
+ "hp": {
1386
+ "hidden_dims": [
1387
+ 128,
1388
+ 64,
1389
+ 32
1390
+ ],
1391
+ "lr": 0.003,
1392
+ "beta_kl": 0.05,
1393
+ "dropout": 0.2,
1394
+ "weight_decay": 0.0001,
1395
+ "use_pid_loss": true
1396
+ },
1397
+ "val_loss": 0.10747461766004562,
1398
+ "stopped_epoch": 172,
1399
+ "time_s": 0.6777474579866976
1400
+ },
1401
+ {
1402
+ "hp": {
1403
+ "hidden_dims": [
1404
+ 128,
1405
+ 64,
1406
+ 32
1407
+ ],
1408
+ "lr": 0.003,
1409
+ "beta_kl": 0.1,
1410
+ "dropout": 0.0,
1411
+ "weight_decay": 0.0001,
1412
+ "use_pid_loss": true
1413
+ },
1414
+ "val_loss": 0.11168321222066879,
1415
+ "stopped_epoch": 127,
1416
+ "time_s": 0.45429399999557063
1417
+ },
1418
+ {
1419
+ "hp": {
1420
+ "hidden_dims": [
1421
+ 128,
1422
+ 64,
1423
+ 32
1424
+ ],
1425
+ "lr": 0.003,
1426
+ "beta_kl": 0.1,
1427
+ "dropout": 0.1,
1428
+ "weight_decay": 0.0001,
1429
+ "use_pid_loss": true
1430
+ },
1431
+ "val_loss": 0.10364384204149246,
1432
+ "stopped_epoch": 146,
1433
+ "time_s": 0.5978276670211926
1434
+ },
1435
+ {
1436
+ "hp": {
1437
+ "hidden_dims": [
1438
+ 128,
1439
+ 64,
1440
+ 32
1441
+ ],
1442
+ "lr": 0.003,
1443
+ "beta_kl": 0.1,
1444
+ "dropout": 0.2,
1445
+ "weight_decay": 0.0001,
1446
+ "use_pid_loss": true
1447
+ },
1448
+ "val_loss": 0.10173693299293518,
1449
+ "stopped_epoch": 138,
1450
+ "time_s": 0.5611937919748016
1451
+ },
1452
+ {
1453
+ "hp": {
1454
+ "hidden_dims": [
1455
+ 128,
1456
+ 64,
1457
+ 32
1458
+ ],
1459
+ "lr": 0.001,
1460
+ "beta_kl": 0.02,
1461
+ "dropout": 0.0,
1462
+ "weight_decay": 0.0001,
1463
+ "use_pid_loss": true
1464
+ },
1465
+ "val_loss": 0.10215073823928833,
1466
+ "stopped_epoch": 103,
1467
+ "time_s": 0.3868753340211697
1468
+ },
1469
+ {
1470
+ "hp": {
1471
+ "hidden_dims": [
1472
+ 128,
1473
+ 64,
1474
+ 32
1475
+ ],
1476
+ "lr": 0.001,
1477
+ "beta_kl": 0.02,
1478
+ "dropout": 0.1,
1479
+ "weight_decay": 0.0001,
1480
+ "use_pid_loss": true
1481
+ },
1482
+ "val_loss": 0.0983714833855629,
1483
+ "stopped_epoch": 128,
1484
+ "time_s": 0.4771239580004476
1485
+ },
1486
+ {
1487
+ "hp": {
1488
+ "hidden_dims": [
1489
+ 128,
1490
+ 64,
1491
+ 32
1492
+ ],
1493
+ "lr": 0.001,
1494
+ "beta_kl": 0.02,
1495
+ "dropout": 0.2,
1496
+ "weight_decay": 0.0001,
1497
+ "use_pid_loss": true
1498
+ },
1499
+ "val_loss": 0.09527570754289627,
1500
+ "stopped_epoch": 110,
1501
+ "time_s": 0.4013628330430947
1502
+ },
1503
+ {
1504
+ "hp": {
1505
+ "hidden_dims": [
1506
+ 128,
1507
+ 64,
1508
+ 32
1509
+ ],
1510
+ "lr": 0.001,
1511
+ "beta_kl": 0.05,
1512
+ "dropout": 0.0,
1513
+ "weight_decay": 0.0001,
1514
+ "use_pid_loss": true
1515
+ },
1516
+ "val_loss": 0.10344179719686508,
1517
+ "stopped_epoch": 105,
1518
+ "time_s": 0.3833526249509305
1519
+ },
1520
+ {
1521
+ "hp": {
1522
+ "hidden_dims": [
1523
+ 128,
1524
+ 64,
1525
+ 32
1526
+ ],
1527
+ "lr": 0.001,
1528
+ "beta_kl": 0.05,
1529
+ "dropout": 0.1,
1530
+ "weight_decay": 0.0001,
1531
+ "use_pid_loss": true
1532
+ },
1533
+ "val_loss": 0.10618078708648682,
1534
+ "stopped_epoch": 162,
1535
+ "time_s": 0.666418207983952
1536
+ },
1537
+ {
1538
+ "hp": {
1539
+ "hidden_dims": [
1540
+ 128,
1541
+ 64,
1542
+ 32
1543
+ ],
1544
+ "lr": 0.001,
1545
+ "beta_kl": 0.05,
1546
+ "dropout": 0.2,
1547
+ "weight_decay": 0.0001,
1548
+ "use_pid_loss": true
1549
+ },
1550
+ "val_loss": 0.10211540758609772,
1551
+ "stopped_epoch": 103,
1552
+ "time_s": 0.4025729999993928
1553
+ },
1554
+ {
1555
+ "hp": {
1556
+ "hidden_dims": [
1557
+ 128,
1558
+ 64,
1559
+ 32
1560
+ ],
1561
+ "lr": 0.001,
1562
+ "beta_kl": 0.1,
1563
+ "dropout": 0.0,
1564
+ "weight_decay": 0.0001,
1565
+ "use_pid_loss": true
1566
+ },
1567
+ "val_loss": 0.10063455253839493,
1568
+ "stopped_epoch": 115,
1569
+ "time_s": 0.39842316700378433
1570
+ },
1571
+ {
1572
+ "hp": {
1573
+ "hidden_dims": [
1574
+ 128,
1575
+ 64,
1576
+ 32
1577
+ ],
1578
+ "lr": 0.001,
1579
+ "beta_kl": 0.1,
1580
+ "dropout": 0.1,
1581
+ "weight_decay": 0.0001,
1582
+ "use_pid_loss": true
1583
+ },
1584
+ "val_loss": 0.10491722822189331,
1585
+ "stopped_epoch": 106,
1586
+ "time_s": 0.38957387499976903
1587
+ },
1588
+ {
1589
+ "hp": {
1590
+ "hidden_dims": [
1591
+ 128,
1592
+ 64,
1593
+ 32
1594
+ ],
1595
+ "lr": 0.001,
1596
+ "beta_kl": 0.1,
1597
+ "dropout": 0.2,
1598
+ "weight_decay": 0.0001,
1599
+ "use_pid_loss": true
1600
+ },
1601
+ "val_loss": 0.10448549687862396,
1602
+ "stopped_epoch": 129,
1603
+ "time_s": 0.4947644579806365
1604
+ },
1605
+ {
1606
+ "hp": {
1607
+ "hidden_dims": [
1608
+ 128,
1609
+ 64,
1610
+ 32
1611
+ ],
1612
+ "lr": 0.0003,
1613
+ "beta_kl": 0.02,
1614
+ "dropout": 0.0,
1615
+ "weight_decay": 0.0001,
1616
+ "use_pid_loss": true
1617
+ },
1618
+ "val_loss": 0.1014222577214241,
1619
+ "stopped_epoch": 112,
1620
+ "time_s": 0.41112791700288653
1621
+ },
1622
+ {
1623
+ "hp": {
1624
+ "hidden_dims": [
1625
+ 128,
1626
+ 64,
1627
+ 32
1628
+ ],
1629
+ "lr": 0.0003,
1630
+ "beta_kl": 0.02,
1631
+ "dropout": 0.1,
1632
+ "weight_decay": 0.0001,
1633
+ "use_pid_loss": true
1634
+ },
1635
+ "val_loss": 0.09844069182872772,
1636
+ "stopped_epoch": 107,
1637
+ "time_s": 0.4165982089471072
1638
+ },
1639
+ {
1640
+ "hp": {
1641
+ "hidden_dims": [
1642
+ 128,
1643
+ 64,
1644
+ 32
1645
+ ],
1646
+ "lr": 0.0003,
1647
+ "beta_kl": 0.02,
1648
+ "dropout": 0.2,
1649
+ "weight_decay": 0.0001,
1650
+ "use_pid_loss": true
1651
+ },
1652
+ "val_loss": 0.09828032553195953,
1653
+ "stopped_epoch": 209,
1654
+ "time_s": 0.7904425000306219
1655
+ },
1656
+ {
1657
+ "hp": {
1658
+ "hidden_dims": [
1659
+ 128,
1660
+ 64,
1661
+ 32
1662
+ ],
1663
+ "lr": 0.0003,
1664
+ "beta_kl": 0.05,
1665
+ "dropout": 0.0,
1666
+ "weight_decay": 0.0001,
1667
+ "use_pid_loss": true
1668
+ },
1669
+ "val_loss": 0.09409097582101822,
1670
+ "stopped_epoch": 131,
1671
+ "time_s": 0.4789945419761352
1672
+ },
1673
+ {
1674
+ "hp": {
1675
+ "hidden_dims": [
1676
+ 128,
1677
+ 64,
1678
+ 32
1679
+ ],
1680
+ "lr": 0.0003,
1681
+ "beta_kl": 0.05,
1682
+ "dropout": 0.1,
1683
+ "weight_decay": 0.0001,
1684
+ "use_pid_loss": true
1685
+ },
1686
+ "val_loss": 0.10193637758493423,
1687
+ "stopped_epoch": 109,
1688
+ "time_s": 0.42699700000230223
1689
+ },
1690
+ {
1691
+ "hp": {
1692
+ "hidden_dims": [
1693
+ 128,
1694
+ 64,
1695
+ 32
1696
+ ],
1697
+ "lr": 0.0003,
1698
+ "beta_kl": 0.05,
1699
+ "dropout": 0.2,
1700
+ "weight_decay": 0.0001,
1701
+ "use_pid_loss": true
1702
+ },
1703
+ "val_loss": 0.0998620018362999,
1704
+ "stopped_epoch": 271,
1705
+ "time_s": 1.0426472090184689
1706
+ },
1707
+ {
1708
+ "hp": {
1709
+ "hidden_dims": [
1710
+ 128,
1711
+ 64,
1712
+ 32
1713
+ ],
1714
+ "lr": 0.0003,
1715
+ "beta_kl": 0.1,
1716
+ "dropout": 0.0,
1717
+ "weight_decay": 0.0001,
1718
+ "use_pid_loss": true
1719
+ },
1720
+ "val_loss": 0.10092081874608994,
1721
+ "stopped_epoch": 123,
1722
+ "time_s": 0.44738416600739583
1723
+ },
1724
+ {
1725
+ "hp": {
1726
+ "hidden_dims": [
1727
+ 128,
1728
+ 64,
1729
+ 32
1730
+ ],
1731
+ "lr": 0.0003,
1732
+ "beta_kl": 0.1,
1733
+ "dropout": 0.1,
1734
+ "weight_decay": 0.0001,
1735
+ "use_pid_loss": true
1736
+ },
1737
+ "val_loss": 0.10327319800853729,
1738
+ "stopped_epoch": 144,
1739
+ "time_s": 0.5763382079894654
1740
+ },
1741
+ {
1742
+ "hp": {
1743
+ "hidden_dims": [
1744
+ 128,
1745
+ 64,
1746
+ 32
1747
+ ],
1748
+ "lr": 0.0003,
1749
+ "beta_kl": 0.1,
1750
+ "dropout": 0.2,
1751
+ "weight_decay": 0.0001,
1752
+ "use_pid_loss": true
1753
+ },
1754
+ "val_loss": 0.10982925444841385,
1755
+ "stopped_epoch": 113,
1756
+ "time_s": 0.4403615409974009
1757
+ }
1758
+ ],
1759
+ "router_b": [
1760
+ {
1761
+ "hp": {
1762
+ "hidden_dims": [
1763
+ 32,
1764
+ 16
1765
+ ],
1766
+ "lr": 0.003,
1767
+ "beta_kl": 0.02,
1768
+ "dropout": 0.0,
1769
+ "weight_decay": 0.0001,
1770
+ "use_pid_loss": true
1771
+ },
1772
+ "val_loss": 0.1002978314726339,
1773
+ "stopped_epoch": 104,
1774
+ "time_s": 3.1333408749778755
1775
+ },
1776
+ {
1777
+ "hp": {
1778
+ "hidden_dims": [
1779
+ 32,
1780
+ 16
1781
+ ],
1782
+ "lr": 0.003,
1783
+ "beta_kl": 0.02,
1784
+ "dropout": 0.1,
1785
+ "weight_decay": 0.0001,
1786
+ "use_pid_loss": true
1787
+ },
1788
+ "val_loss": 0.09994719216244759,
1789
+ "stopped_epoch": 118,
1790
+ "time_s": 3.6624700829852372
1791
+ },
1792
+ {
1793
+ "hp": {
1794
+ "hidden_dims": [
1795
+ 32,
1796
+ 16
1797
+ ],
1798
+ "lr": 0.003,
1799
+ "beta_kl": 0.02,
1800
+ "dropout": 0.2,
1801
+ "weight_decay": 0.0001,
1802
+ "use_pid_loss": true
1803
+ },
1804
+ "val_loss": 0.10003617061355899,
1805
+ "stopped_epoch": 111,
1806
+ "time_s": 3.5706530419993214
1807
+ },
1808
+ {
1809
+ "hp": {
1810
+ "hidden_dims": [
1811
+ 32,
1812
+ 16
1813
+ ],
1814
+ "lr": 0.003,
1815
+ "beta_kl": 0.05,
1816
+ "dropout": 0.0,
1817
+ "weight_decay": 0.0001,
1818
+ "use_pid_loss": true
1819
+ },
1820
+ "val_loss": 0.10096073714811678,
1821
+ "stopped_epoch": 102,
1822
+ "time_s": 3.1130946669727564
1823
+ },
1824
+ {
1825
+ "hp": {
1826
+ "hidden_dims": [
1827
+ 32,
1828
+ 16
1829
+ ],
1830
+ "lr": 0.003,
1831
+ "beta_kl": 0.05,
1832
+ "dropout": 0.1,
1833
+ "weight_decay": 0.0001,
1834
+ "use_pid_loss": true
1835
+ },
1836
+ "val_loss": 0.10027475099515364,
1837
+ "stopped_epoch": 110,
1838
+ "time_s": 3.417081249994226
1839
+ },
1840
+ {
1841
+ "hp": {
1842
+ "hidden_dims": [
1843
+ 32,
1844
+ 16
1845
+ ],
1846
+ "lr": 0.003,
1847
+ "beta_kl": 0.05,
1848
+ "dropout": 0.2,
1849
+ "weight_decay": 0.0001,
1850
+ "use_pid_loss": true
1851
+ },
1852
+ "val_loss": 0.10015452605795998,
1853
+ "stopped_epoch": 117,
1854
+ "time_s": 3.7293019170174375
1855
+ },
1856
+ {
1857
+ "hp": {
1858
+ "hidden_dims": [
1859
+ 32,
1860
+ 16
1861
+ ],
1862
+ "lr": 0.003,
1863
+ "beta_kl": 0.1,
1864
+ "dropout": 0.0,
1865
+ "weight_decay": 0.0001,
1866
+ "use_pid_loss": true
1867
+ },
1868
+ "val_loss": 0.10159092314670541,
1869
+ "stopped_epoch": 108,
1870
+ "time_s": 3.2152729999506846
1871
+ },
1872
+ {
1873
+ "hp": {
1874
+ "hidden_dims": [
1875
+ 32,
1876
+ 16
1877
+ ],
1878
+ "lr": 0.003,
1879
+ "beta_kl": 0.1,
1880
+ "dropout": 0.1,
1881
+ "weight_decay": 0.0001,
1882
+ "use_pid_loss": true
1883
+ },
1884
+ "val_loss": 0.10142215397316597,
1885
+ "stopped_epoch": 118,
1886
+ "time_s": 3.7818635840085335
1887
+ },
1888
+ {
1889
+ "hp": {
1890
+ "hidden_dims": [
1891
+ 32,
1892
+ 16
1893
+ ],
1894
+ "lr": 0.003,
1895
+ "beta_kl": 0.1,
1896
+ "dropout": 0.2,
1897
+ "weight_decay": 0.0001,
1898
+ "use_pid_loss": true
1899
+ },
1900
+ "val_loss": 0.1014296852612082,
1901
+ "stopped_epoch": 122,
1902
+ "time_s": 3.847522000025492
1903
+ },
1904
+ {
1905
+ "hp": {
1906
+ "hidden_dims": [
1907
+ 32,
1908
+ 16
1909
+ ],
1910
+ "lr": 0.001,
1911
+ "beta_kl": 0.02,
1912
+ "dropout": 0.0,
1913
+ "weight_decay": 0.0001,
1914
+ "use_pid_loss": true
1915
+ },
1916
+ "val_loss": 0.09987101574681398,
1917
+ "stopped_epoch": 115,
1918
+ "time_s": 3.3345228329999372
1919
+ },
1920
+ {
1921
+ "hp": {
1922
+ "hidden_dims": [
1923
+ 32,
1924
+ 16
1925
+ ],
1926
+ "lr": 0.001,
1927
+ "beta_kl": 0.02,
1928
+ "dropout": 0.1,
1929
+ "weight_decay": 0.0001,
1930
+ "use_pid_loss": true
1931
+ },
1932
+ "val_loss": 0.09983301873324234,
1933
+ "stopped_epoch": 117,
1934
+ "time_s": 3.6786156250163913
1935
+ },
1936
+ {
1937
+ "hp": {
1938
+ "hidden_dims": [
1939
+ 32,
1940
+ 16
1941
+ ],
1942
+ "lr": 0.001,
1943
+ "beta_kl": 0.02,
1944
+ "dropout": 0.2,
1945
+ "weight_decay": 0.0001,
1946
+ "use_pid_loss": true
1947
+ },
1948
+ "val_loss": 0.100017885553699,
1949
+ "stopped_epoch": 120,
1950
+ "time_s": 3.7129283329704776
1951
+ },
1952
+ {
1953
+ "hp": {
1954
+ "hidden_dims": [
1955
+ 32,
1956
+ 16
1957
+ ],
1958
+ "lr": 0.001,
1959
+ "beta_kl": 0.05,
1960
+ "dropout": 0.0,
1961
+ "weight_decay": 0.0001,
1962
+ "use_pid_loss": true
1963
+ },
1964
+ "val_loss": 0.1007061679928289,
1965
+ "stopped_epoch": 107,
1966
+ "time_s": 3.160037250025198
1967
+ },
1968
+ {
1969
+ "hp": {
1970
+ "hidden_dims": [
1971
+ 32,
1972
+ 16
1973
+ ],
1974
+ "lr": 0.001,
1975
+ "beta_kl": 0.05,
1976
+ "dropout": 0.1,
1977
+ "weight_decay": 0.0001,
1978
+ "use_pid_loss": true
1979
+ },
1980
+ "val_loss": 0.10030909666436257,
1981
+ "stopped_epoch": 119,
1982
+ "time_s": 3.680276832950767
1983
+ },
1984
+ {
1985
+ "hp": {
1986
+ "hidden_dims": [
1987
+ 32,
1988
+ 16
1989
+ ],
1990
+ "lr": 0.001,
1991
+ "beta_kl": 0.05,
1992
+ "dropout": 0.2,
1993
+ "weight_decay": 0.0001,
1994
+ "use_pid_loss": true
1995
+ },
1996
+ "val_loss": 0.10049859529113495,
1997
+ "stopped_epoch": 120,
1998
+ "time_s": 3.761285582964774
1999
+ },
2000
+ {
2001
+ "hp": {
2002
+ "hidden_dims": [
2003
+ 32,
2004
+ 16
2005
+ ],
2006
+ "lr": 0.001,
2007
+ "beta_kl": 0.1,
2008
+ "dropout": 0.0,
2009
+ "weight_decay": 0.0001,
2010
+ "use_pid_loss": true
2011
+ },
2012
+ "val_loss": 0.10150012266256905,
2013
+ "stopped_epoch": 109,
2014
+ "time_s": 3.2808405829709955
2015
+ },
2016
+ {
2017
+ "hp": {
2018
+ "hidden_dims": [
2019
+ 32,
2020
+ 16
2021
+ ],
2022
+ "lr": 0.001,
2023
+ "beta_kl": 0.1,
2024
+ "dropout": 0.1,
2025
+ "weight_decay": 0.0001,
2026
+ "use_pid_loss": true
2027
+ },
2028
+ "val_loss": 0.10109278930060436,
2029
+ "stopped_epoch": 111,
2030
+ "time_s": 3.5070266670081764
2031
+ },
2032
+ {
2033
+ "hp": {
2034
+ "hidden_dims": [
2035
+ 32,
2036
+ 16
2037
+ ],
2038
+ "lr": 0.001,
2039
+ "beta_kl": 0.1,
2040
+ "dropout": 0.2,
2041
+ "weight_decay": 0.0001,
2042
+ "use_pid_loss": true
2043
+ },
2044
+ "val_loss": 0.10144081082991782,
2045
+ "stopped_epoch": 115,
2046
+ "time_s": 3.6915548749966547
2047
+ },
2048
+ {
2049
+ "hp": {
2050
+ "hidden_dims": [
2051
+ 32,
2052
+ 16
2053
+ ],
2054
+ "lr": 0.0003,
2055
+ "beta_kl": 0.02,
2056
+ "dropout": 0.0,
2057
+ "weight_decay": 0.0001,
2058
+ "use_pid_loss": true
2059
+ },
2060
+ "val_loss": 0.09992979906197917,
2061
+ "stopped_epoch": 113,
2062
+ "time_s": 3.3264151249895804
2063
+ },
2064
+ {
2065
+ "hp": {
2066
+ "hidden_dims": [
2067
+ 32,
2068
+ 16
2069
+ ],
2070
+ "lr": 0.0003,
2071
+ "beta_kl": 0.02,
2072
+ "dropout": 0.1,
2073
+ "weight_decay": 0.0001,
2074
+ "use_pid_loss": true
2075
+ },
2076
+ "val_loss": 0.09998821200146152,
2077
+ "stopped_epoch": 115,
2078
+ "time_s": 3.6174755829852074
2079
+ },
2080
+ {
2081
+ "hp": {
2082
+ "hidden_dims": [
2083
+ 32,
2084
+ 16
2085
+ ],
2086
+ "lr": 0.0003,
2087
+ "beta_kl": 0.02,
2088
+ "dropout": 0.2,
2089
+ "weight_decay": 0.0001,
2090
+ "use_pid_loss": true
2091
+ },
2092
+ "val_loss": 0.1000879502968292,
2093
+ "stopped_epoch": 112,
2094
+ "time_s": 3.492072332999669
2095
+ },
2096
+ {
2097
+ "hp": {
2098
+ "hidden_dims": [
2099
+ 32,
2100
+ 16
2101
+ ],
2102
+ "lr": 0.0003,
2103
+ "beta_kl": 0.05,
2104
+ "dropout": 0.0,
2105
+ "weight_decay": 0.0001,
2106
+ "use_pid_loss": true
2107
+ },
2108
+ "val_loss": 0.10073393393803194,
2109
+ "stopped_epoch": 121,
2110
+ "time_s": 3.5111334589892067
2111
+ },
2112
+ {
2113
+ "hp": {
2114
+ "hidden_dims": [
2115
+ 32,
2116
+ 16
2117
+ ],
2118
+ "lr": 0.0003,
2119
+ "beta_kl": 0.05,
2120
+ "dropout": 0.1,
2121
+ "weight_decay": 0.0001,
2122
+ "use_pid_loss": true
2123
+ },
2124
+ "val_loss": 0.10046835446116552,
2125
+ "stopped_epoch": 123,
2126
+ "time_s": 3.8516207089996897
2127
+ },
2128
+ {
2129
+ "hp": {
2130
+ "hidden_dims": [
2131
+ 32,
2132
+ 16
2133
+ ],
2134
+ "lr": 0.0003,
2135
+ "beta_kl": 0.05,
2136
+ "dropout": 0.2,
2137
+ "weight_decay": 0.0001,
2138
+ "use_pid_loss": true
2139
+ },
2140
+ "val_loss": 0.10024260548670168,
2141
+ "stopped_epoch": 121,
2142
+ "time_s": 3.7040837079985067
2143
+ },
2144
+ {
2145
+ "hp": {
2146
+ "hidden_dims": [
2147
+ 32,
2148
+ 16
2149
+ ],
2150
+ "lr": 0.0003,
2151
+ "beta_kl": 0.1,
2152
+ "dropout": 0.0,
2153
+ "weight_decay": 0.0001,
2154
+ "use_pid_loss": true
2155
+ },
2156
+ "val_loss": 0.10171747121507722,
2157
+ "stopped_epoch": 120,
2158
+ "time_s": 3.53738183301175
2159
+ },
2160
+ {
2161
+ "hp": {
2162
+ "hidden_dims": [
2163
+ 32,
2164
+ 16
2165
+ ],
2166
+ "lr": 0.0003,
2167
+ "beta_kl": 0.1,
2168
+ "dropout": 0.1,
2169
+ "weight_decay": 0.0001,
2170
+ "use_pid_loss": true
2171
+ },
2172
+ "val_loss": 0.10151153794258316,
2173
+ "stopped_epoch": 108,
2174
+ "time_s": 3.3648806249839254
2175
+ },
2176
+ {
2177
+ "hp": {
2178
+ "hidden_dims": [
2179
+ 32,
2180
+ 16
2181
+ ],
2182
+ "lr": 0.0003,
2183
+ "beta_kl": 0.1,
2184
+ "dropout": 0.2,
2185
+ "weight_decay": 0.0001,
2186
+ "use_pid_loss": true
2187
+ },
2188
+ "val_loss": 0.10180397852824602,
2189
+ "stopped_epoch": 118,
2190
+ "time_s": 3.6381992080132477
2191
+ },
2192
+ {
2193
+ "hp": {
2194
+ "hidden_dims": [
2195
+ 64,
2196
+ 32
2197
+ ],
2198
+ "lr": 0.003,
2199
+ "beta_kl": 0.02,
2200
+ "dropout": 0.0,
2201
+ "weight_decay": 0.0001,
2202
+ "use_pid_loss": true
2203
+ },
2204
+ "val_loss": 0.09993160968226504,
2205
+ "stopped_epoch": 112,
2206
+ "time_s": 3.3314534579985775
2207
+ },
2208
+ {
2209
+ "hp": {
2210
+ "hidden_dims": [
2211
+ 64,
2212
+ 32
2213
+ ],
2214
+ "lr": 0.003,
2215
+ "beta_kl": 0.02,
2216
+ "dropout": 0.1,
2217
+ "weight_decay": 0.0001,
2218
+ "use_pid_loss": true
2219
+ },
2220
+ "val_loss": 0.09967074780105856,
2221
+ "stopped_epoch": 112,
2222
+ "time_s": 3.4509888329776004
2223
+ },
2224
+ {
2225
+ "hp": {
2226
+ "hidden_dims": [
2227
+ 64,
2228
+ 32
2229
+ ],
2230
+ "lr": 0.003,
2231
+ "beta_kl": 0.02,
2232
+ "dropout": 0.2,
2233
+ "weight_decay": 0.0001,
2234
+ "use_pid_loss": true
2235
+ },
2236
+ "val_loss": 0.09953705385068938,
2237
+ "stopped_epoch": 112,
2238
+ "time_s": 3.5696694159996696
2239
+ },
2240
+ {
2241
+ "hp": {
2242
+ "hidden_dims": [
2243
+ 64,
2244
+ 32
2245
+ ],
2246
+ "lr": 0.003,
2247
+ "beta_kl": 0.05,
2248
+ "dropout": 0.0,
2249
+ "weight_decay": 0.0001,
2250
+ "use_pid_loss": true
2251
+ },
2252
+ "val_loss": 0.10061087541152976,
2253
+ "stopped_epoch": 117,
2254
+ "time_s": 3.5069861669908278
2255
+ },
2256
+ {
2257
+ "hp": {
2258
+ "hidden_dims": [
2259
+ 64,
2260
+ 32
2261
+ ],
2262
+ "lr": 0.003,
2263
+ "beta_kl": 0.05,
2264
+ "dropout": 0.1,
2265
+ "weight_decay": 0.0001,
2266
+ "use_pid_loss": true
2267
+ },
2268
+ "val_loss": 0.10028560730935521,
2269
+ "stopped_epoch": 113,
2270
+ "time_s": 3.5167379589984193
2271
+ },
2272
+ {
2273
+ "hp": {
2274
+ "hidden_dims": [
2275
+ 64,
2276
+ 32
2277
+ ],
2278
+ "lr": 0.003,
2279
+ "beta_kl": 0.05,
2280
+ "dropout": 0.2,
2281
+ "weight_decay": 0.0001,
2282
+ "use_pid_loss": true
2283
+ },
2284
+ "val_loss": 0.10046777581376147,
2285
+ "stopped_epoch": 116,
2286
+ "time_s": 3.6155172919970937
2287
+ },
2288
+ {
2289
+ "hp": {
2290
+ "hidden_dims": [
2291
+ 64,
2292
+ 32
2293
+ ],
2294
+ "lr": 0.003,
2295
+ "beta_kl": 0.1,
2296
+ "dropout": 0.0,
2297
+ "weight_decay": 0.0001,
2298
+ "use_pid_loss": true
2299
+ },
2300
+ "val_loss": 0.10118137635936626,
2301
+ "stopped_epoch": 116,
2302
+ "time_s": 3.3618514579720795
2303
+ },
2304
+ {
2305
+ "hp": {
2306
+ "hidden_dims": [
2307
+ 64,
2308
+ 32
2309
+ ],
2310
+ "lr": 0.003,
2311
+ "beta_kl": 0.1,
2312
+ "dropout": 0.1,
2313
+ "weight_decay": 0.0001,
2314
+ "use_pid_loss": true
2315
+ },
2316
+ "val_loss": 0.10134138352092291,
2317
+ "stopped_epoch": 115,
2318
+ "time_s": 3.5782984999823384
2319
+ },
2320
+ {
2321
+ "hp": {
2322
+ "hidden_dims": [
2323
+ 64,
2324
+ 32
2325
+ ],
2326
+ "lr": 0.003,
2327
+ "beta_kl": 0.1,
2328
+ "dropout": 0.2,
2329
+ "weight_decay": 0.0001,
2330
+ "use_pid_loss": true
2331
+ },
2332
+ "val_loss": 0.1014707936458505,
2333
+ "stopped_epoch": 110,
2334
+ "time_s": 3.3920865419786423
2335
+ },
2336
+ {
2337
+ "hp": {
2338
+ "hidden_dims": [
2339
+ 64,
2340
+ 32
2341
+ ],
2342
+ "lr": 0.001,
2343
+ "beta_kl": 0.02,
2344
+ "dropout": 0.0,
2345
+ "weight_decay": 0.0001,
2346
+ "use_pid_loss": true
2347
+ },
2348
+ "val_loss": 0.10020166876688169,
2349
+ "stopped_epoch": 112,
2350
+ "time_s": 3.2913420000113547
2351
+ },
2352
+ {
2353
+ "hp": {
2354
+ "hidden_dims": [
2355
+ 64,
2356
+ 32
2357
+ ],
2358
+ "lr": 0.001,
2359
+ "beta_kl": 0.02,
2360
+ "dropout": 0.1,
2361
+ "weight_decay": 0.0001,
2362
+ "use_pid_loss": true
2363
+ },
2364
+ "val_loss": 0.10019513508143453,
2365
+ "stopped_epoch": 109,
2366
+ "time_s": 3.4098627499770373
2367
+ },
2368
+ {
2369
+ "hp": {
2370
+ "hidden_dims": [
2371
+ 64,
2372
+ 32
2373
+ ],
2374
+ "lr": 0.001,
2375
+ "beta_kl": 0.02,
2376
+ "dropout": 0.2,
2377
+ "weight_decay": 0.0001,
2378
+ "use_pid_loss": true
2379
+ },
2380
+ "val_loss": 0.09960498613429207,
2381
+ "stopped_epoch": 118,
2382
+ "time_s": 3.722041499975603
2383
+ },
2384
+ {
2385
+ "hp": {
2386
+ "hidden_dims": [
2387
+ 64,
2388
+ 32
2389
+ ],
2390
+ "lr": 0.001,
2391
+ "beta_kl": 0.05,
2392
+ "dropout": 0.0,
2393
+ "weight_decay": 0.0001,
2394
+ "use_pid_loss": true
2395
+ },
2396
+ "val_loss": 0.10057421579870875,
2397
+ "stopped_epoch": 108,
2398
+ "time_s": 3.1797627500491217
2399
+ },
2400
+ {
2401
+ "hp": {
2402
+ "hidden_dims": [
2403
+ 64,
2404
+ 32
2405
+ ],
2406
+ "lr": 0.001,
2407
+ "beta_kl": 0.05,
2408
+ "dropout": 0.1,
2409
+ "weight_decay": 0.0001,
2410
+ "use_pid_loss": true
2411
+ },
2412
+ "val_loss": 0.10083792970187402,
2413
+ "stopped_epoch": 107,
2414
+ "time_s": 3.3281579579925165
2415
+ },
2416
+ {
2417
+ "hp": {
2418
+ "hidden_dims": [
2419
+ 64,
2420
+ 32
2421
+ ],
2422
+ "lr": 0.001,
2423
+ "beta_kl": 0.05,
2424
+ "dropout": 0.2,
2425
+ "weight_decay": 0.0001,
2426
+ "use_pid_loss": true
2427
+ },
2428
+ "val_loss": 0.1003585527139592,
2429
+ "stopped_epoch": 119,
2430
+ "time_s": 3.6722764580044895
2431
+ },
2432
+ {
2433
+ "hp": {
2434
+ "hidden_dims": [
2435
+ 64,
2436
+ 32
2437
+ ],
2438
+ "lr": 0.001,
2439
+ "beta_kl": 0.1,
2440
+ "dropout": 0.0,
2441
+ "weight_decay": 0.0001,
2442
+ "use_pid_loss": true
2443
+ },
2444
+ "val_loss": 0.10174657585303908,
2445
+ "stopped_epoch": 107,
2446
+ "time_s": 3.1175755419535562
2447
+ },
2448
+ {
2449
+ "hp": {
2450
+ "hidden_dims": [
2451
+ 64,
2452
+ 32
2453
+ ],
2454
+ "lr": 0.001,
2455
+ "beta_kl": 0.1,
2456
+ "dropout": 0.1,
2457
+ "weight_decay": 0.0001,
2458
+ "use_pid_loss": true
2459
+ },
2460
+ "val_loss": 0.10160301397473825,
2461
+ "stopped_epoch": 117,
2462
+ "time_s": 3.719696250045672
2463
+ },
2464
+ {
2465
+ "hp": {
2466
+ "hidden_dims": [
2467
+ 64,
2468
+ 32
2469
+ ],
2470
+ "lr": 0.001,
2471
+ "beta_kl": 0.1,
2472
+ "dropout": 0.2,
2473
+ "weight_decay": 0.0001,
2474
+ "use_pid_loss": true
2475
+ },
2476
+ "val_loss": 0.10133520785094685,
2477
+ "stopped_epoch": 123,
2478
+ "time_s": 3.8800283330492675
2479
+ },
2480
+ {
2481
+ "hp": {
2482
+ "hidden_dims": [
2483
+ 64,
2484
+ 32
2485
+ ],
2486
+ "lr": 0.0003,
2487
+ "beta_kl": 0.02,
2488
+ "dropout": 0.0,
2489
+ "weight_decay": 0.0001,
2490
+ "use_pid_loss": true
2491
+ },
2492
+ "val_loss": 0.10025234137139569,
2493
+ "stopped_epoch": 104,
2494
+ "time_s": 3.0465412079938687
2495
+ },
2496
+ {
2497
+ "hp": {
2498
+ "hidden_dims": [
2499
+ 64,
2500
+ 32
2501
+ ],
2502
+ "lr": 0.0003,
2503
+ "beta_kl": 0.02,
2504
+ "dropout": 0.1,
2505
+ "weight_decay": 0.0001,
2506
+ "use_pid_loss": true
2507
+ },
2508
+ "val_loss": 0.09963718461508007,
2509
+ "stopped_epoch": 118,
2510
+ "time_s": 3.704440625035204
2511
+ },
2512
+ {
2513
+ "hp": {
2514
+ "hidden_dims": [
2515
+ 64,
2516
+ 32
2517
+ ],
2518
+ "lr": 0.0003,
2519
+ "beta_kl": 0.02,
2520
+ "dropout": 0.2,
2521
+ "weight_decay": 0.0001,
2522
+ "use_pid_loss": true
2523
+ },
2524
+ "val_loss": 0.1003386748663952,
2525
+ "stopped_epoch": 120,
2526
+ "time_s": 3.793109167017974
2527
+ },
2528
+ {
2529
+ "hp": {
2530
+ "hidden_dims": [
2531
+ 64,
2532
+ 32
2533
+ ],
2534
+ "lr": 0.0003,
2535
+ "beta_kl": 0.05,
2536
+ "dropout": 0.0,
2537
+ "weight_decay": 0.0001,
2538
+ "use_pid_loss": true
2539
+ },
2540
+ "val_loss": 0.10046426641803256,
2541
+ "stopped_epoch": 117,
2542
+ "time_s": 3.416684208030347
2543
+ },
2544
+ {
2545
+ "hp": {
2546
+ "hidden_dims": [
2547
+ 64,
2548
+ 32
2549
+ ],
2550
+ "lr": 0.0003,
2551
+ "beta_kl": 0.05,
2552
+ "dropout": 0.1,
2553
+ "weight_decay": 0.0001,
2554
+ "use_pid_loss": true
2555
+ },
2556
+ "val_loss": 0.10103861884230134,
2557
+ "stopped_epoch": 106,
2558
+ "time_s": 3.315584291005507
2559
+ },
2560
+ {
2561
+ "hp": {
2562
+ "hidden_dims": [
2563
+ 64,
2564
+ 32
2565
+ ],
2566
+ "lr": 0.0003,
2567
+ "beta_kl": 0.05,
2568
+ "dropout": 0.2,
2569
+ "weight_decay": 0.0001,
2570
+ "use_pid_loss": true
2571
+ },
2572
+ "val_loss": 0.10069953891410993,
2573
+ "stopped_epoch": 119,
2574
+ "time_s": 3.657582541985903
2575
+ },
2576
+ {
2577
+ "hp": {
2578
+ "hidden_dims": [
2579
+ 64,
2580
+ 32
2581
+ ],
2582
+ "lr": 0.0003,
2583
+ "beta_kl": 0.1,
2584
+ "dropout": 0.0,
2585
+ "weight_decay": 0.0001,
2586
+ "use_pid_loss": true
2587
+ },
2588
+ "val_loss": 0.10164152158994895,
2589
+ "stopped_epoch": 108,
2590
+ "time_s": 3.1802500410121866
2591
+ },
2592
+ {
2593
+ "hp": {
2594
+ "hidden_dims": [
2595
+ 64,
2596
+ 32
2597
+ ],
2598
+ "lr": 0.0003,
2599
+ "beta_kl": 0.1,
2600
+ "dropout": 0.1,
2601
+ "weight_decay": 0.0001,
2602
+ "use_pid_loss": true
2603
+ },
2604
+ "val_loss": 0.10177982770810927,
2605
+ "stopped_epoch": 104,
2606
+ "time_s": 3.185938791022636
2607
+ },
2608
+ {
2609
+ "hp": {
2610
+ "hidden_dims": [
2611
+ 64,
2612
+ 32
2613
+ ],
2614
+ "lr": 0.0003,
2615
+ "beta_kl": 0.1,
2616
+ "dropout": 0.2,
2617
+ "weight_decay": 0.0001,
2618
+ "use_pid_loss": true
2619
+ },
2620
+ "val_loss": 0.10127968875142192,
2621
+ "stopped_epoch": 117,
2622
+ "time_s": 3.7299201249843463
2623
+ },
2624
+ {
2625
+ "hp": {
2626
+ "hidden_dims": [
2627
+ 128,
2628
+ 64
2629
+ ],
2630
+ "lr": 0.003,
2631
+ "beta_kl": 0.02,
2632
+ "dropout": 0.0,
2633
+ "weight_decay": 0.0001,
2634
+ "use_pid_loss": true
2635
+ },
2636
+ "val_loss": 0.09985922410481238,
2637
+ "stopped_epoch": 113,
2638
+ "time_s": 3.287646041950211
2639
+ },
2640
+ {
2641
+ "hp": {
2642
+ "hidden_dims": [
2643
+ 128,
2644
+ 64
2645
+ ],
2646
+ "lr": 0.003,
2647
+ "beta_kl": 0.02,
2648
+ "dropout": 0.1,
2649
+ "weight_decay": 0.0001,
2650
+ "use_pid_loss": true
2651
+ },
2652
+ "val_loss": 0.09990367771400882,
2653
+ "stopped_epoch": 115,
2654
+ "time_s": 3.675513625028543
2655
+ },
2656
+ {
2657
+ "hp": {
2658
+ "hidden_dims": [
2659
+ 128,
2660
+ 64
2661
+ ],
2662
+ "lr": 0.003,
2663
+ "beta_kl": 0.02,
2664
+ "dropout": 0.2,
2665
+ "weight_decay": 0.0001,
2666
+ "use_pid_loss": true
2667
+ },
2668
+ "val_loss": 0.10004977800081231,
2669
+ "stopped_epoch": 115,
2670
+ "time_s": 3.5951722080353647
2671
+ },
2672
+ {
2673
+ "hp": {
2674
+ "hidden_dims": [
2675
+ 128,
2676
+ 64
2677
+ ],
2678
+ "lr": 0.003,
2679
+ "beta_kl": 0.05,
2680
+ "dropout": 0.0,
2681
+ "weight_decay": 0.0001,
2682
+ "use_pid_loss": true
2683
+ },
2684
+ "val_loss": 0.10060431002881486,
2685
+ "stopped_epoch": 115,
2686
+ "time_s": 3.3754059160128236
2687
+ },
2688
+ {
2689
+ "hp": {
2690
+ "hidden_dims": [
2691
+ 128,
2692
+ 64
2693
+ ],
2694
+ "lr": 0.003,
2695
+ "beta_kl": 0.05,
2696
+ "dropout": 0.1,
2697
+ "weight_decay": 0.0001,
2698
+ "use_pid_loss": true
2699
+ },
2700
+ "val_loss": 0.10045119521418058,
2701
+ "stopped_epoch": 116,
2702
+ "time_s": 3.652446708001662
2703
+ },
2704
+ {
2705
+ "hp": {
2706
+ "hidden_dims": [
2707
+ 128,
2708
+ 64
2709
+ ],
2710
+ "lr": 0.003,
2711
+ "beta_kl": 0.05,
2712
+ "dropout": 0.2,
2713
+ "weight_decay": 0.0001,
2714
+ "use_pid_loss": true
2715
+ },
2716
+ "val_loss": 0.10013833676459473,
2717
+ "stopped_epoch": 120,
2718
+ "time_s": 3.7072132079629228
2719
+ },
2720
+ {
2721
+ "hp": {
2722
+ "hidden_dims": [
2723
+ 128,
2724
+ 64
2725
+ ],
2726
+ "lr": 0.003,
2727
+ "beta_kl": 0.1,
2728
+ "dropout": 0.0,
2729
+ "weight_decay": 0.0001,
2730
+ "use_pid_loss": true
2731
+ },
2732
+ "val_loss": 0.10139421265938378,
2733
+ "stopped_epoch": 111,
2734
+ "time_s": 3.3141994169563986
2735
+ },
2736
+ {
2737
+ "hp": {
2738
+ "hidden_dims": [
2739
+ 128,
2740
+ 64
2741
+ ],
2742
+ "lr": 0.003,
2743
+ "beta_kl": 0.1,
2744
+ "dropout": 0.1,
2745
+ "weight_decay": 0.0001,
2746
+ "use_pid_loss": true
2747
+ },
2748
+ "val_loss": 0.10159098779018215,
2749
+ "stopped_epoch": 114,
2750
+ "time_s": 3.5245656670304015
2751
+ },
2752
+ {
2753
+ "hp": {
2754
+ "hidden_dims": [
2755
+ 128,
2756
+ 64
2757
+ ],
2758
+ "lr": 0.003,
2759
+ "beta_kl": 0.1,
2760
+ "dropout": 0.2,
2761
+ "weight_decay": 0.0001,
2762
+ "use_pid_loss": true
2763
+ },
2764
+ "val_loss": 0.10182442150019497,
2765
+ "stopped_epoch": 119,
2766
+ "time_s": 3.7169010829529725
2767
+ },
2768
+ {
2769
+ "hp": {
2770
+ "hidden_dims": [
2771
+ 128,
2772
+ 64
2773
+ ],
2774
+ "lr": 0.001,
2775
+ "beta_kl": 0.02,
2776
+ "dropout": 0.0,
2777
+ "weight_decay": 0.0001,
2778
+ "use_pid_loss": true
2779
+ },
2780
+ "val_loss": 0.09998827556826476,
2781
+ "stopped_epoch": 113,
2782
+ "time_s": 3.3213011659681797
2783
+ },
2784
+ {
2785
+ "hp": {
2786
+ "hidden_dims": [
2787
+ 128,
2788
+ 64
2789
+ ],
2790
+ "lr": 0.001,
2791
+ "beta_kl": 0.02,
2792
+ "dropout": 0.1,
2793
+ "weight_decay": 0.0001,
2794
+ "use_pid_loss": true
2795
+ },
2796
+ "val_loss": 0.09971228138559815,
2797
+ "stopped_epoch": 114,
2798
+ "time_s": 3.548976624966599
2799
+ },
2800
+ {
2801
+ "hp": {
2802
+ "hidden_dims": [
2803
+ 128,
2804
+ 64
2805
+ ],
2806
+ "lr": 0.001,
2807
+ "beta_kl": 0.02,
2808
+ "dropout": 0.2,
2809
+ "weight_decay": 0.0001,
2810
+ "use_pid_loss": true
2811
+ },
2812
+ "val_loss": 0.09980365000880523,
2813
+ "stopped_epoch": 117,
2814
+ "time_s": 3.7031168750254437
2815
+ },
2816
+ {
2817
+ "hp": {
2818
+ "hidden_dims": [
2819
+ 128,
2820
+ 64
2821
+ ],
2822
+ "lr": 0.001,
2823
+ "beta_kl": 0.05,
2824
+ "dropout": 0.0,
2825
+ "weight_decay": 0.0001,
2826
+ "use_pid_loss": true
2827
+ },
2828
+ "val_loss": 0.10074372425933793,
2829
+ "stopped_epoch": 103,
2830
+ "time_s": 3.0002782499650493
2831
+ },
2832
+ {
2833
+ "hp": {
2834
+ "hidden_dims": [
2835
+ 128,
2836
+ 64
2837
+ ],
2838
+ "lr": 0.001,
2839
+ "beta_kl": 0.05,
2840
+ "dropout": 0.1,
2841
+ "weight_decay": 0.0001,
2842
+ "use_pid_loss": true
2843
+ },
2844
+ "val_loss": 0.10058945826540104,
2845
+ "stopped_epoch": 117,
2846
+ "time_s": 3.645035583002027
2847
+ },
2848
+ {
2849
+ "hp": {
2850
+ "hidden_dims": [
2851
+ 128,
2852
+ 64
2853
+ ],
2854
+ "lr": 0.001,
2855
+ "beta_kl": 0.05,
2856
+ "dropout": 0.2,
2857
+ "weight_decay": 0.0001,
2858
+ "use_pid_loss": true
2859
+ },
2860
+ "val_loss": 0.1004295109668908,
2861
+ "stopped_epoch": 113,
2862
+ "time_s": 3.5120147499837913
2863
+ },
2864
+ {
2865
+ "hp": {
2866
+ "hidden_dims": [
2867
+ 128,
2868
+ 64
2869
+ ],
2870
+ "lr": 0.001,
2871
+ "beta_kl": 0.1,
2872
+ "dropout": 0.0,
2873
+ "weight_decay": 0.0001,
2874
+ "use_pid_loss": true
2875
+ },
2876
+ "val_loss": 0.10173731564269589,
2877
+ "stopped_epoch": 105,
2878
+ "time_s": 3.1309280000277795
2879
+ },
2880
+ {
2881
+ "hp": {
2882
+ "hidden_dims": [
2883
+ 128,
2884
+ 64
2885
+ ],
2886
+ "lr": 0.001,
2887
+ "beta_kl": 0.1,
2888
+ "dropout": 0.1,
2889
+ "weight_decay": 0.0001,
2890
+ "use_pid_loss": true
2891
+ },
2892
+ "val_loss": 0.10171924509926339,
2893
+ "stopped_epoch": 115,
2894
+ "time_s": 3.5972887499956414
2895
+ },
2896
+ {
2897
+ "hp": {
2898
+ "hidden_dims": [
2899
+ 128,
2900
+ 64
2901
+ ],
2902
+ "lr": 0.001,
2903
+ "beta_kl": 0.1,
2904
+ "dropout": 0.2,
2905
+ "weight_decay": 0.0001,
2906
+ "use_pid_loss": true
2907
+ },
2908
+ "val_loss": 0.10134396120647475,
2909
+ "stopped_epoch": 122,
2910
+ "time_s": 3.7915082499966957
2911
+ },
2912
+ {
2913
+ "hp": {
2914
+ "hidden_dims": [
2915
+ 128,
2916
+ 64
2917
+ ],
2918
+ "lr": 0.0003,
2919
+ "beta_kl": 0.02,
2920
+ "dropout": 0.0,
2921
+ "weight_decay": 0.0001,
2922
+ "use_pid_loss": true
2923
+ },
2924
+ "val_loss": 0.1002227241076486,
2925
+ "stopped_epoch": 104,
2926
+ "time_s": 3.086728749971371
2927
+ },
2928
+ {
2929
+ "hp": {
2930
+ "hidden_dims": [
2931
+ 128,
2932
+ 64
2933
+ ],
2934
+ "lr": 0.0003,
2935
+ "beta_kl": 0.02,
2936
+ "dropout": 0.1,
2937
+ "weight_decay": 0.0001,
2938
+ "use_pid_loss": true
2939
+ },
2940
+ "val_loss": 0.09969229121945497,
2941
+ "stopped_epoch": 115,
2942
+ "time_s": 3.617608916014433
2943
+ },
2944
+ {
2945
+ "hp": {
2946
+ "hidden_dims": [
2947
+ 128,
2948
+ 64
2949
+ ],
2950
+ "lr": 0.0003,
2951
+ "beta_kl": 0.02,
2952
+ "dropout": 0.2,
2953
+ "weight_decay": 0.0001,
2954
+ "use_pid_loss": true
2955
+ },
2956
+ "val_loss": 0.10025505704342286,
2957
+ "stopped_epoch": 112,
2958
+ "time_s": 3.5011124159791507
2959
+ },
2960
+ {
2961
+ "hp": {
2962
+ "hidden_dims": [
2963
+ 128,
2964
+ 64
2965
+ ],
2966
+ "lr": 0.0003,
2967
+ "beta_kl": 0.05,
2968
+ "dropout": 0.0,
2969
+ "weight_decay": 0.0001,
2970
+ "use_pid_loss": true
2971
+ },
2972
+ "val_loss": 0.10078207657516347,
2973
+ "stopped_epoch": 104,
2974
+ "time_s": 3.1034459579968825
2975
+ },
2976
+ {
2977
+ "hp": {
2978
+ "hidden_dims": [
2979
+ 128,
2980
+ 64
2981
+ ],
2982
+ "lr": 0.0003,
2983
+ "beta_kl": 0.05,
2984
+ "dropout": 0.1,
2985
+ "weight_decay": 0.0001,
2986
+ "use_pid_loss": true
2987
+ },
2988
+ "val_loss": 0.10049004990585966,
2989
+ "stopped_epoch": 111,
2990
+ "time_s": 3.4338022499578074
2991
+ },
2992
+ {
2993
+ "hp": {
2994
+ "hidden_dims": [
2995
+ 128,
2996
+ 64
2997
+ ],
2998
+ "lr": 0.0003,
2999
+ "beta_kl": 0.05,
3000
+ "dropout": 0.2,
3001
+ "weight_decay": 0.0001,
3002
+ "use_pid_loss": true
3003
+ },
3004
+ "val_loss": 0.10097866333116685,
3005
+ "stopped_epoch": 116,
3006
+ "time_s": 3.695972791989334
3007
+ },
3008
+ {
3009
+ "hp": {
3010
+ "hidden_dims": [
3011
+ 128,
3012
+ 64
3013
+ ],
3014
+ "lr": 0.0003,
3015
+ "beta_kl": 0.1,
3016
+ "dropout": 0.0,
3017
+ "weight_decay": 0.0001,
3018
+ "use_pid_loss": true
3019
+ },
3020
+ "val_loss": 0.1021029808359339,
3021
+ "stopped_epoch": 104,
3022
+ "time_s": 3.094552583002951
3023
+ },
3024
+ {
3025
+ "hp": {
3026
+ "hidden_dims": [
3027
+ 128,
3028
+ 64
3029
+ ],
3030
+ "lr": 0.0003,
3031
+ "beta_kl": 0.1,
3032
+ "dropout": 0.1,
3033
+ "weight_decay": 0.0001,
3034
+ "use_pid_loss": true
3035
+ },
3036
+ "val_loss": 0.1015732720752672,
3037
+ "stopped_epoch": 102,
3038
+ "time_s": 3.3792612500255927
3039
+ },
3040
+ {
3041
+ "hp": {
3042
+ "hidden_dims": [
3043
+ 128,
3044
+ 64
3045
+ ],
3046
+ "lr": 0.0003,
3047
+ "beta_kl": 0.1,
3048
+ "dropout": 0.2,
3049
+ "weight_decay": 0.0001,
3050
+ "use_pid_loss": true
3051
+ },
3052
+ "val_loss": 0.10211800818326157,
3053
+ "stopped_epoch": 103,
3054
+ "time_s": 3.2438625000067987
3055
+ },
3056
+ {
3057
+ "hp": {
3058
+ "hidden_dims": [
3059
+ 128,
3060
+ 64,
3061
+ 32
3062
+ ],
3063
+ "lr": 0.003,
3064
+ "beta_kl": 0.02,
3065
+ "dropout": 0.0,
3066
+ "weight_decay": 0.0001,
3067
+ "use_pid_loss": true
3068
+ },
3069
+ "val_loss": 0.09977769339187986,
3070
+ "stopped_epoch": 109,
3071
+ "time_s": 4.1589582079905085
3072
+ },
3073
+ {
3074
+ "hp": {
3075
+ "hidden_dims": [
3076
+ 128,
3077
+ 64,
3078
+ 32
3079
+ ],
3080
+ "lr": 0.003,
3081
+ "beta_kl": 0.02,
3082
+ "dropout": 0.1,
3083
+ "weight_decay": 0.0001,
3084
+ "use_pid_loss": true
3085
+ },
3086
+ "val_loss": 0.0999079815225105,
3087
+ "stopped_epoch": 118,
3088
+ "time_s": 4.701027208997402
3089
+ },
3090
+ {
3091
+ "hp": {
3092
+ "hidden_dims": [
3093
+ 128,
3094
+ 64,
3095
+ 32
3096
+ ],
3097
+ "lr": 0.003,
3098
+ "beta_kl": 0.02,
3099
+ "dropout": 0.2,
3100
+ "weight_decay": 0.0001,
3101
+ "use_pid_loss": true
3102
+ },
3103
+ "val_loss": 0.09967796478657365,
3104
+ "stopped_epoch": 108,
3105
+ "time_s": 4.272658915957436
3106
+ },
3107
+ {
3108
+ "hp": {
3109
+ "hidden_dims": [
3110
+ 128,
3111
+ 64,
3112
+ 32
3113
+ ],
3114
+ "lr": 0.003,
3115
+ "beta_kl": 0.05,
3116
+ "dropout": 0.0,
3117
+ "weight_decay": 0.0001,
3118
+ "use_pid_loss": true
3119
+ },
3120
+ "val_loss": 0.10056822514430636,
3121
+ "stopped_epoch": 110,
3122
+ "time_s": 4.034177790977992
3123
+ },
3124
+ {
3125
+ "hp": {
3126
+ "hidden_dims": [
3127
+ 128,
3128
+ 64,
3129
+ 32
3130
+ ],
3131
+ "lr": 0.003,
3132
+ "beta_kl": 0.05,
3133
+ "dropout": 0.1,
3134
+ "weight_decay": 0.0001,
3135
+ "use_pid_loss": true
3136
+ },
3137
+ "val_loss": 0.10046395702513657,
3138
+ "stopped_epoch": 120,
3139
+ "time_s": 4.709886667027604
3140
+ },
3141
+ {
3142
+ "hp": {
3143
+ "hidden_dims": [
3144
+ 128,
3145
+ 64,
3146
+ 32
3147
+ ],
3148
+ "lr": 0.003,
3149
+ "beta_kl": 0.05,
3150
+ "dropout": 0.2,
3151
+ "weight_decay": 0.0001,
3152
+ "use_pid_loss": true
3153
+ },
3154
+ "val_loss": 0.10072425175781195,
3155
+ "stopped_epoch": 118,
3156
+ "time_s": 4.633909999975003
3157
+ },
3158
+ {
3159
+ "hp": {
3160
+ "hidden_dims": [
3161
+ 128,
3162
+ 64,
3163
+ 32
3164
+ ],
3165
+ "lr": 0.003,
3166
+ "beta_kl": 0.1,
3167
+ "dropout": 0.0,
3168
+ "weight_decay": 0.0001,
3169
+ "use_pid_loss": true
3170
+ },
3171
+ "val_loss": 0.1017374399769513,
3172
+ "stopped_epoch": 106,
3173
+ "time_s": 3.8571577499969862
3174
+ },
3175
+ {
3176
+ "hp": {
3177
+ "hidden_dims": [
3178
+ 128,
3179
+ 64,
3180
+ 32
3181
+ ],
3182
+ "lr": 0.003,
3183
+ "beta_kl": 0.1,
3184
+ "dropout": 0.1,
3185
+ "weight_decay": 0.0001,
3186
+ "use_pid_loss": true
3187
+ },
3188
+ "val_loss": 0.10173432709853773,
3189
+ "stopped_epoch": 110,
3190
+ "time_s": 4.408030332997441
3191
+ },
3192
+ {
3193
+ "hp": {
3194
+ "hidden_dims": [
3195
+ 128,
3196
+ 64,
3197
+ 32
3198
+ ],
3199
+ "lr": 0.003,
3200
+ "beta_kl": 0.1,
3201
+ "dropout": 0.2,
3202
+ "weight_decay": 0.0001,
3203
+ "use_pid_loss": true
3204
+ },
3205
+ "val_loss": 0.10138227174736861,
3206
+ "stopped_epoch": 116,
3207
+ "time_s": 4.686070125026163
3208
+ },
3209
+ {
3210
+ "hp": {
3211
+ "hidden_dims": [
3212
+ 128,
3213
+ 64,
3214
+ 32
3215
+ ],
3216
+ "lr": 0.001,
3217
+ "beta_kl": 0.02,
3218
+ "dropout": 0.0,
3219
+ "weight_decay": 0.0001,
3220
+ "use_pid_loss": true
3221
+ },
3222
+ "val_loss": 0.09979622057407578,
3223
+ "stopped_epoch": 114,
3224
+ "time_s": 4.152623708010651
3225
+ },
3226
+ {
3227
+ "hp": {
3228
+ "hidden_dims": [
3229
+ 128,
3230
+ 64,
3231
+ 32
3232
+ ],
3233
+ "lr": 0.001,
3234
+ "beta_kl": 0.02,
3235
+ "dropout": 0.1,
3236
+ "weight_decay": 0.0001,
3237
+ "use_pid_loss": true
3238
+ },
3239
+ "val_loss": 0.09978852705287107,
3240
+ "stopped_epoch": 110,
3241
+ "time_s": 4.2832466249819845
3242
+ },
3243
+ {
3244
+ "hp": {
3245
+ "hidden_dims": [
3246
+ 128,
3247
+ 64,
3248
+ 32
3249
+ ],
3250
+ "lr": 0.001,
3251
+ "beta_kl": 0.02,
3252
+ "dropout": 0.2,
3253
+ "weight_decay": 0.0001,
3254
+ "use_pid_loss": true
3255
+ },
3256
+ "val_loss": 0.09976707351517815,
3257
+ "stopped_epoch": 115,
3258
+ "time_s": 4.480864625016693
3259
+ },
3260
+ {
3261
+ "hp": {
3262
+ "hidden_dims": [
3263
+ 128,
3264
+ 64,
3265
+ 32
3266
+ ],
3267
+ "lr": 0.001,
3268
+ "beta_kl": 0.05,
3269
+ "dropout": 0.0,
3270
+ "weight_decay": 0.0001,
3271
+ "use_pid_loss": true
3272
+ },
3273
+ "val_loss": 0.10056655815226494,
3274
+ "stopped_epoch": 105,
3275
+ "time_s": 3.828549500030931
3276
+ },
3277
+ {
3278
+ "hp": {
3279
+ "hidden_dims": [
3280
+ 128,
3281
+ 64,
3282
+ 32
3283
+ ],
3284
+ "lr": 0.001,
3285
+ "beta_kl": 0.05,
3286
+ "dropout": 0.1,
3287
+ "weight_decay": 0.0001,
3288
+ "use_pid_loss": true
3289
+ },
3290
+ "val_loss": 0.10027094637555194,
3291
+ "stopped_epoch": 118,
3292
+ "time_s": 4.661708125029691
3293
+ },
3294
+ {
3295
+ "hp": {
3296
+ "hidden_dims": [
3297
+ 128,
3298
+ 64,
3299
+ 32
3300
+ ],
3301
+ "lr": 0.001,
3302
+ "beta_kl": 0.05,
3303
+ "dropout": 0.2,
3304
+ "weight_decay": 0.0001,
3305
+ "use_pid_loss": true
3306
+ },
3307
+ "val_loss": 0.10006732967375331,
3308
+ "stopped_epoch": 115,
3309
+ "time_s": 4.516163124993909
3310
+ },
3311
+ {
3312
+ "hp": {
3313
+ "hidden_dims": [
3314
+ 128,
3315
+ 64,
3316
+ 32
3317
+ ],
3318
+ "lr": 0.001,
3319
+ "beta_kl": 0.1,
3320
+ "dropout": 0.0,
3321
+ "weight_decay": 0.0001,
3322
+ "use_pid_loss": true
3323
+ },
3324
+ "val_loss": 0.10170130931228571,
3325
+ "stopped_epoch": 107,
3326
+ "time_s": 3.962069708039053
3327
+ },
3328
+ {
3329
+ "hp": {
3330
+ "hidden_dims": [
3331
+ 128,
3332
+ 64,
3333
+ 32
3334
+ ],
3335
+ "lr": 0.001,
3336
+ "beta_kl": 0.1,
3337
+ "dropout": 0.1,
3338
+ "weight_decay": 0.0001,
3339
+ "use_pid_loss": true
3340
+ },
3341
+ "val_loss": 0.10164271101269419,
3342
+ "stopped_epoch": 107,
3343
+ "time_s": 4.276909207983408
3344
+ },
3345
+ {
3346
+ "hp": {
3347
+ "hidden_dims": [
3348
+ 128,
3349
+ 64,
3350
+ 32
3351
+ ],
3352
+ "lr": 0.001,
3353
+ "beta_kl": 0.1,
3354
+ "dropout": 0.2,
3355
+ "weight_decay": 0.0001,
3356
+ "use_pid_loss": true
3357
+ },
3358
+ "val_loss": 0.10138021297537522,
3359
+ "stopped_epoch": 112,
3360
+ "time_s": 4.4507720829569735
3361
+ },
3362
+ {
3363
+ "hp": {
3364
+ "hidden_dims": [
3365
+ 128,
3366
+ 64,
3367
+ 32
3368
+ ],
3369
+ "lr": 0.0003,
3370
+ "beta_kl": 0.02,
3371
+ "dropout": 0.0,
3372
+ "weight_decay": 0.0001,
3373
+ "use_pid_loss": true
3374
+ },
3375
+ "val_loss": 0.10023869556843201,
3376
+ "stopped_epoch": 106,
3377
+ "time_s": 3.941070375032723
3378
+ },
3379
+ {
3380
+ "hp": {
3381
+ "hidden_dims": [
3382
+ 128,
3383
+ 64,
3384
+ 32
3385
+ ],
3386
+ "lr": 0.0003,
3387
+ "beta_kl": 0.02,
3388
+ "dropout": 0.1,
3389
+ "weight_decay": 0.0001,
3390
+ "use_pid_loss": true
3391
+ },
3392
+ "val_loss": 0.09969418599254135,
3393
+ "stopped_epoch": 113,
3394
+ "time_s": 4.537661709007807
3395
+ },
3396
+ {
3397
+ "hp": {
3398
+ "hidden_dims": [
3399
+ 128,
3400
+ 64,
3401
+ 32
3402
+ ],
3403
+ "lr": 0.0003,
3404
+ "beta_kl": 0.02,
3405
+ "dropout": 0.2,
3406
+ "weight_decay": 0.0001,
3407
+ "use_pid_loss": true
3408
+ },
3409
+ "val_loss": 0.09968935913605496,
3410
+ "stopped_epoch": 123,
3411
+ "time_s": 4.793168333009817
3412
+ },
3413
+ {
3414
+ "hp": {
3415
+ "hidden_dims": [
3416
+ 128,
3417
+ 64,
3418
+ 32
3419
+ ],
3420
+ "lr": 0.0003,
3421
+ "beta_kl": 0.05,
3422
+ "dropout": 0.0,
3423
+ "weight_decay": 0.0001,
3424
+ "use_pid_loss": true
3425
+ },
3426
+ "val_loss": 0.10051763087855597,
3427
+ "stopped_epoch": 105,
3428
+ "time_s": 3.8117689169594087
3429
+ },
3430
+ {
3431
+ "hp": {
3432
+ "hidden_dims": [
3433
+ 128,
3434
+ 64,
3435
+ 32
3436
+ ],
3437
+ "lr": 0.0003,
3438
+ "beta_kl": 0.05,
3439
+ "dropout": 0.1,
3440
+ "weight_decay": 0.0001,
3441
+ "use_pid_loss": true
3442
+ },
3443
+ "val_loss": 0.10028412447154866,
3444
+ "stopped_epoch": 109,
3445
+ "time_s": 4.260053374979179
3446
+ },
3447
+ {
3448
+ "hp": {
3449
+ "hidden_dims": [
3450
+ 128,
3451
+ 64,
3452
+ 32
3453
+ ],
3454
+ "lr": 0.0003,
3455
+ "beta_kl": 0.05,
3456
+ "dropout": 0.2,
3457
+ "weight_decay": 0.0001,
3458
+ "use_pid_loss": true
3459
+ },
3460
+ "val_loss": 0.10085658574035403,
3461
+ "stopped_epoch": 104,
3462
+ "time_s": 4.087226709001698
3463
+ },
3464
+ {
3465
+ "hp": {
3466
+ "hidden_dims": [
3467
+ 128,
3468
+ 64,
3469
+ 32
3470
+ ],
3471
+ "lr": 0.0003,
3472
+ "beta_kl": 0.1,
3473
+ "dropout": 0.0,
3474
+ "weight_decay": 0.0001,
3475
+ "use_pid_loss": true
3476
+ },
3477
+ "val_loss": 0.1021297210127632,
3478
+ "stopped_epoch": 104,
3479
+ "time_s": 3.7596522080129944
3480
+ },
3481
+ {
3482
+ "hp": {
3483
+ "hidden_dims": [
3484
+ 128,
3485
+ 64,
3486
+ 32
3487
+ ],
3488
+ "lr": 0.0003,
3489
+ "beta_kl": 0.1,
3490
+ "dropout": 0.1,
3491
+ "weight_decay": 0.0001,
3492
+ "use_pid_loss": true
3493
+ },
3494
+ "val_loss": 0.101444021384151,
3495
+ "stopped_epoch": 108,
3496
+ "time_s": 4.312188666954171
3497
+ },
3498
+ {
3499
+ "hp": {
3500
+ "hidden_dims": [
3501
+ 128,
3502
+ 64,
3503
+ 32
3504
+ ],
3505
+ "lr": 0.0003,
3506
+ "beta_kl": 0.1,
3507
+ "dropout": 0.2,
3508
+ "weight_decay": 0.0001,
3509
+ "use_pid_loss": true
3510
+ },
3511
+ "val_loss": 0.1015386668416117,
3512
+ "stopped_epoch": 120,
3513
+ "time_s": 4.72497254202608
3514
+ }
3515
+ ]
3516
+ }
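A minimal sketch of how a sweep log like the one above could be ranked, assuming each entry has the `hp`, `val_loss`, `stopped_epoch`, and `time_s` fields shown. The top-level key that holds the list is not visible in this part of the diff, so the example works on a plain list of entries; the helper names `best_by_val_loss` and `best_per_beta_kl` are hypothetical and not part of the repository. The sample entries are copied from the log. Because the logged `val_loss` rises with `beta_kl` across otherwise identical configurations, it may include the KL term, in which case comparing within a single `beta_kl` value is likely the fairer comparison.

```python
from collections import defaultdict

# Four entries copied verbatim from the sweep log; the full log holds many more.
results = [
    {"hp": {"hidden_dims": [64, 32], "lr": 0.0003, "beta_kl": 0.02,
            "dropout": 0.2, "weight_decay": 0.0001, "use_pid_loss": True},
     "val_loss": 0.1003386748663952, "stopped_epoch": 120,
     "time_s": 3.793109167017974},
    {"hp": {"hidden_dims": [128, 64, 32], "lr": 0.003, "beta_kl": 0.02,
            "dropout": 0.2, "weight_decay": 0.0001, "use_pid_loss": True},
     "val_loss": 0.09967796478657365, "stopped_epoch": 108,
     "time_s": 4.272658915957436},
    {"hp": {"hidden_dims": [128, 64, 32], "lr": 0.001, "beta_kl": 0.05,
            "dropout": 0.2, "weight_decay": 0.0001, "use_pid_loss": True},
     "val_loss": 0.10006732967375331, "stopped_epoch": 115,
     "time_s": 4.516163124993909},
    {"hp": {"hidden_dims": [128, 64, 32], "lr": 0.0003, "beta_kl": 0.1,
            "dropout": 0.2, "weight_decay": 0.0001, "use_pid_loss": True},
     "val_loss": 0.1015386668416117, "stopped_epoch": 120,
     "time_s": 4.72497254202608},
]


def best_by_val_loss(entries):
    """Return the entry with the lowest validation loss (hypothetical helper)."""
    return min(entries, key=lambda e: e["val_loss"])


def best_per_beta_kl(entries):
    """Group entries by beta_kl and keep the lowest-loss entry in each group."""
    groups = defaultdict(list)
    for entry in entries:
        groups[entry["hp"]["beta_kl"]].append(entry)
    return {beta: best_by_val_loss(group) for beta, group in groups.items()}


overall = best_by_val_loss(results)
per_beta = best_per_beta_kl(results)

print("overall best:", overall["hp"], overall["val_loss"])
for beta in sorted(per_beta):
    print(f"beta_kl={beta}: {per_beta[beta]['hp']['hidden_dims']} "
          f"val_loss={per_beta[beta]['val_loss']:.5f}")
```

To run this against the real file, load it with `json.load` and pass in the list stored under whatever top-level key the file uses.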