Commit History

Phase 4 final results with training analysis

9825ad6
verified

JorgeAV commited on Apr 28

Upload results/phase4_smollm2.json with huggingface_hub

7fe69de
verified

JorgeAV commited on Apr 28

Phase 4 results - SmolLM2 decoder, 7 epochs

9eb985d
verified

JorgeAV commited on Apr 28

Phase 4 (SmolLM2 decoder) checkpoint - Stage 2 Epoch 3

4786946
verified

JorgeAV commited on Apr 28

Add Phase 4 training: SmolLM2-135M decoder + bridge MLP

f1842cf
verified

JorgeAV commited on Apr 28

Add Phase 3.1 training: gen_weight 2.0, gen_len 32, scheduled sampling, beam search

206e1ad
verified

JorgeAV commited on Apr 25

Upload checkpoints/hybrid_main_phase3_1_best.pt with huggingface_hub

ecb8790
verified

JorgeAV commited on Apr 25

Upload results/hybrid_main_phase3_1.json with huggingface_hub

706d4eb
verified

JorgeAV commited on Apr 25

Upload results/hybrid_main_phase3.json with huggingface_hub

6f8358f
verified

JorgeAV commited on Apr 25

Add Phase 3 training results

097b6ad
verified

JorgeAV commited on Apr 25

Add Phase 3 best checkpoint (epoch 0)

d119d5e
verified

JorgeAV commited on Apr 25

Add complete Phase 3 training script with generative decoder + open-ended VQA

a05b0f9
verified

JorgeAV commited on Apr 25

Add Phase 3 training script (generative decoder + enriched evidence)

3d28d08
verified

JorgeAV commited on Apr 25

Upload checkpoints/hybrid_main_phase3_best.pt with huggingface_hub

f5d823f
verified

JorgeAV commited on Apr 25

Upload results/hybrid_main_phase3.json with huggingface_hub

d5eff1e
verified

JorgeAV commited on Apr 25

feat: add persistent Trackio Space for image logging (space_id + sync)

da959e9
verified

JorgeAV commited on Apr 21

Upload results/hybrid_main_phase2.json with huggingface_hub

3f403f6
verified

JorgeAV commited on Apr 21

Fix TextEncoder.unfreeze_last: compatible with both AutoModel (Qwen3Model.layers) and ForCausalLM (model.model.layers)

33997db
verified

JorgeAV commited on Apr 21

Phase 2: Add SOTA visual diagnostics (13 types) with Trackio image logging

5c15aea
verified

JorgeAV commited on Apr 21

Upload checkpoints/hybrid_main_best.pt with huggingface_hub

2df74fc
verified

JorgeAV commited on Apr 21

Upload results/hybrid_main.json with huggingface_hub

1691248
verified

JorgeAV commited on Apr 21

add: train_phase2.py — Phase 2 perception fine-tuning with 3-group differential LR optimizer

66b3447
verified

JorgeAV commited on Apr 21

add: launch_ablations.py — complete ablation matrix with CLI commands for all 12 experiments

212d421
verified

JorgeAV commited on Apr 21

fix: test_architecture.py — use os.path.dirname(file) instead of hardcoded /app for sys.path

83e1328
verified

JorgeAV commited on Apr 21

fix: target_encoder.py — respect config.jepa_loss_fn (smooth_l1/mse/cosine) instead of hardcoded MSE

ac52da3
verified

JorgeAV commited on Apr 21

fix: test_architecture.py — correct evidence_gate attribute check (gate_type='none' not gate=None), add dinov2 config test, compact formatting

67a4198
verified

JorgeAV commited on Apr 21

fix: export get_dinov2_ablation_config from configs

e574f91
verified

JorgeAV commited on Apr 21

fix: test_architecture.py — add comprehensive ablation tests (no_jepa, no_rollout, no_gate, K variants, loss_fn variants, SIGReg vs VICReg, purist config)

e158637
verified

JorgeAV commited on Apr 21

fix: README.md — complete ablation table with all 13 experiments and CLI flags, add no_sigreg/vicreg_only

b292067
verified

JorgeAV commited on Apr 21

fix: ARCHITECTURE.md — complete ablation table with all 13 experiments, CLI flags, dinov2/loss_fn/sigreg/vicreg ablations, footnote on no_rollout→no_jepa

bac1326
verified

JorgeAV commited on Apr 21

fix: train_mrjepa.py — add VICReg loss, --no_sigreg/--use_vicreg CLI flags, log all ablation fields in results JSON, document no_rollout→no_jepa conflation

7741500
verified

JorgeAV commited on Apr 21

fix: mr_jepa.py — proper evidence gate disable via gate_type (no lambda hack), init ablation flags from config, update docstrings to current stack

3c437e2
verified

JorgeAV commited on Apr 21

fix: ablation.py — wire all disable flags, add missing experiments (dinov2, mse/cosine loss, no_sigreg, vicreg_only), add run()/load_results() methods, generate CLI commands

b198bd2
verified

JorgeAV commited on Apr 21

fix: backbones.py — DINOv3 via timm + Qwen3-Embedding-0.6B, add get_transform(), proper layer unfreezing for both architectures

ad6ce96
verified

JorgeAV commited on Apr 21

fix: model_config.py — update defaults to DINOv3-L + Qwen3-Embedding, add jepa_loss_fn/use_jepa fields, add get_dinov2_ablation_config()

b564d53
verified

JorgeAV commited on Apr 21

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror) in training script

96110fc
verified

JorgeAV commited on Apr 21

fix: update ARCHITECTURE.md — SAM 3.1 → jetjodh/sam3.1, align with current component stack (DINOv3-L, Qwen3-Embedding)

19d4210
verified

JorgeAV commited on Apr 21

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror)

22e8e09
verified

JorgeAV commited on Apr 21

Training script with DINOv3+Qwen3-Embedding, dtype fixes, all ablation flags

6bb8162
verified

JorgeAV commited on Apr 21

Update README with adapted component stack (DINOv3, Qwen3, SAM3.1, etc.)

b10dd74
verified

JorgeAV commited on Apr 21

Upload checkpoints/test_cpu_best.pt with huggingface_hub

9ad9d20
verified

JorgeAV commited on Apr 21

Upload results/test_cpu.json with huggingface_hub

bfc40c6
verified

JorgeAV commited on Apr 21

Remove pycache files

cc832df
verified

JorgeAV commited on Apr 21

Initial MR-JEPA codebase: architecture, training, evaluation, and tests

dba2c56
verified

JorgeAV commited on Apr 21

initial commit

3b4df8f
verified

JorgeAV commited on Apr 21

Commit History

Phase 4 final results with training analysis 9825ad6 verified

Upload results/phase4_smollm2.json with huggingface_hub 7fe69de verified

Phase 4 results - SmolLM2 decoder, 7 epochs 9eb985d verified

Phase 4 (SmolLM2 decoder) checkpoint - Stage 2 Epoch 3 4786946 verified

Add Phase 4 training: SmolLM2-135M decoder + bridge MLP f1842cf verified

Add Phase 3.1 training: gen_weight 2.0, gen_len 32, scheduled sampling, beam search 206e1ad verified

Upload checkpoints/hybrid_main_phase3_1_best.pt with huggingface_hub ecb8790 verified

Upload results/hybrid_main_phase3_1.json with huggingface_hub 706d4eb verified

Upload results/hybrid_main_phase3.json with huggingface_hub 6f8358f verified

Add Phase 3 training results 097b6ad verified

Add Phase 3 best checkpoint (epoch 0) d119d5e verified

Add complete Phase 3 training script with generative decoder + open-ended VQA a05b0f9 verified

Add Phase 3 training script (generative decoder + enriched evidence) 3d28d08 verified

Upload checkpoints/hybrid_main_phase3_best.pt with huggingface_hub f5d823f verified

Upload results/hybrid_main_phase3.json with huggingface_hub d5eff1e verified

feat: add persistent Trackio Space for image logging (space_id + sync) da959e9 verified

Upload results/hybrid_main_phase2.json with huggingface_hub 3f403f6 verified

Fix TextEncoder.unfreeze_last: compatible with both AutoModel (Qwen3Model.layers) and ForCausalLM (model.model.layers) 33997db verified

Phase 2: Add SOTA visual diagnostics (13 types) with Trackio image logging 5c15aea verified

Upload checkpoints/hybrid_main_best.pt with huggingface_hub 2df74fc verified

Upload results/hybrid_main.json with huggingface_hub 1691248 verified

add: train_phase2.py — Phase 2 perception fine-tuning with 3-group differential LR optimizer 66b3447 verified

add: launch_ablations.py — complete ablation matrix with CLI commands for all 12 experiments 212d421 verified

fix: test_architecture.py — use os.path.dirname(__file__) instead of hardcoded /app for sys.path 83e1328 verified

fix: target_encoder.py — respect config.jepa_loss_fn (smooth_l1/mse/cosine) instead of hardcoded MSE ac52da3 verified

fix: test_architecture.py — correct evidence_gate attribute check (gate_type='none' not gate=None), add dinov2 config test, compact formatting 67a4198 verified

fix: export get_dinov2_ablation_config from configs e574f91 verified

fix: test_architecture.py — add comprehensive ablation tests (no_jepa, no_rollout, no_gate, K variants, loss_fn variants, SIGReg vs VICReg, purist config) e158637 verified

fix: README.md — complete ablation table with all 13 experiments and CLI flags, add no_sigreg/vicreg_only b292067 verified

fix: ARCHITECTURE.md — complete ablation table with all 13 experiments, CLI flags, dinov2/loss_fn/sigreg/vicreg ablations, footnote on no_rollout→no_jepa bac1326 verified

fix: train_mrjepa.py — add VICReg loss, --no_sigreg/--use_vicreg CLI flags, log all ablation fields in results JSON, document no_rollout→no_jepa conflation 7741500 verified

fix: mr_jepa.py — proper evidence gate disable via gate_type (no lambda hack), init ablation flags from config, update docstrings to current stack 3c437e2 verified

fix: ablation.py — wire all disable flags, add missing experiments (dinov2, mse/cosine loss, no_sigreg, vicreg_only), add run()/load_results() methods, generate CLI commands b198bd2 verified

fix: backbones.py — DINOv3 via timm + Qwen3-Embedding-0.6B, add get_transform(), proper layer unfreezing for both architectures ad6ce96 verified

fix: model_config.py — update defaults to DINOv3-L + Qwen3-Embedding, add jepa_loss_fn/use_jepa fields, add get_dinov2_ablation_config() b564d53 verified

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror) in training script 96110fc verified

fix: update ARCHITECTURE.md — SAM 3.1 → jetjodh/sam3.1, align with current component stack (DINOv3-L, Qwen3-Embedding) 19d4210 verified

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror) 22e8e09 verified

Training script with DINOv3+Qwen3-Embedding, dtype fixes, all ablation flags 6bb8162 verified

Update README with adapted component stack (DINOv3, Qwen3, SAM3.1, etc.) b10dd74 verified

Upload checkpoints/test_cpu_best.pt with huggingface_hub 9ad9d20 verified

Upload results/test_cpu.json with huggingface_hub bfc40c6 verified

Remove __pycache__ files cc832df verified

Initial MR-JEPA codebase: architecture, training, evaluation, and tests dba2c56 verified

initial commit 3b4df8f verified

Phase 4 final results with training analysis

9825ad6
verified

Upload results/phase4_smollm2.json with huggingface_hub

7fe69de
verified

Phase 4 results - SmolLM2 decoder, 7 epochs

9eb985d
verified

Phase 4 (SmolLM2 decoder) checkpoint - Stage 2 Epoch 3

4786946
verified

Add Phase 4 training: SmolLM2-135M decoder + bridge MLP

f1842cf
verified

Add Phase 3.1 training: gen_weight 2.0, gen_len 32, scheduled sampling, beam search

206e1ad
verified

Upload checkpoints/hybrid_main_phase3_1_best.pt with huggingface_hub

ecb8790
verified

Upload results/hybrid_main_phase3_1.json with huggingface_hub

706d4eb
verified

Upload results/hybrid_main_phase3.json with huggingface_hub

6f8358f
verified

Add Phase 3 training results

097b6ad
verified

Add Phase 3 best checkpoint (epoch 0)

d119d5e
verified

Add complete Phase 3 training script with generative decoder + open-ended VQA

a05b0f9
verified

Add Phase 3 training script (generative decoder + enriched evidence)

3d28d08
verified

Upload checkpoints/hybrid_main_phase3_best.pt with huggingface_hub

f5d823f
verified

Upload results/hybrid_main_phase3.json with huggingface_hub

d5eff1e
verified

feat: add persistent Trackio Space for image logging (space_id + sync)

da959e9
verified

Upload results/hybrid_main_phase2.json with huggingface_hub

3f403f6
verified

Fix TextEncoder.unfreeze_last: compatible with both AutoModel (Qwen3Model.layers) and ForCausalLM (model.model.layers)

33997db
verified

Phase 2: Add SOTA visual diagnostics (13 types) with Trackio image logging

5c15aea
verified

Upload checkpoints/hybrid_main_best.pt with huggingface_hub

2df74fc
verified

Upload results/hybrid_main.json with huggingface_hub

1691248
verified

add: train_phase2.py — Phase 2 perception fine-tuning with 3-group differential LR optimizer

66b3447
verified

add: launch_ablations.py — complete ablation matrix with CLI commands for all 12 experiments

212d421
verified

fix: test_architecture.py — use os.path.dirname(file) instead of hardcoded /app for sys.path

83e1328
verified

fix: target_encoder.py — respect config.jepa_loss_fn (smooth_l1/mse/cosine) instead of hardcoded MSE

ac52da3
verified

fix: test_architecture.py — correct evidence_gate attribute check (gate_type='none' not gate=None), add dinov2 config test, compact formatting

67a4198
verified

fix: export get_dinov2_ablation_config from configs

e574f91
verified

fix: test_architecture.py — add comprehensive ablation tests (no_jepa, no_rollout, no_gate, K variants, loss_fn variants, SIGReg vs VICReg, purist config)

e158637
verified

fix: README.md — complete ablation table with all 13 experiments and CLI flags, add no_sigreg/vicreg_only

b292067
verified

fix: ARCHITECTURE.md — complete ablation table with all 13 experiments, CLI flags, dinov2/loss_fn/sigreg/vicreg ablations, footnote on no_rollout→no_jepa

bac1326
verified

fix: train_mrjepa.py — add VICReg loss, --no_sigreg/--use_vicreg CLI flags, log all ablation fields in results JSON, document no_rollout→no_jepa conflation

7741500
verified

fix: mr_jepa.py — proper evidence gate disable via gate_type (no lambda hack), init ablation flags from config, update docstrings to current stack

3c437e2
verified

fix: ablation.py — wire all disable flags, add missing experiments (dinov2, mse/cosine loss, no_sigreg, vicreg_only), add run()/load_results() methods, generate CLI commands

b198bd2
verified

fix: backbones.py — DINOv3 via timm + Qwen3-Embedding-0.6B, add get_transform(), proper layer unfreezing for both architectures

ad6ce96
verified

fix: model_config.py — update defaults to DINOv3-L + Qwen3-Embedding, add jepa_loss_fn/use_jepa fields, add get_dinov2_ablation_config()

b564d53
verified

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror) in training script

96110fc
verified

fix: update ARCHITECTURE.md — SAM 3.1 → jetjodh/sam3.1, align with current component stack (DINOv3-L, Qwen3-Embedding)

19d4210
verified

fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror)

22e8e09
verified

Training script with DINOv3+Qwen3-Embedding, dtype fixes, all ablation flags

6bb8162
verified

Update README with adapted component stack (DINOv3, Qwen3, SAM3.1, etc.)

b10dd74
verified

Upload checkpoints/test_cpu_best.pt with huggingface_hub

9ad9d20
verified

Upload results/test_cpu.json with huggingface_hub

bfc40c6
verified

Remove pycache files

cc832df
verified

Initial MR-JEPA codebase: architecture, training, evaluation, and tests

dba2c56
verified

initial commit

3b4df8f
verified