Upload results/phase4_smollm2.json with huggingface_hub 7fe69de verified JorgeAV committed 12 days ago
Phase 4 (SmolLM2 decoder) checkpoint - Stage 2 Epoch 3 4786946 verified JorgeAV committed 12 days ago
Add Phase 4 training: SmolLM2-135M decoder + bridge MLP f1842cf verified JorgeAV committed 12 days ago
Add Phase 3.1 training: gen_weight 2.0, gen_len 32, scheduled sampling, beam search 206e1ad verified JorgeAV committed 14 days ago
Upload checkpoints/hybrid_main_phase3_1_best.pt with huggingface_hub ecb8790 verified JorgeAV committed 14 days ago
Upload results/hybrid_main_phase3_1.json with huggingface_hub 706d4eb verified JorgeAV committed 14 days ago
Upload results/hybrid_main_phase3.json with huggingface_hub 6f8358f verified JorgeAV committed 14 days ago
Add complete Phase 3 training script with generative decoder + open-ended VQA a05b0f9 verified JorgeAV committed 14 days ago
Add Phase 3 training script (generative decoder + enriched evidence) 3d28d08 verified JorgeAV committed 14 days ago
Upload checkpoints/hybrid_main_phase3_best.pt with huggingface_hub f5d823f verified JorgeAV committed 14 days ago
Upload results/hybrid_main_phase3.json with huggingface_hub d5eff1e verified JorgeAV committed 14 days ago
feat: add persistent Trackio Space for image logging (space_id + sync) da959e9 verified JorgeAV committed 18 days ago
Upload results/hybrid_main_phase2.json with huggingface_hub 3f403f6 verified JorgeAV committed 18 days ago
Fix TextEncoder.unfreeze_last: compatible with both AutoModel (Qwen3Model.layers) and ForCausalLM (model.model.layers) 33997db verified JorgeAV committed 18 days ago
Phase 2: Add SOTA visual diagnostics (13 types) with Trackio image logging 5c15aea verified JorgeAV committed 18 days ago
Upload checkpoints/hybrid_main_best.pt with huggingface_hub 2df74fc verified JorgeAV committed 18 days ago
Upload results/hybrid_main.json with huggingface_hub 1691248 verified JorgeAV committed 18 days ago
add: train_phase2.py — Phase 2 perception fine-tuning with 3-group differential LR optimizer 66b3447 verified JorgeAV committed 18 days ago
add: launch_ablations.py — complete ablation matrix with CLI commands for all 12 experiments 212d421 verified JorgeAV committed 18 days ago
fix: test_architecture.py — use os.path.dirname(__file__) instead of hardcoded /app for sys.path 83e1328 verified JorgeAV committed 18 days ago
fix: target_encoder.py — respect config.jepa_loss_fn (smooth_l1/mse/cosine) instead of hardcoded MSE ac52da3 verified JorgeAV committed 18 days ago
fix: test_architecture.py — correct evidence_gate attribute check (gate_type='none' not gate=None), add dinov2 config test, compact formatting 67a4198 verified JorgeAV committed 18 days ago
fix: test_architecture.py — add comprehensive ablation tests (no_jepa, no_rollout, no_gate, K variants, loss_fn variants, SIGReg vs VICReg, purist config) e158637 verified JorgeAV committed 18 days ago
fix: README.md — complete ablation table with all 13 experiments and CLI flags, add no_sigreg/vicreg_only b292067 verified JorgeAV committed 18 days ago
fix: ARCHITECTURE.md — complete ablation table with all 13 experiments, CLI flags, dinov2/loss_fn/sigreg/vicreg ablations, footnote on no_rollout→no_jepa bac1326 verified JorgeAV committed 18 days ago
fix: train_mrjepa.py — add VICReg loss, --no_sigreg/--use_vicreg CLI flags, log all ablation fields in results JSON, document no_rollout→no_jepa conflation 7741500 verified JorgeAV committed 18 days ago
fix: mr_jepa.py — proper evidence gate disable via gate_type (no lambda hack), init ablation flags from config, update docstrings to current stack 3c437e2 verified JorgeAV committed 18 days ago
fix: ablation.py — wire all disable flags, add missing experiments (dinov2, mse/cosine loss, no_sigreg, vicreg_only), add run()/load_results() methods, generate CLI commands b198bd2 verified JorgeAV committed 18 days ago
fix: backbones.py — DINOv3 via timm + Qwen3-Embedding-0.6B, add get_transform(), proper layer unfreezing for both architectures ad6ce96 verified JorgeAV committed 18 days ago
fix: model_config.py — update defaults to DINOv3-L + Qwen3-Embedding, add jepa_loss_fn/use_jepa fields, add get_dinov2_ablation_config() b564d53 verified JorgeAV committed 18 days ago
fix: SAM 3.1 → jetjodh/sam3.1 (non-gated mirror) in training script 96110fc verified JorgeAV committed 18 days ago
fix: update ARCHITECTURE.md — SAM 3.1 → jetjodh/sam3.1, align with current component stack (DINOv3-L, Qwen3-Embedding) 19d4210 verified JorgeAV committed 18 days ago
Training script with DINOv3+Qwen3-Embedding, dtype fixes, all ablation flags 6bb8162 verified JorgeAV committed 18 days ago
Update README with adapted component stack (DINOv3, Qwen3, SAM3.1, etc.) b10dd74 verified JorgeAV committed 18 days ago
Upload checkpoints/test_cpu_best.pt with huggingface_hub 9ad9d20 verified JorgeAV committed 18 days ago
Initial MR-JEPA codebase: architecture, training, evaluation, and tests dba2c56 verified JorgeAV committed 18 days ago