Aesop-RL Model Checkpoint

This repository contains three trained policy-network checkpoints from the warm_up stage of the Aesop-RL project.

Model Details

  • Model Type: Policy Network for Aesop Configuration
  • Training Stage: warm_up
  • Framework: PyTorch
  • Lean Version: 4

Architecture Validation

  • warm_up.pt: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)
  • warm_up_best.pt: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)
  • warm_up_final.pt: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)
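
The component check above can be approximated by counting state-dict keys per component. The sketch below assumes that parameter names begin with the component name (for example `tactic_heads.0.weight`). That naming scheme, the helper `count_component_keys`, and the sample keys are hypothetical and are not confirmed by this repository.

```python
# Component names as listed in the Architecture Validation section.
COMPONENTS = ("shared_mlp", "tactic_heads", "lemma_heads", "config_heads", "base_encoder")

def count_component_keys(state_dict_keys):
    """Count keys per component, assuming each key starts with '<component>.'."""
    counts = {name: 0 for name in COMPONENTS}
    for key in state_dict_keys:
        prefix = key.split(".", 1)[0]  # text before the first dot
        if prefix in counts:
            counts[prefix] += 1
    return counts

# Hypothetical key names, used only to exercise the helper.
sample_keys = [
    "shared_mlp.0.weight",
    "shared_mlp.0.bias",
    "tactic_heads.0.weight",
    "optimizer.step",  # ignored: not a known component
]
counts = count_component_keys(sample_keys)
print(counts)
```

With a real checkpoint, the keys would presumably come from `checkpoint["model_state_dict"].keys()`.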

Validation Results

  • warm_up.pt: 25.00% (61/244)
  • warm_up_best.pt: 25.82% (63/244)
  • warm_up_final.pt: 25.82% (63/244)
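
The percentages above follow directly from the counts in parentheses. As a quick check, the sketch below recomputes them; the helper name `accuracy_percent` is illustrative, and the README does not state what a single hit represents.

```python
def accuracy_percent(hits: int, total: int) -> str:
    """Format hits/total as a percentage with two decimals."""
    return f"{100 * hits / total:.2f}%"

# (hits, total) pairs copied from the Validation Results list.
results = {
    "warm_up.pt": (61, 244),
    "warm_up_best.pt": (63, 244),
    "warm_up_final.pt": (63, 244),
}

for name, (hits, total) in results.items():
    print(f"{name}: {accuracy_percent(hits, total)} ({hits}/{total})")
```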

Files

  • warm_up.pt: PyTorch checkpoint (1056.4 MB)
  • warm_up_best.pt: PyTorch checkpoint (1056.4 MB)
  • warm_up_final.pt: PyTorch checkpoint (1056.4 MB)

Usage

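import torch
from aesop.models.policy import AesopPolicy

# Load the checkpoint onto the CPU; move the model to a GPU afterwards if needed.
checkpoint = torch.load("warm_up.pt", map_location="cpu")

# Rebuild the policy and restore its trained weights.
model = AesopPolicy()
model.load_state_dict(checkpoint["model_state_dict"])
model.eval()  # switch to inference mode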
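
The three checkpoints do not share one key layout. According to Training Info, `warm_up.pt` and `warm_up_best.pt` store `epoch`, `train_loss`, and `val_acc`, while `warm_up_final.pt` stores `final_train_loss` and `final_val_acc` and has no `epoch`. A minimal sketch of reading metadata from either layout follows. The helper `read_metadata` is hypothetical, and the sample dictionaries are stand-ins for loaded checkpoints whose loss and accuracy values are placeholders, not real results.

```python
def read_metadata(checkpoint: dict) -> dict:
    """Return common metadata fields, falling back to the 'final_' key names."""
    return {
        "epoch": checkpoint.get("epoch"),  # None for warm_up_final.pt
        "train_loss": checkpoint.get("train_loss", checkpoint.get("final_train_loss")),
        "val_acc": checkpoint.get("val_acc", checkpoint.get("final_val_acc")),
        "best_val_acc": checkpoint.get("best_val_acc"),
    }

# Stand-ins for torch.load(...) results; loss and val_acc values are placeholders.
periodic_like = {"epoch": 1, "train_loss": 1.0, "val_acc": 0.25, "best_val_acc": 0.25}
final_like = {"final_train_loss": 0.5, "final_val_acc": 0.25, "best_val_acc": 0.2581967213114754}

print(read_metadata(periodic_like))
print(read_metadata(final_like))
```

Using `dict.get` keeps the same code path working for all three files instead of raising `KeyError` on the final checkpoint.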

Training Info

{
  "stage": "warm_up",
  "files_description": "- `warm_up.pt`: PyTorch checkpoint (1056.4 MB)\n- `warm_up_best.pt`: PyTorch checkpoint (1056.4 MB)\n- `warm_up_final.pt`: PyTorch checkpoint (1056.4 MB)",
  "architecture_summary": "- **warm_up.pt**: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)\n- **warm_up_best.pt**: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)\n- **warm_up_final.pt**: Valid (shared_mlp, tactic_heads, lemma_heads, config_heads, base_encoder)",
  "validation_summary": "- **warm_up.pt**: 25.00% (61/244)\n- **warm_up_best.pt**: 25.82% (63/244)\n- **warm_up_final.pt**: 25.82% (63/244)",
  "checkpoints": [
    {
      "stage": "warm_up",
      "keys": [
        "epoch",
        "model_state_dict",
        "trainer_state_dict",
        "train_loss",
        "val_acc",
        "best_val_acc"
      ],
      "epoch": 1,
      "best_val_acc": 0.25,
      "model_architecture": {
        "is_valid": true,
        "components": [
          {
            "name": "shared_mlp",
            "found_keys": 2,
            "total_expected": 3
          },
          {
            "name": "tactic_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "lemma_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "config_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "base_encoder",
            "found_keys": 280,
            "total_expected": "variable"
          }
        ],
        "warnings": [],
        "errors": []
      },
      "file_size_mb": 1056.3982362747192,
      "filename": "warm_up.pt"
    },
    {
      "stage": "warm_up",
      "keys": [
        "epoch",
        "model_state_dict",
        "trainer_state_dict",
        "train_loss",
        "val_acc",
        "best_val_acc"
      ],
      "epoch": 8,
      "best_val_acc": 0.2581967213114754,
      "model_architecture": {
        "is_valid": true,
        "components": [
          {
            "name": "shared_mlp",
            "found_keys": 2,
            "total_expected": 3
          },
          {
            "name": "tactic_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "lemma_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "config_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "base_encoder",
            "found_keys": 280,
            "total_expected": "variable"
          }
        ],
        "warnings": [],
        "errors": []
      },
      "file_size_mb": 1056.3955039978027,
      "filename": "warm_up_best.pt"
    },
    {
      "stage": "warm_up",
      "keys": [
        "model_state_dict",
        "trainer_state_dict",
        "final_train_loss",
        "final_val_acc",
        "best_val_acc"
      ],
      "best_val_acc": 0.2581967213114754,
      "model_architecture": {
        "is_valid": true,
        "components": [
          {
            "name": "shared_mlp",
            "found_keys": 2,
            "total_expected": 3
          },
          {
            "name": "tactic_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "lemma_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "config_heads",
            "found_keys": 2,
            "total_expected": 2
          },
          {
            "name": "base_encoder",
            "found_keys": 280,
            "total_expected": "variable"
          }
        ],
        "warnings": [],
        "errors": []
      },
      "file_size_mb": 1056.396414756775,
      "filename": "warm_up_final.pt"
    }
  ]
}
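
The JSON above reports every checkpoint as valid, yet `shared_mlp` lists 2 found keys against 3 expected. The sketch below shows one way to surface such gaps programmatically. It embeds a trimmed copy of the first checkpoint entry so that it runs on its own, and the helper `incomplete_components` is a hypothetical name.

```python
import json

# Trimmed excerpt of the Training Info JSON (first checkpoint, three components).
training_info = json.loads("""
{"checkpoints": [{"filename": "warm_up.pt",
  "model_architecture": {"components": [
    {"name": "shared_mlp", "found_keys": 2, "total_expected": 3},
    {"name": "tactic_heads", "found_keys": 2, "total_expected": 2},
    {"name": "base_encoder", "found_keys": 280, "total_expected": "variable"}]}}]}
""")

def incomplete_components(info: dict) -> list:
    """Return (filename, component, found, expected) where found < expected."""
    gaps = []
    for ckpt in info["checkpoints"]:
        for comp in ckpt["model_architecture"]["components"]:
            expected = comp["total_expected"]
            # "variable" means there is no fixed count to compare against.
            if isinstance(expected, int) and comp["found_keys"] < expected:
                gaps.append((ckpt["filename"], comp["name"], comp["found_keys"], expected))
    return gaps

gaps = incomplete_components(training_info)
print(gaps)
```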

Citation

If you use this model, please cite the Aesop-RL project.
