Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Rayugacodes
/
kernelx-strategist

Safetensors
llama
Model card Files Files and versions
xet
Community
kernelx-strategist
1.49 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 33 commits
Rayugacodes's picture
Rayugacodes
Blog: The Digital Traffic Jam
7d2467c verified 11 days ago
  • adapter
    LoRA adapter (warm-start SFT) 11 days ago
  • plots
    Benchmark: benchmark_results.json 11 days ago
  • training
    Training pipeline scripts 11 days ago
  • .gitattributes
    1.64 kB
    Add training plot: grpo_training.png 11 days ago
  • Blog.md
    13.2 kB
    Blog: The Digital Traffic Jam 11 days ago
  • KernelX_Training.ipynb
    23.2 kB
    Fix: keep Colab defaults, install only trl+peft with --no-deps, handle all TRL versions 11 days ago
  • README.md
    11.4 kB
    Update README with latest 11 days ago
  • config.json
    921 Bytes
    Merged strategist (warm-start + GRPO) 11 days ago
  • generation_config.json
    132 Bytes
    Merged strategist (warm-start + GRPO) 11 days ago
  • merges.txt
    466 kB
    Tokenizer 11 days ago
  • model.safetensors
    1.45 GB
    xet
    Merged strategist (warm-start + GRPO) 11 days ago
  • special_tokens_map.json
    655 Bytes
    Tokenizer 11 days ago
  • tokenizer.json
    3.52 MB
    Tokenizer 11 days ago
  • tokenizer_config.json
    3.79 kB
    Tokenizer 11 days ago
  • train_on_hf.py
    13.1 kB
    GPU training script for HF 11 days ago
  • vocab.json
    801 kB
    Tokenizer 11 days ago