mindi-backup / README_COMPONENT_1_SETUP.md
Mindigenous
Initial full project backup with Git LFS
53f0cc2

Component 1: Project Setup (Windows + RTX 4060 8GB)

What This Component Does

  • Creates a clean folder structure for the full coding-assistant project.
  • Sets up a Python virtual environment.
  • Installs all core dependencies needed across Components 2-10.
  • Verifies that Python, PyTorch, CUDA visibility, and key libraries work.

Folder Structure Created

  • data/raw -> raw datasets you will provide later
  • data/interim -> temporary cleaned data
  • data/processed -> training-ready tokenized data
  • data/external -> any third-party resources
  • src/tokenizer -> Component 2 code tokenizer
  • src/dataset_pipeline -> Component 3 preprocessing pipeline
  • src/model_architecture -> Component 4 transformer code
  • src/training_pipeline -> Component 5 training loop
  • src/evaluation_system -> Component 6 evaluation code
  • src/inference_engine -> Component 7 inference code
  • src/chat_interface -> Component 8 Gradio interface
  • src/finetuning_system -> Component 9 LoRA fine-tuning
  • src/export_optimization -> Component 10 quantization/export tools
  • configs -> config files for all components
  • scripts -> setup, verification, and utility scripts
  • tests -> quick checks for each component
  • checkpoints -> model checkpoints saved during training
  • models/base -> base trained model files
  • models/lora -> LoRA adapters
  • models/quantized -> optimized quantized models
  • artifacts -> generated reports, metrics, and outputs
  • logs -> training and runtime logs

Exact Commands To Run (in this order)

Run from: D:\Desktop 31st Jan 2026\MIND-AI-MODEL

  1. Install Python 3.11 (required for package compatibility):
  1. Allow script execution for this terminal only:
Set-ExecutionPolicy -Scope Process -ExecutionPolicy Bypass
  1. If you already attempted setup once, remove old virtual environment first:
if (Test-Path .\.venv) { Remove-Item -Recurse -Force .\.venv }
  1. Create folders, virtual env, install dependencies:
.\scripts\setup_windows_environment.ps1
  1. Activate virtual environment:
.\.venv\Scripts\Activate.ps1
  1. Verify setup:
python .\scripts\verify_component1_setup.py

Expected Verification Result

  • Prints Python version
  • Prints PyTorch version
  • Shows whether CUDA is available
  • Shows GPU name if available
  • Confirms critical libraries import correctly

Note:

  • codebleu is excluded from base install on Windows due to a tree-sitter dependency conflict on Python 3.11.
  • Component 6 will use Windows-stable evaluation metrics and add code-quality checks without breaking setup.
  • bitsandbytes is optional on native Windows because some CUDA/driver combinations fail to load its DLL.
  • Base setup and all early components continue without it.
  • For Component 5, we will:
    • try bitsandbytes if available, and
    • automatically fall back to a stable optimizer on your machine if it is not.

If verification fails, copy the full terminal output and share it with me.