# Python-generated files
__pycache__/
*.py[oc]
build/
dist/
wheels/
*.egg-info
# Virtual environments
.venv
# Environment files
.env
# LangGraph
.langgraph_api
# =============================================================================
# BROWSER DATA (Playwright/Chromium session files - very large)
# =============================================================================
.browser_data/
**/.*_browser_data/
**/.browser_data/
**/ShaderCache/
**/GraphiteDawnCache/
**/GrShaderCache/
**/.sessions/
**/Session Storage/
**/Local Storage/
**/Service Worker/
# =============================================================================
# ML MODEL ARTIFACTS (Large files - don't push to Git)
# =============================================================================
# Trained model weights (large binary files), except those tracked with Git LFS.
# Note: the .h5 and .joblib model files we want to push go through Git LFS, so
# the patterns below are intentionally left commented out (i.e. not ignored).
# *.h5
# *.hdf5
# *.joblib
# *.pkl
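# Sketch (an assumption, not copied from this repo): the Git LFS rules for the
# model files above would live in .gitattributes rather than in this file.
# Running `git lfs track "*.h5"` writes an entry of this form:
#   *.h5 filter=lfs diff=lfs merge=lfs -text
#   *.joblib filter=lfs diff=lfs merge=lfs -text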
*.pickle
*.pt
*.pth
*.onnx
*.pb
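# Model output directories (training intermediates - regenerable)
# Note: where core.ignorecase is true (typical on case-insensitive filesystems),
# **/Artifacts/ may also match lowercase artifacts/ directories in full.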
**/Artifacts/
**/artifacts/data_ingestion/
**/artifacts/data_validation/
**/artifacts/data_transformation/
**/artifacts/model_evaluation/
models/*/output/
models/*/models_cache/
models/*/checkpoints/
# Data files
*.parquet
*.npy
*.csv
# Airflow local state
models/*/.astro/
# MLflow artifacts (tracked separately)
mlruns/
mlartifacts/
# =============================================================================
# DATA FILES (Can be large)
# =============================================================================
data/
datasets/
# Database files
*.db
*.sqlite
*.sqlite3
# ChromaDB persistence (can be large)
chroma_db/
# =============================================================================
# KEEP THESE (source code, configs)
# =============================================================================
# The models/ folders themselves ARE tracked for:
# - main.py, src/, dags/ (pipeline code)
# - requirements.txt, setup.py (dependencies)
# - data_schema/ (validation configs)
# - README.md (documentation)
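#
# To check which rule (if any) ignores a given path, run from the repo root:
#   git check-ignore -v <path>
# (<path> is a placeholder. The output names the file, line, and pattern that
# matched; no output means no ignore rule applies to that path.)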