Commit History

Exclude large jsonl files from repo
cee9266

walidsobhie-code commited on

Add git lfs tracking for large jsonl files
29a776a

walidsobhie-code commited on

Fix Gradio/huggingface_hub version compatibility
97fa10c

walidsobhie-code Claude Opus 4.6 commited on

Add ddgs for free web search
729d832

walidsobhie-code commited on

Add real web search with ddgs (DuckDuckGo HTML scraper, free, no API key)
0fbc572

walidsobhie-code commited on

Fix search with DuckDuckGo API (proper JSON parsing)
35799ef

walidsobhie-code commited on

Use local model instead of HF for faster loading
3c29912

walidsobhie-code commited on

Add web search command (search:query) with DuckDuckGo API
40b1cc9

walidsobhie-code commited on

Load model from HF: my-ai-stack/stack-2-9-finetuned
217c8d8

walidsobhie-code commited on

Improve chat.py with system prompt and User/Assistant format
2aa22b3

walidsobhie-code commited on

Add interactive chat script with improved generation settings
7a8afa9

walidsobhie-code commited on

feat: add code completion generator and model registry tools
4ca507e

walidsobhie-code commited on

feat: add 5000 more tool calling examples (total: 6500)
2e091e7

walidsobhie-code commited on

feat: add training recipes for T4 QLoRA, A100, local training
e4ff487

walidsobhie-code commited on

feat: add evaluation datasets (HumanEval 50, MBPP 100, Tool scenarios 50)
20a06fb

walidsobhie-code commited on

feat: add production infrastructure - CI/CD, Docker, code quality, and monitoring
b5998ff

walidsobhie-code commited on

feat: add inference API, quickstart guide, roadmap, and combined tool data
b03a8a0

walidsobhie-code commited on

feat: add evaluation scripts, tool calling data generator, and 7B training configs
183b3b6

walidsobhie-code commited on

fix: remove Google Drive mount (Colab-only) - use Kaggle output + GitHub push instead
a8f2981

walidsobhie-code commited on

feat: add Google Drive auto-save to prevent losing model outputs
65c52b2

walidsobhie-code commited on

feat: add model testing and evaluation scripts
f4b31b2

walidsobhie-code commited on

fix: add numpy import at top to ensure it's loaded (fixes Numpy not available error)
7adbecc

walidsobhie-code commited on

fix: disable AMP (fp16=False, bf16=False) to bypass P100 GradScaler bug
27a755a

walidsobhie-code commited on

fix: load model in FP32 to avoid AMP gradient scaling conflict
b098bb5

walidsobhie-code commited on

fix: proper variable ordering for bf16/fp16 detection — define before use
e78785b

walidsobhie-code commited on

fixes for Kaggle run
bb61f7c

walidsobhie-code commited on

fix: define use_fp16 before model load; remove duplicate
5ac765e

walidsobhie-code commited on

fix: load model in fp16 to match fp16 training precision (bf16 not supported on P100/T4)
445d8a0

walidsobhie-code commited on

fix: force FP16 and disable BF16 for T4 compatibility
896f8a1

walidsobhie-code commited on

fix: use torch.cuda.is_bf16_supported() instead of compute capability check
c2a0307

walidsobhie-code commited on

fix: load model in bfloat16, train in fp16 on T4 (bf16 not supported on Turing)
15ef1c5

walidsobhie-code commited on

fix: set expandable_segments=False to fix PyTorch #124807 gradient checkpointing bug
98e3329

walidsobhie-code commited on

fix: use device_map={'': cuda} to force all model layers on GPU (fixes gradient flow issue)
c2013aa

walidsobhie-code commited on

fix: use device_map=auto for 1.5B model, add model.train() after LoRA
cd96c3d

walidsobhie-code commited on

chore: use Qwen2.5-Coder-1.5B for Kaggle T4 (7B doesn't fit in 16GB)
fffdf6d

walidsobhie-code commited on

fix: use device_map=None for single GPU (T4) to avoid meta tensor errors with float16
7d0f28b

walidsobhie-code commited on

fix: fallback to fp16 when bf16 requested but GPU doesn't support it (T4/Pascal)
bdf34ba

walidsobhie-code commited on

fix: resolve CUDA OOM on T4 by using bfloat16, device_map=auto, and enabling gradient checkpointing
c82e627

walidsobhie-code commited on

fix: revert device_map=auto (AMP gradient conflict on T4, use explicit cuda device instead)
fdeb8f3

walidsobhie-code commited on

fix: disable gradient checkpointing (conflicts with LoRA on T4, breaks backward pass)
07b92ca

walidsobhie-code commited on

fix: switch bf16->fp16 (Kaggle T4/P100 is Pascal, no Ampere bf16 support)
6e6bd1d

walidsobhie-code commited on

fix: add numpy to pip install deps for device_map=auto CPU offload
fc98957

walidsobhie-code commited on

fix OOM: use device_map=auto with CPU offload + float16
54daa38

walidsobhie-code commited on

fix: reduce memory for T4 GPU
6badaa9

walidsobhie-code commited on

fix: add missing datasets import in train_simple_nobnb.py
f2c605a

walidsobhie-code commited on

revert bitsandbytes: CUDA 13 runtime missing on Kaggle
732bef9

walidsobhie-code commited on

fix OOM: enable 4-bit NF4 quantization + bf16 training
4ce3e59

walidsobhie-code commited on

fix: force PyTorch 2.2.0+cu118 for P100 (sm_60) GPU compatibility
9150ac1

walidsobhie-code commited on

fix: use explicit cuda device map and use_reentrant=False for gradient checkpointing
3bfb6ea

walidsobhie-code commited on

fix OOM: batch_size 2->1, gradient_accumulation 4->8 for T4 GPU
cd564ae

walidsobhie-code commited on