Spaces:
Paused
Paused
| # Running Comparison with LM Student | |
| ## Changes Made | |
| Updated `compare_strategies.py` to use **LM Student (DistilBERT)** instead of MockStudentAgent for all three strategies: | |
| - Random Strategy | |
| - Progressive Strategy | |
| - Teacher Strategy | |
| ## Usage | |
| ```bash | |
| cd teacher_agent_dev | |
| python compare_strategies.py --iterations 500 --deterministic | |
| ``` | |
| ## Notes | |
| - **LM Student is slower** - Each iteration involves DistilBERT inference/fine-tuning | |
| - Uses DistilBERT for multiple choice questions | |
| - Online learning (fine-tunes on 1 task at a time) | |
| - Memory decay using Ebbinghaus forgetting curve | |
| - Per-topic skill tracking | |
| ## Parameters | |
| - `learning_rate`: 5e-5 (LM fine-tuning rate) | |
| - `retention_constant`: 80.0 (slower forgetting) | |
| - `device`: 'cpu' (can be changed to 'cuda' if GPU available) | |
| - `max_length`: 256 tokens | |
| - `gradient_accumulation_steps`: 4 | |
| ## Expected Runtime | |
| With LM Student: | |
| - **Random Strategy**: ~5-10 minutes for 500 iterations | |
| - **Progressive Strategy**: ~5-10 minutes for 500 iterations | |
| - **Teacher Strategy**: ~5-10 minutes for 500 iterations | |
| **Total**: ~15-30 minutes for full comparison | |
| ## Fallback | |
| If LM Student cannot be imported (e.g., transformers library missing), it will automatically fall back to MockStudentAgent. | |