MentorFlow / teacher_agent_dev /RUN_LM_COMPARISON.md
Cornelius
Deploy MentorFlow with GPU support
a52f96d
|
raw
history blame
1.25 kB
# Running Comparison with LM Student
## Changes Made
Updated `compare_strategies.py` to use **LM Student (DistilBERT)** instead of MockStudentAgent for all three strategies:
- Random Strategy
- Progressive Strategy
- Teacher Strategy
## Usage
```bash
cd teacher_agent_dev
python compare_strategies.py --iterations 500 --deterministic
```
## Notes
- **LM Student is slower** - Each iteration involves DistilBERT inference/fine-tuning
- Uses DistilBERT for multiple choice questions
- Online learning (fine-tunes on 1 task at a time)
- Memory decay using Ebbinghaus forgetting curve
- Per-topic skill tracking
## Parameters
- `learning_rate`: 5e-5 (LM fine-tuning rate)
- `retention_constant`: 80.0 (slower forgetting)
- `device`: 'cpu' (can be changed to 'cuda' if GPU available)
- `max_length`: 256 tokens
- `gradient_accumulation_steps`: 4
## Expected Runtime
With LM Student:
- **Random Strategy**: ~5-10 minutes for 500 iterations
- **Progressive Strategy**: ~5-10 minutes for 500 iterations
- **Teacher Strategy**: ~5-10 minutes for 500 iterations
**Total**: ~15-30 minutes for full comparison
## Fallback
If LM Student cannot be imported (e.g., transformers library missing), it will automatically fall back to MockStudentAgent.