contextflow-rl / README.md
namish10's picture
Upload README.md with huggingface_hub
ced8871 verified
---
license: apache-2.0
tags:
- reinforcement-learning
- education
- doubt-prediction
- adaptive-learning
- multi-agent-systems
- gesture-recognition
- computer-vision
- q-learning
- grpo
- edtech
- mediapipe
- privacy
datasets:
- synthetic-learning-interactions
---
# ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems
**A Research Implementation of RL-Powered Educational Technology**
| Property | Value |
|----------|-------|
| **Algorithm** | GRPO + Q-Learning |
| **State Dimension** | 64 features |
| **Action Dimension** | 10 doubt predictions |
| **Policy Version** | 50 |
| **Training Samples** | 200 |
| **Final Loss** | 0.2465 |
| **Avg Reward** | 0.75 |
## Overview
ContextFlow predicts student confusion **before** it occurs using reinforcement learning and behavioral signal analysis. When a learner's actions suggest they might be struggling (mouse hesitation, scroll reversals, help-seeking gestures), the system proactively offers assistance.
## Architecture
```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ 9 Specialized Agents β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚ β€’ StudyOrchestrator β€’ DoubtPredictorAgent β”‚
β”‚ β€’ BehavioralAgent β€’ HandGestureAgent β”‚
β”‚ β€’ RecallAgent β€’ KnowledgeGraphAgent β”‚
β”‚ β€’ PeerLearningAgent β€’ LLMOrchestrator β”‚
β”‚ β€’ GestureActionMapper β€’ PromptAgent β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
```
## Quick Start
```python
# Load the model
from huggingface_hub import hf_hub_download
import pickle
path = hf_hub_download(
repo_id='namish10/contextflow-rl',
filename='checkpoint.pkl'
)
with open(path, 'rb') as f:
checkpoint = pickle.load(f)
print(f"Policy version: {checkpoint.policy_version}")
print(f"Training samples: {checkpoint.training_stats['total_samples']}")
```
## State Vector (64 dimensions)
| Component | Dims | Description |
|-----------|------|-------------|
| Topic Embedding | 32 | TF-IDF of learning topic |
| Progress | 1 | Session progress (0.0-1.0) |
| Confusion Signals | 16 | Behavioral indicators |
| Gesture Signals | 14 | Hand gesture frequencies |
| Time Spent | 1 | Normalized session time |
## Actions (10 doubt predictions)
1. `what_is_backpropagation`
2. `why_gradient_descent`
3. `how_overfitting_works`
4. `explain_regularization`
5. `what_loss_function`
6. `how_optimization_works`
7. `explain_learning_rate`
8. `what_regularization`
9. `how_batch_norm_works`
10. `explain_softmax`
## Training Results
| Epoch | Loss | Epsilon | Avg Reward |
|-------|------|---------|------------|
| 1 | 1.2456 | 1.000 | 0.20 |
| 2 | 0.8923 | 0.995 | 0.35 |
| 3 | 0.6541 | 0.990 | 0.48 |
| 4 | 0.4127 | 0.985 | 0.62 |
| 5 | 0.2465 | 0.980 | 0.75 |
## Key Features
- **Predictive Detection**: RL-based confusion prediction before it happens
- **Multi-Agent Orchestration**: 9 specialized agents working together
- **Gesture Recognition**: Privacy-first hand gesture detection with MediaPipe
- **Face Blurring**: Real-time face blur for classroom deployment
- **Browser AI Launch**: Direct AI chat interface from predicted doubts
- **Spaced Repetition**: SM-2 based review scheduling
- **Knowledge Graphs**: Concept mapping and learning paths
## Files
| File | Description |
|------|-------------|
| `checkpoint.pkl` | Trained Q-network weights |
| `train_rl.py` | Training script with GRPO |
| `feature_extractor.py` | 64-dim state extraction |
| `inference_example.py` | Usage examples |
| `demo.ipynb` | Interactive notebook |
| `RESEARCH_PAPER.md` | Full research paper |
| `evaluation_results.json` | Training metrics |
| `requirements.txt` | Dependencies |
| `app/` | Backend agents (Flask API) |
| `frontend/` | React frontend |
## Evaluation
See [EVALUATION.md](EVALUATION.md) for detailed metrics and production readiness assessment.
## Citation
```bibtex
@software{contextflow,
title={ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems},
author={ContextFlow Team},
year={2026},
version={1.0},
url={https://huggingface.co/namish10/contextflow-rl}
}
```