Upload README.md with huggingface_hub

ced8871 verified about 1 month ago

4.5 kB

	---
	license: apache-2.0
	tags:
	- reinforcement-learning
	- education
	- doubt-prediction
	- adaptive-learning
	- multi-agent-systems
	- gesture-recognition
	- computer-vision
	- q-learning
	- grpo
	- edtech
	- mediapipe
	- privacy
	datasets:
	- synthetic-learning-interactions
	---

	# ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems

	A Research Implementation of RL-Powered Educational Technology

	\| Property \| Value \|
	\|----------\|-------\|
	\| Algorithm \| GRPO + Q-Learning \|
	\| State Dimension \| 64 features \|
	\| Action Dimension \| 10 doubt predictions \|
	\| Policy Version \| 50 \|
	\| Training Samples \| 200 \|
	\| Final Loss \| 0.2465 \|
	\| Avg Reward \| 0.75 \|

	## Overview

	ContextFlow predicts student confusion before it occurs using reinforcement learning and behavioral signal analysis. When a learner's actions suggest they might be struggling (mouse hesitation, scroll reversals, help-seeking gestures), the system proactively offers assistance.

	## Architecture

	```
	┌─────────────────────────────────────────────────┐
	│ 9 Specialized Agents │
	├─────────────────────────────────────────────────┤
	│ • StudyOrchestrator • DoubtPredictorAgent │
	│ • BehavioralAgent • HandGestureAgent │
	│ • RecallAgent • KnowledgeGraphAgent │
	│ • PeerLearningAgent • LLMOrchestrator │
	│ • GestureActionMapper • PromptAgent │
	└─────────────────────────────────────────────────┘
	```

	## Quick Start

	```python
	# Load the model
	from huggingface_hub import hf_hub_download
	import pickle

	path = hf_hub_download(
	repo_id='namish10/contextflow-rl',
	filename='checkpoint.pkl'
	)
	with open(path, 'rb') as f:
	checkpoint = pickle.load(f)

	print(f"Policy version: {checkpoint.policy_version}")
	print(f"Training samples: {checkpoint.training_stats['total_samples']}")
	```

	## State Vector (64 dimensions)

	\| Component \| Dims \| Description \|
	\|-----------\|------\|-------------\|
	\| Topic Embedding \| 32 \| TF-IDF of learning topic \|
	\| Progress \| 1 \| Session progress (0.0-1.0) \|
	\| Confusion Signals \| 16 \| Behavioral indicators \|
	\| Gesture Signals \| 14 \| Hand gesture frequencies \|
	\| Time Spent \| 1 \| Normalized session time \|

	## Actions (10 doubt predictions)

	1. `what_is_backpropagation`
	2. `why_gradient_descent`
	3. `how_overfitting_works`
	4. `explain_regularization`
	5. `what_loss_function`
	6. `how_optimization_works`
	7. `explain_learning_rate`
	8. `what_regularization`
	9. `how_batch_norm_works`
	10. `explain_softmax`

	## Training Results

	\| Epoch \| Loss \| Epsilon \| Avg Reward \|
	\|-------\|------\|---------\|------------\|
	\| 1 \| 1.2456 \| 1.000 \| 0.20 \|
	\| 2 \| 0.8923 \| 0.995 \| 0.35 \|
	\| 3 \| 0.6541 \| 0.990 \| 0.48 \|
	\| 4 \| 0.4127 \| 0.985 \| 0.62 \|
	\| 5 \| 0.2465 \| 0.980 \| 0.75 \|

	## Key Features

	- Predictive Detection: RL-based confusion prediction before it happens
	- Multi-Agent Orchestration: 9 specialized agents working together
	- Gesture Recognition: Privacy-first hand gesture detection with MediaPipe
	- Face Blurring: Real-time face blur for classroom deployment
	- Browser AI Launch: Direct AI chat interface from predicted doubts
	- Spaced Repetition: SM-2 based review scheduling
	- Knowledge Graphs: Concept mapping and learning paths

	## Files

	\| File \| Description \|
	\|------\|-------------\|
	\| `checkpoint.pkl` \| Trained Q-network weights \|
	\| `train_rl.py` \| Training script with GRPO \|
	\| `feature_extractor.py` \| 64-dim state extraction \|
	\| `inference_example.py` \| Usage examples \|
	\| `demo.ipynb` \| Interactive notebook \|
	\| `RESEARCH_PAPER.md` \| Full research paper \|
	\| `evaluation_results.json` \| Training metrics \|
	\| `requirements.txt` \| Dependencies \|
	\| `app/` \| Backend agents (Flask API) \|
	\| `frontend/` \| React frontend \|

	## Evaluation

	See [EVALUATION.md](EVALUATION.md) for detailed metrics and production readiness assessment.

	## Citation

	```bibtex
	@software{contextflow,
	title={ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems},
	author={ContextFlow Team},
	year={2026},
	version={1.0},
	url={https://huggingface.co/namish10/contextflow-rl}
	}
	```