Upload folder using huggingface_hub

03c6de8 verified about 2 months ago

10.2 kB

	![img.png](img.png)
	# Multi-Task Learning for Emotional and Cognitive State Detection

	[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)
	[![TensorFlow 2.x](https://img.shields.io/badge/tensorflow-2.x-orange.svg)](https://www.tensorflow.org/)
	[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

	Simultaneously predict emotional state (stress) and cognitive state (mental effort) from physiological signals using a unified deep learning architecture.

	> 🎯 Key Innovation: Multi-task framework achieving 70.7% accuracy with multimodal fusion (+8.3% over IBI-only) and principled data preparation (+3.7% over naive training).

	---

	## Why Multi-Task Learning?

	Traditional approaches train separate models for stress and effort detection. Our unified multi-task framework offers:

	- Joint representation learning: Shared encoders extract features relevant to both emotional and cognitive states
	- Efficient architecture: Single model (~87k parameters) predicts both dimensions simultaneously
	- Task-specific training: Principled data preparation (masking ambiguous samples) improves generalization
	- Real-time inference: One forward pass → both stress and effort predictions

	---

	## Quick Start

	```bash
	# Install
	git clone https://github.com/yisakSyn/SynheartFocus_MultitaskModel.git
	cd SynheartFocus_MultitaskModel
	pip install -r requirements.txt

	# Prepare data (SWELL-KW dataset)
	python prepare_multimodal_data_v5.py

	# Train with LOSO cross-validation
	python trainer_v8_masked_effortMultimodal.py --loso --data_dir ./prepared_data_v5_multimodal

	# Single holdout evaluation
	python trainer_v8_masked_effortMultimodal.py --holdout pp09 pp17 pp25
	```

	---

	## Architecture

	```
	Physiological Inputs (4 streams)
	├─ IBI timeseries (120 samples @ 2Hz = 60s) ──┐
	├─ HRV features (14 features) ├─→ IBI Encoder (CNN-LSTM) → Dense(32)
	├─ EDA timeseries (120 samples @ 2Hz = 60s) ──┤ ↓
	└─ EDA features (12 features) └─→ EDA Encoder (CNN-LSTM) → Dense(32)
	↓
	Fusion: Concat(32+16+32+16) = 96d
	↓
	Dense(64) → Dense(32)
	↓
	┌──────────────────────┴──────────────────────┐
	Stress Head Effort Head
	Dense(2, softmax) Dense(2, softmax)
	(trained on all samples) (trained on valid only)
	```

	### Model Details

	IBI Encoder (CNN-LSTM):
	- Conv1D blocks: 24→32→48 filters (kernels 7→5→3)
	- LSTM(32) with attention pooling
	- HRV feature branch: Dense(24→16)
	- Output: 48 dimensions (32+16)

	EDA Encoder (CNN-LSTM):
	- Conv1D blocks: 24→32→48 filters (kernels 9→7→5, larger for EDA)
	- LSTM(32) with attention pooling
	- EDA feature branch: Dense(24→16)
	- Output: 48 dimensions (32+16)

	Fusion & Classification:
	- Concatenate: [IBI(32) + HRV(16) + EDA(32) + EDA_feat(16)] = 96 dims
	- Dense layers: 64 → 32 with dropout (0.5, 0.25)
	- Two task-specific output heads (softmax)

	Joint Optimization:
	```python
	# Loss computation
	loss_stress = focal_loss(y_true_stress, y_pred_stress) # All samples
	loss_effort = masked_focal_loss(y_true_effort, y_pred_effort, mask) # Valid only
	total_loss = loss_stress + loss_effort
	```

	Parameters: ~87,000 total

	---

	## Results

	### LOSO Cross-Validation (18 subjects)

	\| Metric \| Accuracy \| Std Dev \|
	\|--------\|----------\|---------\|
	\| Stress Detection \| 68.9% \| ±12.9% \|
	\| Effort Detection \| 72.5% \| ±14.1% \|
	\| Average \| 70.7% \| ±13.2% \|

	### Ablation Studies

	1. Multimodal Fusion:

	\| Configuration \| Stress \| Effort \| Average \|
	\|---------------\|--------\|--------\|---------\|
	\| IBI only (Run 5) \| 60.2% \| 64.6% \| 62.4% \|
	\| IBI + EDA (ours, Run 3) \| 68.9% \| 72.5% \| 70.7% \|
	\| Improvement \| +8.7% \| +7.9% \| +8.3% \|

	Key Finding: Multimodal fusion (IBI + EDA) provides substantial improvement over single-modality IBI-only approach.

	2. Task-Specific Data Preparation:

	\| Training Strategy \| Stress \| Effort \| Average \|
	\|-------------------\|--------\|--------\|---------\|
	\| All conditions (c1, c2, c3) \| 68.7% \| 65.2% \| 67.0% \|
	\| Task-specific masking (ours) \| 68.9% \| 72.5% \| 70.7% \|
	\| Improvement \| +0.2% \| +7.3%* \| +3.7% \|

	*p < 0.01 (McNemar's test)

	Key Finding: Masking ambiguous effort labels (c2 interruption condition) improves effort detection by 7.3% while maintaining stress accuracy.

	### Key Contributions Validated

	Our experimental results validate two independent contributions:

	1. Multimodal Fusion (+8.3%): Combining IBI and EDA signals outperforms IBI-only approach
	2. Task-Specific Data Preparation (+3.7%): Masking ambiguous samples improves effort detection
	3. Combined Framework (70.7%): Multi-task architecture with principled data curation

	---

	## Methodology

	### Multi-Task Learning Framework

	Core Design: Unified architecture with shared encoders and task-specific output heads enables joint optimization of both emotional and cognitive state prediction.

	Loss Function:
	```python
	L_total = L_stress + L_effort

	where:
	L_stress = FocalLoss(y_stress, pred_stress) # All samples
	L_effort = MaskedFocalLoss(y_effort, pred_effort, mask) # Valid samples only
	```

	### Task-Specific Data Preparation

	\| Condition \| Stress Label \| Effort Label \| Training Strategy \|
	\|-----------\|--------------\|--------------\|-------------------\|
	\| c1 (Neutral) \| 0 (Low) \| 0 (Low) \| Both tasks trained \|
	\| c2 (Interruption) \| 1 (High) \| MASKED \| Stress only \|
	\| c3 (Time Pressure) \| 1 (High) \| 1 (High) \| Both tasks trained \|

	Rationale: Interruption-based tasks (c2) produce ambiguous effort labels due to individual differences in multitasking ability. NASA-TLX analysis shows 2.2× higher variance in effort ratings for c2 (CV=0.44) vs. c3 (CV=0.20).

	Implementation:
	- Stress head: Trained on all 11,332 samples
	- Effort head: Trained on 7,998 valid samples (c1 + c3 only)
	- Masking applied during loss computation, not data filtering

	### Signal Processing

	IBI Extraction:
	1. R-peak detection from ECG (2048 Hz)
	2. Artifact correction (physiological constraints: 300-2000ms)
	3. Cubic spline interpolation to 2 Hz
	4. 120-sample windows (60s at 2 Hz) with 75% overlap

	HRV Features (14):
	- Time-domain (7): Mean IBI, SDNN, RMSSD, pNN50, CV, Mean HR, Std HR
	- Frequency-domain (4): LF, HF, LF/HF, Total power
	- Nonlinear (3): SD1, SD2, SD1/SD2

	EDA Processing:
	1. Detrending and lowpass filter (1 Hz cutoff)
	2. Downsample to 2 Hz
	3. 120-sample windows with 75% overlap

	EDA Features (12):
	- Raw statistics (4): Mean, SD, Min, Max
	- Tonic component (4): Mean, SD, slope, range
	- Phasic component (4): Mean amplitude, SD, SCR count, Mean SCR height

	---

	## Project Structure

	```
	multitask-stress-effort/
	├── prepare_multimodal_data_v5.py # Raw signals → ML features
	├── trainer_v8_masked_effortMultimodal.py # Multi-task training
	├── Questionnaire.xlsx # Labels from SWELL-KW
	├── Processed_IBI_HR_EDA/ # MATLAB-processed signals
	│ └── ppXX_date_cY_IBI_HR_EDA.mat
	├── prepared_data_v5_multimodal/ # ML-ready dataset
	│ ├── X_ibi.npy # (N, 120, 1)
	│ ├── X_hrv.npy # (N, 14)
	│ ├── X_eda.npy # (N, 120, 1)
	│ ├── X_eda_features.npy # (N, 12)
	│ ├── y_stress.npy # (N, 2) one-hot
	│ ├── y_effort.npy # (N, 2) one-hot, masked=[0,0]
	│ ├── effort_mask.npy # (N,) 1=valid, 0=masked
	│ └── ...
	└── runs/ # Training results
	└── synheart_v8_multimodal_*/
	```

	---

	## Training Configuration

	\| Parameter \| Value \|
	\|-----------\|-------\|
	\| Optimizer \| AdamW \|
	\| Learning Rate \| 2×10⁻⁴ \|
	\| Weight Decay \| 1×10⁻³ \|
	\| Batch Size \| 64 \|
	\| Max Epochs \| 200 \|
	\| Early Stopping \| 25 epochs patience \|
	\| Loss Function \| Focal Loss (γ=1.5) + Label Smoothing (ε=0.05) \|
	\| Dropout \| 0.35 (encoders), 0.50 (fusion) \|
	\| Augmentation \| Time Masking (p=0.5, width=15) \|

	---

	## Citation

	```bibtex
	@article{author2025multitask,
	title={Simultaneous Detection of Emotional and Cognitive States:
	A Multi-Task Deep Learning Approach},
	author={[Your Name]},
	journal={IEEE Transactions on Affective Computing},
	year={2025},
	note={Under review}
	}
	```

	---

	## Requirements

	```
	tensorflow>=2.10.0
	numpy>=1.21.0
	scipy>=1.7.0
	scikit-learn>=1.0.0
	matplotlib>=3.5.0
	pandas>=1.3.0
	openpyxl>=3.0.0 # For Excel label files
	```

	Full list: [requirements.txt](requirements.txt)

	---

	## Key References

	Multi-Task Learning:
	- Caruana (1997) - "Multitask Learning"
	- Ruder (2017) - "Multi-Task Learning in Deep Neural Networks"

	Dataset & Cognitive Load:
	- Koldijk et al. (2014) - "SWELL Knowledge Work Dataset"
	- Monsell (2003) - "Task Switching"
	- Hockey (1997) - "Compensatory Control Theory"

	---

	## License

	MIT License - see [LICENSE](LICENSE) for details.

	---

	## Contact

	[Yisak T] - [yisak@synheart.ai]
	PhD

	---

	<p align="center">
	<i>Multi-task framework for joint emotional and cognitive state detection from physiological signals</i><br>
	<b>70.7% LOSO accuracy \| Multimodal IBI+EDA fusion \| Task-specific training</b>
	</p>