---
language: en
tags: [narrative-context, film-analysis, multi-task, pytorch, transformer, position-aware]
---
# Narrative Context Module 2 (Position-Aware)

Cross-scene feature evolution Transformer for film narrative understanding.
Consumes 256-d scene embeddings from [`wrathofgod/scene-perception-m1-unfreeze-deberta-small`](https://huggingface.co/wrathofgod/scene-perception-m1-unfreeze-deberta-small).

## Architecture Upgrades vs Previous M2

| Component | Old M2 | New M2 |
|-----------|--------|--------|
| Feature dim | 304-d | 308-d (sin/cos position) |
| Positional encoding | window-relative only | window-relative + film-absolute MLP |
| Feature evolution | none | DeltaEncoder (GLU) as extra token |
| Sequence length | 5 | 7 ([CLS] + 5 scenes + [DELTA]) |
| Context fusion | last token only | CLS ⊕ current-scene via fusion gate |
| Transformer depth | 4L × 8H, FFN=512 | 6L × 8H, FFN=768 |
| Label smoothing | none | ε=0.1 |
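
To make the ε=0.1 label-smoothing entry concrete, here is a minimal sketch of the smoothed target distribution for a classification head. The card does not say which smoothing convention is used or which heads it applies to; this assumes the uniform-mixture form (the one implemented by the `label_smoothing` argument of PyTorch's `CrossEntropyLoss`), and `smooth_targets` is a hypothetical helper name.

```python
def smooth_targets(true_class, num_classes, epsilon=0.1):
    """Mix a one-hot target with a uniform distribution over all classes.

    Assumed convention: target = (1 - epsilon) * one_hot + epsilon / num_classes.
    """
    if not 0 <= true_class < num_classes:
        raise ValueError("true_class must be a valid class index")
    uniform = epsilon / num_classes
    return [
        (1.0 - epsilon) + uniform if k == true_class else uniform
        for k in range(num_classes)
    ]


# Example: a 5-class head with the true class at index 2.
# The true class gets about 0.92 and every other class about 0.02.
targets = smooth_targets(true_class=2, num_classes=5)
```

Under this convention the targets still sum to 1, so they can be used directly as a soft label distribution.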

## Input
5-scene causal window [t-4 … t] per film.
Per-scene feature: 305-d (M1 embedding + metadata + sin/cos position).
Film position (a scalar in [0, 1]) is fed separately to `FilmPositionEncoder`.
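
The windowing described above can be sketched in plain Python. This is an illustration under stated assumptions, not the released preprocessing code: the helper names are hypothetical, the start of a film is padded by repeating the first scene, and film position is taken as `t / (N - 1)`. The card specifies none of these details.

```python
def causal_window(scene_index, num_scenes, window=5):
    """Return scene indices [t-4 ... t] for the scene at `scene_index`.

    Assumption: indices before the first scene are clamped to 0,
    i.e. the first scene is repeated as left padding.
    """
    if not 0 <= scene_index < num_scenes:
        raise IndexError("scene_index out of range")
    start = scene_index - window + 1
    return [max(0, i) for i in range(start, scene_index + 1)]


def film_position(scene_index, num_scenes):
    """Film-absolute position as a scalar in [0, 1] (assumed definition)."""
    if num_scenes <= 1:
        return 0.0
    return scene_index / (num_scenes - 1)


def token_layout(window_indices):
    """Name the 7 tokens of the sequence: [CLS] + 5 scenes + [DELTA]."""
    return ["[CLS]"] + [f"scene_{i}" for i in window_indices] + ["[DELTA]"]


# Example: the 7th scene (index 6) of a 10-scene film.
window = causal_window(6, 10)
layout = token_layout(window)
position = film_position(6, 10)
```

Because the window only looks backward from t, predictions for a scene never depend on later scenes.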

## 7 Prediction Heads

| # | Head | Type | Output |
|---|------|------|--------|
| 1 | scene_valence_continuous | regression | -1.0 to 1.0 |
| 2 | tension_level | regression | 1 to 10 |
| 3 | arousal_level | regression | 1 to 10 |
| 4 | emotional_shift_trigger | binary | True / False |
| 5 | narrative_arc_position | 5-class | Setup / Rising / Climax / Falling / Resolution |
| 6 | foreshadowing_type | 4-class | None / Foreshadow / Payoff / Echo |
| 7 | transition_type | 5-class | attacca / fade / segue / silence / cut |
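
As a rough illustration of how the seven head outputs in the table might be turned into readable values, the sketch below decodes a dictionary of raw outputs. Several things here are assumptions rather than documented behavior: the dictionary keys, class indices following the order listed in the table, the binary head emitting a logit (thresholded at 0), and regression outputs being clamped to the stated ranges. `decode_heads` is a hypothetical helper.

```python
ARC_LABELS = ["Setup", "Rising", "Climax", "Falling", "Resolution"]
FORESHADOWING_LABELS = ["None", "Foreshadow", "Payoff", "Echo"]
TRANSITION_LABELS = ["attacca", "fade", "segue", "silence", "cut"]


def argmax(values):
    """Index of the largest value in a non-empty list."""
    return max(range(len(values)), key=lambda i: values[i])


def clamp(x, lo, hi):
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def decode_heads(raw):
    """Map raw head outputs (floats and lists of logits) to labelled values."""
    return {
        "scene_valence_continuous": clamp(raw["scene_valence_continuous"], -1.0, 1.0),
        "tension_level": clamp(raw["tension_level"], 1.0, 10.0),
        "arousal_level": clamp(raw["arousal_level"], 1.0, 10.0),
        # Assumed: a positive logit means the trigger fires.
        "emotional_shift_trigger": raw["emotional_shift_trigger"] > 0.0,
        "narrative_arc_position": ARC_LABELS[argmax(raw["narrative_arc_position"])],
        "foreshadowing_type": FORESHADOWING_LABELS[argmax(raw["foreshadowing_type"])],
        "transition_type": TRANSITION_LABELS[argmax(raw["transition_type"])],
    }


# Example with made-up raw outputs (not real model predictions).
example = decode_heads({
    "scene_valence_continuous": -0.35,
    "tension_level": 11.2,
    "arousal_level": 6.0,
    "emotional_shift_trigger": 1.4,
    "narrative_arc_position": [0.1, 0.3, 2.0, 0.2, -1.0],
    "foreshadowing_type": [1.5, 0.2, 0.1, 0.0],
    "transition_type": [0.0, 0.1, 0.2, 0.3, 0.9],
})
```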