Spaces:

apoorvrajdev
/

image-captioning-api

Configuration error

App Files Files Community

image-captioning-api / notebooks /README.md

apoorvrajdev

feat: bootstrap production-grade ML repository tooling

b2594db 27 days ago

preview code

raw

history blame contribute delete

3.37 kB

Notebooks

This directory holds Jupyter notebooks. Each notebook has a specific role in the project lifecycle, and the rules are different for each one.

`01_ieee_inceptionv3_transformer.ipynb` — FROZEN

This notebook is the canonical research artefact behind the IEEE publication AI Narratives: Bridging Visual Content and Linguistic Expression. It contains the exact training pipeline, hyperparameters, and inference code used to produce the BLEU ~24 score reported in the paper.

Why is it frozen?

Reproducibility of a published result is non-negotiable. If the notebook drifts from what the paper describes, anyone trying to reproduce the result — reviewers, future students, recruiters running the demo — will see numbers that don't match the paper. That breaks scientific trust.

Rules

Do not edit cells. No improvements, no refactors, no comment fixes.
Do not re-run cells with different seeds. The committed outputs are reference outputs — they are stripped on commit by nbstripout, but the structure must stay identical.
Improvements go into the modular package at src/captioning/, never back into this notebook.
Parity is enforced in CI. The make freeze-paper-notebook target computes a SHA-256 of this file and asserts it matches the locked hash in .paper-notebook.sha256. If you change a cell, CI fails until you either revert OR explicitly re-lock with make lock-paper-notebook AND update the paper / model card to reflect the new behaviour.

When this rule changes

The frozen state lifts when (and only when) we publish a v2 of the paper or explicitly mark a re-run in the changelog. Until then, treat this file like a museum exhibit.

`02_dataset_eda.ipynb` — exploratory (Phase 1+)

Dataset inspection. Caption length distributions, vocabulary coverage, image dimension histograms, class balance across COCO super-categories. This notebook may be edited freely; it's a working scratchpad, not a published artefact.

`03_attention_visualization.ipynb` — exploratory (Phase 4+)

Visualisations of decoder attention weights over image patches. Used to generate the figures in docs/results/. Outputs are stripped by nbstripout on commit; PNGs land in docs/images/attention/ when explicitly exported.

Conventions for new notebooks

If you add a new notebook:

Number it (04_*, 05_*) so the lifecycle order is obvious.
Use prose Markdown cells between code cells — a notebook reads like a short paper, not a Python script.
Do not import from notebooks/ elsewhere in the codebase. Notebooks consume the captioning package; they never define library code.
Strip outputs before committing. nbstripout does this automatically if you ran make install-hooks. Without that hook, run nbstripout notebooks/your_notebook.ipynb manually before git add.

Why notebooks at all?

Notebooks are excellent for exploration — narrative, mixed media, iterative data wrangling. They are bad for libraries — no testing, no type-checking, no module reuse, hidden cell-execution-order bugs. The IEEE notebook stays because the paper points at it; everything else lives in src/captioning/.