juhenes
/

ngiml

Image Segmentation

Model card Files Files and versions

ngiml / README.md

juhenes's picture

Update README.md

5881d62 verified 2 days ago

|

history blame contribute delete

3.46 kB

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: image-segmentation
	tags:
	- manipulation
	- forgery
	- image
	- cnn
	- transformer
	- residual-noise
	- efficientnet
	- swin
	- localization
	---
	# NGIML Model Card

	## Inference

	NGIML performs single-image forgery localization from a pretrained checkpoint and an input RGB image.

	## Checkpoints

	Pretrained checkpoints are hosted on Hugging Face:

	[juhenes/ngiml](https://huggingface.co/juhenes/ngiml)

	Available checkpoints:

	- `casia-effnet.pt`
	- `casia-effnet+noise.pt`
	- `casia-effnet+swin.pt`
	- `casia-full.pt`
	- `casia-swin.pt`
	- `casia-swin+noise.pt`

	## Run Inference

	### Recommended: Google Colab

	The easiest way to test the model is through Colab:

	- Google Colab: [Open `infer.ipynb` in Colab](https://colab.research.google.com/github/juhenes/ngiml-infer/blob/main/infer.ipynb)

	This is the recommended path for quick testing because the notebook is already set up for checkpoint-based inference.

	### Local CLI

	If you want to run the project locally, use the repository files here:

	- GitHub: [juhenes/ngiml-infer](https://github.com/juhenes/ngiml-infer)

	Install the dependencies:

	```bash
	pip install -r requirements.txt
	```

	Run the CLI:

	```bash
	python predict.py --checkpoint /path/to/checkpoint.pt --image /path/to/image.png
	```

	Example:

	```bash
	python predict.py --checkpoint checkpoints_cache/casia-full.pt --image /path/to/image.png
	```

	If `--output-dir` is omitted, outputs are saved under `outputs/<image-stem>/`.

	## Optional Arguments

	- `--output-dir` to choose where outputs are saved
	- `--threshold` to override the default binary threshold
	- `--normalization-mode` to set `imagenet` or `zero_one`
	- `--resize-max-side` to resize large images before preprocessing
	- `--crop-size` to override the inference crop size
	- `--device` to choose a device such as `cpu` or `cuda:0`

	## Output Files

	When an output directory is used, the runtime saves:

	- `input_rgb.png`
	- `preview_input_rgb.png`
	- `preview_probability_map.png`
	- `preview_binary_mask.png`
	- `preview_overlay.png`
	- `probability_map.png`
	- `binary_mask.png`
	- `overlay.png`
	- `prediction.json`

	`prediction.json` includes summary metadata such as the checkpoint path, threshold, normalization mode, device, and basic prediction statistics.

	## References

	1. Dong, J., Wang, W., and Tan, T. "CASIA Image Tampering Detection Evaluation Database." 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013. [DOI](https://doi.org/10.1109/chinasip.2013.6625374)

	```bibtex
	@inproceedings{Dong2013,
	doi = {10.1109/chinasip.2013.6625374},
	url = {https://doi.org/10.1109/chinasip.2013.6625374},
	year = {2013},
	month = jul,
	publisher = {{IEEE}},
	author = {Jing Dong and Wei Wang and Tieniu Tan},
	title = {{CASIA} Image Tampering Detection Evaluation Database},
	booktitle = {2013 {IEEE} China Summit and International Conference on Signal and Information Processing}
	}
	```

	2. Pham, N. T., Lee, J.-W., Kwon, G.-R., and Park, C.-S. "Hybrid Image-Retrieval Method for Image-Splicing Validation." Symmetry, 11(1), 83, 2019.

	```bibtex
	@article{pham2019hybrid,
	title = {Hybrid Image-Retrieval Method for Image-Splicing Validation},
	author = {Pham, Nam Thanh and Lee, Jong-Weon and Kwon, Goo-Rak and Park, Chun-Su},
	journal = {Symmetry},
	volume = {11},
	number = {1},
	pages = {83},
	year = {2019},
	publisher = {Multidisciplinary Digital Publishing Institute}
	}
	```