Duplicated from camenduru/openpose

JusJ7
/

openpose

Model card Files Files and versions

openpose / 3rdparty /caffe /docs /tutorial /data.md

JusJ7's picture

Duplicate from camenduru/openpose

b7f741c 21 days ago

|

history blame contribute delete

3.5 kB

	---
	title: Data
	---
	# Data: Ins and Outs

	Data flows through Caffe as [Blobs](net_layer_blob.html#blob-storage-and-communication).
	Data layers load input and save output by converting to and from Blob to other formats.
	Common transformations like mean-subtraction and feature-scaling are done by data layer configuration.
	New input types are supported by developing a new data layer -- the rest of the Net follows by the modularity of the Caffe layer catalogue.

	This data layer definition

	layer {
	name: "mnist"
	# Data layer loads leveldb or lmdb storage DBs for high-throughput.
	type: "Data"
	# the 1st top is the data itself: the name is only convention
	top: "data"
	# the 2nd top is the ground truth: the name is only convention
	top: "label"
	# the Data layer configuration
	data_param {
	# path to the DB
	source: "examples/mnist/mnist_train_lmdb"
	# type of DB: LEVELDB or LMDB (LMDB supports concurrent reads)
	backend: LMDB
	# batch processing improves efficiency.
	batch_size: 64
	}
	# common data transformations
	transform_param {
	# feature scaling coefficient: this maps the [0, 255] MNIST data to [0, 1]
	scale: 0.00390625
	}
	}

	loads the MNIST digits.

	Tops and Bottoms: A data layer makes top blobs to output data to the model.
	It does not have bottom blobs since it takes no input.

	Data and Label: a data layer has at least one top canonically named data.
	For ground truth a second top can be defined that is canonically named label.
	Both tops simply produce blobs and there is nothing inherently special about these names.
	The (data, label) pairing is a convenience for classification models.

	Transformations: data preprocessing is parametrized by transformation messages within the data layer definition.

	layer {
	name: "data"
	type: "Data"
	[...]
	transform_param {
	scale: 0.1
	mean_file_size: mean.binaryproto
	# for images in particular horizontal mirroring and random cropping
	# can be done as simple data augmentations.
	mirror: 1 # 1 = on, 0 = off
	# crop a `crop_size` x `crop_size` patch:
	# - at random during training
	# - from the center during testing
	crop_size: 227
	}
	}

	Prefetching: for throughput data layers fetch the next batch of data and prepare it in the background while the Net computes the current batch.

	Multiple Inputs: a Net can have multiple inputs of any number and type. Define as many data layers as needed giving each a unique name and top. Multiple inputs are useful for non-trivial ground truth: one data layer loads the actual data and the other data layer loads the ground truth in lock-step. In this arrangement both data and label can be any 4D array. Further applications of multiple inputs are found in multi-modal and sequence models. In these cases you may need to implement your own data preparation routines or a special data layer.

	Improvements to data processing to add formats, generality, or helper utilities are welcome!

	## Formats

	Refer to the layer catalogue of [data layers](layers.html#data-layers) for close-ups on each type of data Caffe understands.

	## Deployment Input

	For on-the-fly computation deployment Nets define their inputs by `input` fields: these Nets then accept direct assignment of data for online or interactive computation.