“…Feature Extraction Phase. During an off-line feature extraction phase, 3D pose [180], [34], [41], [181] or 6D pose [176], [179], [178], [182], [177], [23], [2], [24], [25] annotated templates involved in the training data are represented with robust feature descriptors. Features are manually-crafted utilizing the available shape, geometry, and appearance information [176], [179], [178], [182], [177], [23], [2], [25], and the recent paradigm in the field is to deep learn those using neural net architectures [180], [34], [41], [181].…”