“…
Global encoding: Bag-of-Words (BoW) [5], Pyramid Kernel [60], Tree codebook [61], Kernel codebook (KC) [62], Sparse coding [63], Locality-constrained Linear Coding (LLC) [64], Hamming Embedding (HE) [65], VLAD [66], Fisher Kernel (FK) [67], Super Vector [68], Bag-of-Binary-Words [69], BVLAD [70]
Other: Location coding [71,72], SIFT-Preserving JPEG [73] and H.264/AVC [74], Chen and Moulin [75], Hybrid ATC (HATC) [7], Interframe patch [9] and descriptor [76] coding, VideoSIFT [10], VideoBRISK [77]
Feature networking (Section V): Yang et al. [78,79], feature extraction offloading [80][81][82], lossy feature transmission [3], Mobile Visual Search [83]

… exploited to encode visual features, providing a significant coding gain with respect to the case of still images. Similar works in the previous literature focus on either feature extraction [12,13] or encoding [14,15].…”