“…Feature extraction from protein sequences plays an important role in protein classification [1,2,3,4] of many areas, such as identification of plant pentatricopeptide repeat coding protein [5], prediction of bacterial type IV secreted effectors [6,7], identification of heat shock protein [8], prediction of mitochondrial proteins [9], etc. In general, prevailing encoding approaches of protein sequences for feature extraction include pseudo-amino acid composition (PseAAC) [8,9,10,11,12,13,14,15,16,17,18,19,20], position-specific scoring matrix (PSSM) [7,21,22,23,24,25,26,27,28,29,30], position-specific iterated blast (PSI-BLAST) [31,32,33,34,35] etc.…”