“…These deep networks are different in terms of structure and connection weights. Many efforts, such as transfer learning, self-supervised learning, fuzzing with invocation ordering [23, 24], fusion models [25], hierarchical models [26], adapting feature selection algorithms [27], subspace random optimization [28], multi-modal [29], and multi-label [30] techniques have improved the performance of these models [31].…”