“…Actually, there are a variety of scenarios in which the classifications of objects (bags) can only be determined by some key components (instances), such as medical diagnoses; that is, some instances trigger the bag label. Following this concept, DTIs can be characterized by an MIL framework: the private representation contains abundant information that has been proven to be effective for DTA prediction [ 6 , 9 , 11 , 19 ], as does each public feature obtained via early fusion [ 2 , 12 , 13 , 14 ] and each public feature obtained via concatenation [ 4 , 5 , 7 , 20 ]. However, the exact contribution of each instance to the final DTA value of the bag is unknown.…”