“…This kind of approaches could be categorized as sequential fusion. This paradigm can be used for combining different feature modalities [54], or simply different visual feature sets [55,56]. In other approaches, global and local image descriptors are used sequentially, the first ones performing a coarse similarity search, the latter ones, to refine the search [57,58].…”