Abstract-Multi sensor fusion has been widely used in recognition problems. Most existing work highly depend on the calibration between different sensor information, but less on modeling and reasoning of co-incidence of multiple hints. In this paper, we propose a generic framework for recognition and clustering problem using a non-parametric Dirichlet hierarchical model. It enables online labeling, clustering and recognition of sequential data simultaneously, while taking into account multiple types of sensor readings. The algorithm is data-driven, which does not depend on prior-knowledge of the data structure. The result shows the feasibility and its reliability against noise data.