Aiming at the problems of poor security and clustering accuracy in current data clustering algorithms, a controllable clustering algorithm for real-time streaming big data based on multi-source data fusion is proposed. The FIR filter structure model is used to suppress network interference, and ant colony algorithm is used to detect the abnormal data in the big data. By optimizing the iteration, the pheromone concentration is placed in the front position as the abnormal data point, and the filter is introduced. The fusion scope of multi-source data fusion is set. Combined with the data similarity function, the multi-source data fusion concept is used to construct the associated real-time streaming big data fusion device, and the data deduplication results are substituted into the fusion device to obtain the data clustering result. The experiments show that the proposed algorithm has high safety factor, good data clustering accuracy, high clustering efficiency, and low energy consumption.