In this work, a data-driven approach for the identification of a piece-wise linear model for nitrogen oxide daily concentration simulation is presented and applied. The model has been identified by using daily measured concentrations, meteorological variables, and emission levels estimated starting from the results contained in suitable emission databases. We propose an innovative methodology that jointly optimizes clustering and parameter identification. The procedure has been applied considering data from the Milan (Italy) metropolitan area. The methodology has been compared with two state-of-the-art approaches based on a two-step, cluster-based algorithm and on Hammerstein–Wiener models. The results show how, in the presented application, the devised approach ensures better performance with respect to the two literature methods, both in terms of statistical indexes (correlation, normalized mean absolute error) and in terms of problem-specific metrics (hit ratio, false alarm). For this reason, the approach can be considered suitable to be used in the definition of optimal emission control strategies.