Since there are many possible influencing factors of visibility, lightweight data requirements in practical applications of machine learning in visibility prediction can reduce the corresponding data observation cost and collection difficulty. By using the long-term measured data in Qingdao, this research comprehensively compares the performance of five common machine learning methods under different training parameter schemes, including XGBoost, LightGBM, Random Forest (RF), Support Vector Machine (SVM) and Multiple Linear Regression (MLR). The lightweight visibility prediction schemes based on pollutant parameter optimization are established. The seasonal training data of five machine learning models is preprocessed, and then performance evaluations of predictions are carried out. The analysis results show that in terms of models, ensemble learning models, including XGBoost, LightGBM, and RF, have significantly better seasonal visibility prediction effects than SVM and MLR models; XGBoost and LightGBM also have slightly better prediction effects than RF models. In terms of pollutant parameters, solid pollutants have a greater impact on visibility prediction than gaseous pollutants; PM2.5 is more influential than PM10 in visibility prediction. The visibility prediction scheme with six parameters using meteorological parameters and PM2.5 based on XGBoost or LightGBM model is preferably established in this research. This scheme can achieve the same prediction performance as the 11 parameter prediction scheme. The Correlation Coefficient (CC) of the results is around 0.85. The results of this study can not only be used to provide a machine learning scheme reference for practical visibility prediction applications, but also help to deepen the understanding of the factors affecting visibility.