In environmental research, missing data are often a challenge for statistical modeling. This paper addressed some advanced techniques to deal with missing values in a data set measuring air quality using a multiple imputation (MI) approach. MCAR, MAR, and NMAR missing data techniques are applied to the data set. Five missing data levels are considered: 5%, 10%, 20%, 30%, and 40%. The imputation method used in this paper is an iterative imputation method, missForest, which is related to the random forest approach. Air quality data sets were gathered from five monitoring stations in Kuwait, aggregated to a daily basis. Logarithm transformation was carried out for all pollutant data, in order to normalize their distributions and to minimize skewness. We found high levels of missing values for NO2 (18.4%), CO (18.5%), PM10 (57.4%), SO2 (19.0%), and O3 (18.2%) data. Climatological data (i.e., air temperature, relative humidity, wind direction, and wind speed) were used as control variables for better estimation. The results show that the MAR technique had the lowest RMSE and MAE. We conclude that MI using the missForest approach has a high level of accuracy in estimating missing values. MissForest had the lowest imputation error (RMSE and MAE) among the other imputation methods and, thus, can be considered to be appropriate for analyzing air quality data.
Rheumatoid arthritis (RA) is a chronic autoimmune of an unknown etiology. Air pollution has been proposed as one of the possible risk factors associated with disease activity, although has not been extensively studied. In this study, we measured the relationship between exposure to air pollutants and RA activity. Data on RA patients were extracted from the Kuwait Registry for Rheumatic Diseases (KRRD). Disease activity was measured using disease activity score with 28 examined joints (DAS-28) and the Clinical Disease Activity Index (CDAI) during their hospital visits from 2013 to 2017. Air pollution was assessed using air pollution components (PM 10 , NO 2 , SO 2 , O 3 , and CO). Air pollution data were obtained from Kuwait Environmental Public Authority (K-EPA) from six different air quality-monitoring stations during the same period. Multiple imputations by the chained equations (MICE) algorithm were applied to estimate missing air pollution data. Patients data were linked with air pollution data according to date and patient governorate address. Descriptive statistics, correlation analysis, and linear regression techniques were employed using STATA software. In total, 1651 RA patients with 9875 follow-up visits were studied. We detected an increased risk of RA using DAS-28 in participants exposed to SO 2 and NO 2 with β = 0 . 003 (95% CI: 0.0004–0.005, p < 0 . 01 ) and β = 0 . 003 (95% CI: 0.002–0.005, p < 0 . 01 ), respectively, but not to PM 10 , O 3 , and CO concentrations. Conclusively, we observed a strong association between air pollution with RA disease activity. This study suggests air pollution as a risk factor for RA and recommends further measures to be taken by the authorities to control this health problem.
The textural and mineralogical characteristics of the dust fallout along with the potential effect on human health were investigated at 12 locations in Kuwait City and suburbs during March 2006-February 2007. Mineralogically, the dust comprises mostly calcite and quartz with grain size ranging from 1 to 25 lm. Chemically, elements such as Ca, Fe, Mg, and Al are the dominant ones with the highest amount of average percentage concentration 12.89% for Ca. Results showed that humans in the 12 sites are being exposed to different ranges of average percentage concentrations of minerals such as calcite (15.00-51.80%), quartz (26.40-66.9%), albite (6.30-17.30%), dolomite (1.20-10.20%), and gypsum (0.00-6.4%), where site K3 recorded the lowest average concentrations for calcite, dolomite, and gypsum and the highest for quartz, and albite.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.