AimNon‐invasive diagnostics for metabolic dysfunction–associated fatty liver disease (MAFLD) remain challenging. We aimed to identify novel key genes as non‐invasive biomarkers for MAFLD, elucidate causal relationships between biomarkers and MAFLD and determine the role of immune cells as potential mediators.Materials and MethodsUtilizing published transcriptome data of patients with biopsy‐proven MAFLD, we applied linear models for microarray data, least absolute shrinkage and selector operation (LASSO) regressions and receiver operating characteristic (ROC) curve analyses to identify and validate biomarkers for MAFLD. Using the expression quantitative trait loci database and a cohort of 778 614 Europeans, we used Mendelian randomization to analyse the causal relationships between key biomarkers and MAFLD. Additionally, mediation analysis was performed to examine the involvement of 731 immunophenotypes in these relationships.ResultsWe identified 31 differentially expressed genes, and LASSO regression showed three hub genes, IGFBP2, PEG10, and P4HA1, with area under the receiver operating characteristic (AUROC) curve of 0.807, 0.772 and 0.791, respectively, for identifying MAFLD. The model of these three genes had an AUROC of 0.959 and 0.800 in the development and validation data sets, respectively. This model was also validated using serum‐based enzyme‐linked immunosorbent assay data from MAFLD patients and control subjects (AUROC: 0.819, 95% confidence interval: 0.736–0.902). PEG10 was associated with an increased MAFLD risk (odds ratio = 1.106, p = 0.032) via inverse variance–weighted analysis, and about 30% of this risk was mediated by the percentage of CD11c + CD62L– monocytes.ConclusionsThe MAFLD panels have good diagnostic accuracy, and the causal link between PEG10 and MAFLD was mediated by the percentage of CD11c + CD62L– monocytes.