Idiosyncratic drug-induced liver injury (DILI) is a complex and unpredictable event caused by different drugs, herbal, and dietary supplements. The early identification of human hepatotoxicity at the preclinical stages remains a major challenge, in which the selection of validated in vitro systems and test drugs has a significant impact. This systematic review aims to analyse the compounds used in hepatotoxicity assays and establish a unified list of DILI positive and negative control drugs for validation of in vitro models of DILI, supported by literature and clinical evidence and endorsed by a committee of experts from the COST Action ProEuroDILI Network (CA 17112). This systematic review was performed in accordance with the 2020 PRISMA guidelines. Original research articles focused on investigating DILI occurrence using in vitro human models performing at least one hepatotoxicity assay with positive and negative control compounds were included. A modified version of the "Toxicological Data Reliability Assessment Tool" (ToxRTool) was used to assess the bias of the included studies. A total of 2,936 studies were retrieved from the different databases. Of these, 51 met the inclusion criteria, with 30 categorized as reliable without restrictions. Diclofenac and buspirone were the most commonly used DILI-positive and DILI-negative control drugs, respectively. Although there was a broad consensus on the positive compounds, the selection of negative control compounds remained less clear. Regarding the models used, the 2D monoculture of primary human hepatocytes (PHHs) was the favoured choice. However, there was no clear consensus on the drug concentrations. Short acute exposure times were mostly utilised, and cytotoxicity was the preferred endpoint. The extensive analysis of included studies highlighted the lack of agreement on appropriate control compounds for the in vitro assessment of DILI. Therefore, following comprehensive in vitro and clinical data analysis together with input from the expert committee, an evidence-based consensus-driven list of 10 positive and negative drugs is proposed for validating new in vitro models for improving preclinical drug safety testing regimes.