2020
DOI: 10.48550/arxiv.2012.09258
Preprint

Detection of data drift and outliers affecting machine learning model performance over time

Abstract: A trained ML model is deployed on another 'test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a cat/dog image classifier, for instance, drift during deployment could be rabbit images (new class) or cat/dog images with changed characteristics (change in distribution). We wish to detect these changes but can't measure accuracy without deployment data labels. We instead detect …
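The abstract describes detecting drift without access to deployment labels. The paper's own detection method is not reproduced here; as a generic, hedged illustration of label-free drift detection, the sketch below compares per-feature distributions of the training and deployment data with two-sample Kolmogorov-Smirnov tests. The function name, threshold, and toy data are illustrative assumptions, not the paper's algorithm.

```python
# Minimal sketch of label-free drift detection (generic illustration, not the
# paper's specific method): compare per-feature distributions of the training
# data and the unlabeled deployment data with two-sample KS tests.
import numpy as np
from scipy.stats import ks_2samp

def detect_drift(X_train, X_deploy, alpha=0.01):
    """Return (feature index, statistic, p-value) for features whose deployment
    distribution differs significantly from the training distribution."""
    drifted = []
    for j in range(X_train.shape[1]):
        res = ks_2samp(X_train[:, j], X_deploy[:, j])
        if res.pvalue < alpha:  # reject "same distribution" for feature j
            drifted.append((j, res.statistic, res.pvalue))
    return drifted

# Toy usage: feature 1 shifts at deployment time, feature 0 does not.
rng = np.random.default_rng(0)
X_train = rng.normal(0.0, 1.0, size=(1000, 2))
X_deploy = np.column_stack([
    rng.normal(0.0, 1.0, 1000),   # unchanged feature
    rng.normal(1.5, 1.0, 1000),   # shifted feature (drift)
])
print(detect_drift(X_train, X_deploy))
```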

Cited by 12 publications (14 citation statements)
References 5 publications
“…This method aims to maximise the data quality by reducing various data issues, such as outliers, correlated features, skewed data, imbalanced categorical data, etc. [1, 39, 50]. This method also involves the application of automated correction algorithms to correct these issues; for example, the SMOTE technique [18] can be used to mitigate the class imbalance problem, or redundant data can be removed using automated algorithms.…”
Section: Data Configuration Mechanisms (mentioning)
confidence: 99%
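The quoted passage names SMOTE as one automated correction for class imbalance. As a hedged, minimal sketch (the citing paper does not specify an implementation), SMOTE from the imbalanced-learn package could be applied as follows; the synthetic dataset and parameters are illustrative assumptions.

```python
# Minimal sketch of one automated correction mentioned above: oversampling the
# minority class with SMOTE (imbalanced-learn). Illustrative only.
from collections import Counter
from sklearn.datasets import make_classification
from imblearn.over_sampling import SMOTE

# Synthetic, imbalanced two-class data (roughly 10% minority class).
X, y = make_classification(n_samples=2000, n_features=10,
                           weights=[0.9, 0.1], random_state=42)
print("before:", Counter(y))

# SMOTE synthesises new minority-class samples by interpolating between
# existing minority samples and their nearest neighbours.
X_res, y_res = SMOTE(random_state=42).fit_resample(X, y)
print("after: ", Counter(y_res))
```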
“…The manual configuration approach enables domain experts, such as healthcare experts, to utilise their prior knowledge to assess the importance of predictor variables and mitigate bias or anomalies within the training data. In contrast, the automated configuration highlights potential issues in the training data [1,39,50] and allows users to select the issues that need correction. The system automatically applies correction algorithms to minimise these potential issues and retrains the prediction model on the configured data.…”
Section: Introduction (mentioning)
confidence: 99%
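The quoted workflow (flag issues in the training data, let the user choose corrections, apply them, retrain) can be illustrated with a short sketch. This is an assumed, generic rendering rather than the cited system's actual pipeline; IsolationForest-based outlier flagging and logistic regression are stand-ins chosen for illustration.

```python
# Hedged sketch of the automated-configuration loop described above: flag a
# data issue (outliers, detected with IsolationForest as an assumed example),
# drop the flagged rows if the user opts in, then retrain the model.
from sklearn.datasets import make_classification
from sklearn.ensemble import IsolationForest
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1000, n_features=8, random_state=0)

# Step 1: highlight potential issues in the training data.
outlier_mask = IsolationForest(contamination=0.05,
                               random_state=0).fit_predict(X) == -1
print(f"flagged {outlier_mask.sum()} potential outliers")

# Step 2: user selects which issues to correct (hard-coded here for brevity).
apply_outlier_removal = True

# Step 3: apply the correction and retrain the prediction model.
if apply_outlier_removal:
    X, y = X[~outlier_mask], y[~outlier_mask]
model = LogisticRegression(max_iter=1000).fit(X, y)
print("retrained on", len(y), "rows")
```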
“…For example, radiologists working in the NHS breast screening programme are subject to a range of monitoring and auditing procedures (Cohen et al 2018). Data and concept 'drift' mean that an AI system's performance may also change over time (Davis et al 2017a, 2017b, Health 2022), raising the need for monitoring and auditing procedures and tools to detect changes that might put patient safety at risk (Ackerman et al 2020, Henne et al 2020, Nix et al 2022). Providing support for the monitoring and auditing of AI systems (Davis et al 2019, Liu et al 2022) would therefore be another scenario to be taken into consideration in the design of the system biography described above.…”
Section: Understanding Accountability As a Constraint And A Resource … (mentioning)
confidence: 99%
“…Surrogate models (i.e. simplified proxies of a model; also called emulators) must be treated with care because some may miss important fringe cases or rare events that more fine-grained models are able to better predict, such as in the case of machine learning algorithms deployed with insufficient training data [1, 2, 3, 5, 87].…”
Section: Building Multiscale Models (mentioning)
confidence: 99%
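To make the quoted caveat concrete, a toy sketch (not drawn from the cited work) fits a cheap linear surrogate to a more fine-grained model and shows that the two agree on typical inputs but diverge in a rare input regime; the data-generating process and model choices are illustrative assumptions.

```python
# Hedged illustration: a linear surrogate fitted to a fine-grained model's
# predictions tracks it on typical inputs yet misses a rare-regime effect.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 1))
# Mostly linear response, plus a large effect that only occurs in a rare regime (x > 2).
y = 2 * X[:, 0] + 5.0 * (X[:, 0] > 2) + 0.1 * rng.normal(size=5000)

fine = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
surrogate = LinearRegression().fit(X, fine.predict(X))  # cheap emulator of the fine model

# The surrogate tracks the fine model on typical inputs but misses the rare-regime jump.
typical = rng.uniform(-1.5, 1.5, size=(1000, 1))
rare = rng.uniform(2.2, 3.0, size=(1000, 1))
for name, Z in [("typical", typical), ("rare", rare)]:
    gap = np.mean(np.abs(surrogate.predict(Z) - fine.predict(Z)))
    print(f"{name:7s} mean |surrogate - fine| = {gap:.2f}")
```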