“…First, many works have proposed stochastic estimators [3,40,39,27,10,44] that rely on sampling either feature subsets or permutations; these are often consistent estimators, but they require many model evaluations and involve a trade-off between run-time and accuracy. Second, some works have proposed model-specific approximations, e.g., for trees [26] or neural networks [35,6,2,43]; these are generally faster, but they sometimes require many model evaluations, often induce bias, and typically lack flexibility regarding how to handle held-out features when generating explanations-a subject of continued debate in the field [1,19,9,14].…”