2021
DOI: 10.1609/aaai.v35i5.16555

Learning Accurate and Interpretable Decision Rule Sets from Neural Networks

Abstract: This paper proposes a new paradigm for learning a set of independent logical rules in disjunctive normal form as an interpretable model for classification. We consider the problem of learning an interpretable decision rule set as training a neural network in a specific, yet very simple two-layer architecture. Each neuron in the first layer directly maps to an interpretable if-then rule after training, and the output neuron in the second layer directly maps to a disjunction of the first layer rules to form the …
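The two-layer architecture described in the abstract can be sketched in a few lines. This is an illustrative toy, not the authors' implementation: it assumes binarized input features, hand-set binary weights, and hard AND/OR activations (the paper trains such a network; here the rules are fixed for clarity).

```python
import numpy as np

def rule_layer(x, W):
    # Neuron j fires iff every feature selected by W[j] is present in x:
    # a conjunction (AND), i.e. one if-then rule.
    return np.all(x[None, :] >= W, axis=1).astype(int)

def or_layer(r):
    # The output neuron fires if any rule fires: a disjunction of the
    # first-layer rules, so the model as a whole is a DNF rule set.
    return int(r.any())

# Two hand-set rules over 4 binary features (illustrative only):
#   rule 0: x0 AND x2        rule 1: x1 AND x3
W = np.array([[1, 0, 1, 0],
              [0, 1, 0, 1]])

x = np.array([1, 0, 1, 0])            # satisfies rule 0
print(or_layer(rule_layer(x, W)))     # 1 (positive class)
```

Each row of `W` reads off directly as an if-then rule, which is what makes the trained model interpretable.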


Cited by 17 publications (17 citation statements)
References 16 publications
“…where σ(·) denotes the Softmax function, s denotes the SIS vector (17), s′ is the normalized and scaled SIS with factor a, and s′_p is the pth element of s′ for feature variable b_p. In this step, m WCS IPs are created, each by sampling |P′^e_i| = ρ|P^e| (rounded, 0 < ρ < 1) feature variables according to the probability Prob_p (18), and |N_i| = n_wcs observations uniformly from the observation set N.…”
Section: Weighted Column Sampling Optimization
confidence: 99%
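The weighted column sampling step quoted above can be sketched as follows. This is a hedged reconstruction from the excerpt alone: the L2 normalization of the SIS vector, the function names, and the seed are assumptions, not the cited paper's code.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max()              # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def wcs_sample(sis, a, rho, n_wcs, n_obs):
    # Normalize and scale the SIS vector by factor a (normalization
    # scheme assumed here), then turn it into sampling probabilities.
    prob = softmax(a * sis / np.linalg.norm(sis))
    # Sample |P'^e_i| = rho * |P^e| (rounded) feature columns by Prob_p ...
    k = max(1, round(rho * len(sis)))
    cols = rng.choice(len(sis), size=k, replace=False, p=prob)
    # ... and |N_i| = n_wcs observations uniformly from N.
    rows = rng.choice(n_obs, size=n_wcs, replace=False)
    return cols, rows

cols, rows = wcs_sample(np.array([0.2, 1.5, 0.7, 3.0]),
                        a=2.0, rho=0.5, n_wcs=3, n_obs=10)
print(len(cols), len(rows))      # 2 3
```

Features with larger SIS values get proportionally higher Softmax probabilities, so each of the m sampled integer programs is biased toward the more important columns.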
“…Note that these varying implementations of the branch-and-bound solution procedure only affect the time efficiency of solving the models and do not change the solution values. The proposed SIS-based WCS optimization method samples feature variables according to the Softmax probabilities associated with their SIS values (18). As an alternative for comparison, the SIS-based feature sampling in WCS can be replaced with a uniform sampling approach, called random column sampling (RCS), which assigns an equal sampling probability to each feature variable.…”
Section: Branch-and-Bound Variants
confidence: 99%
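The RCS baseline described in this excerpt is simply the same column draw with the SIS-based weights replaced by equal probabilities. A minimal sketch (function name and seed are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

def rcs_sample(n_features, rho):
    # Same number of columns as WCS: rho * n_features, rounded.
    k = max(1, round(rho * n_features))
    # Omitting the p= argument gives uniform probabilities: every
    # feature variable is equally likely to be drawn (the RCS baseline).
    return rng.choice(n_features, size=k, replace=False)

print(sorted(rcs_sample(8, 0.25)))
```

Comparing WCS against RCS isolates the contribution of the SIS-based weighting, since everything else in the sampled subproblems stays the same.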
“…The optimization techniques in AMIE speed up the rule mining procedure by an order of magnitude but heavily sacrifice the expressiveness of Horn rules. Neural networks have also been adopted for learning logic rules in some recent work. 34 …”
Section: Related Work
confidence: 99%
“…Neural networks have also been adopted for learning logic rules in some recent work. 34 Association patterns are less expressive than Horn rules. For example, the patterns induced from LLC can be expressed as the following Horn rules:…”
Section: Association Pattern Mining and Logic Rule Mining
confidence: 99%