Proceedings of the 30th ACM International Conference on Information &Amp; Knowledge Management 2021
DOI: 10.1145/3459637.3482015
|View full text |Cite
|
Sign up to set email alerts
|

QuaPy: A Python-Based Framework for Quantification

Abstract: QuaPy is an open-source framework for performing quantification (a.k.a. supervised prevalence estimation), written in Python. Quantification is the task of training quantifiers via supervised learning, where a quantifier is a predictor that estimates the relative frequencies (a.k.a. prevalence values) of the classes of interest in a sample of unlabelled data. While quantification can be trivially performed by applying a standard classifier to each unlabelled data item and counting how many data items have been… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 26 publications
3
5
0
Order By: Relevance
“…SLD is the strongest baseline; this is true in all four subtasks, in which SLD, while never being the best performer, is always in the top ranks. This confirms the fact (already recorded in previous work -see e.g., [34][35][36]) that SLD is a very strong performer when the APP is used for generating the dataset, i.e., when the test data contain many samples characterized by substantial distribution shift.…”
Section: Resultssupporting
confidence: 88%
See 2 more Smart Citations
“…SLD is the strongest baseline; this is true in all four subtasks, in which SLD, while never being the best performer, is always in the top ranks. This confirms the fact (already recorded in previous work -see e.g., [34][35][36]) that SLD is a very strong performer when the APP is used for generating the dataset, i.e., when the test data contain many samples characterized by substantial distribution shift.…”
Section: Resultssupporting
confidence: 88%
“…CC and PCC obtain very low quantification accuracy; this is the case in all four subtasks, where these two methods are always near the bottom of the ranking. This confirms the fact (already recorded in previous worksee e.g., [34][35][36]) that they are not good performers when the APP is used for generating the dataset, i.e., they are not good performers when there is substantial distribution shift. Interestingly enough, CC always outperforms PCC, which was somehow unexpected.…”
Section: Resultssupporting
confidence: 86%
See 1 more Smart Citation
“…In the first such talk, Alejandro Moreo presented (joint work with Andrea Esuli and Fabrizio Sebastiani) QuaPy (https://github.com/HLT-ISTI/QuaPy), an open-source Python-based software library for LQ. QuaPy (which was also the object of a presentation in the main CIKM 2021 conference -see [7]) provides implementations of both baseline and advanced LQ methods, of routines for LQoriented model selection, of several broadly accepted evaluation measures, and of robust evaluation protocols routinely used in the field. QuaPy also makes available datasets commonly used for testing quantifiers, and offers visualization tools for facilitating the analysis and interpretation of the results.…”
Section: The Workhopmentioning
confidence: 99%
“…We have recently developed (and made publicly available) QuaPy, an opensource, Python-based framework that implements several learning methods, evaluation measures, parameter optimisation routines, and evaluation protocols, for LQ [16]. 5 Among other things, QuaPy contains implementations of the baseline methods and evaluation measures officially adopted in LeQua 2022.…”
Section: Baselinesmentioning
confidence: 99%