2021
DOI: 10.1002/cpe.6304
|View full text |Cite
|
Sign up to set email alerts
|

Efficient computation of positional population counts using SIMD instructions

Abstract: In several fields such as statistics, machine learning, and bioinformatics, categorical variables are frequently represented as one-hot encoded vectors. For example, given 8 distinct values, we map each value to a byte where only a single bit has been set.We are motivated to quickly compute statistics over such encodings.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…The data is a dummy, and the data format is the text that resembles transactions and will be created using an automated random algorithm. 6) Software development: the development will use a multi-threaded model adopted from computer architecture categories [17]- [20], namely Flynn taxonomy. Since the model is at the process level, the term process will be used instead of instruction.…”
Section: Methodsmentioning
confidence: 99%
“…The data is a dummy, and the data format is the text that resembles transactions and will be created using an automated random algorithm. 6) Software development: the development will use a multi-threaded model adopted from computer architecture categories [17]- [20], namely Flynn taxonomy. Since the model is at the process level, the term process will be used instead of instruction.…”
Section: Methodsmentioning
confidence: 99%