This paper introduces an extended set of Haar-like features beyond the standard vertically and horizontally aligned Haar-like features [Viola and Jones, 2001a; 2001b] and the 45° twisted Haar-like features [Lienhart and Maydt, 2002; Lienhart et al., 2003a; 2003b]. The extended rotated Haar-like features are based on the standard Haar-like features rotated through whole-pixel, integer-based rotations. These rotated feature values can also be calculated using rotated integral images, which means they can be computed quickly and efficiently with just 8 operations irrespective of the feature size. In general, each feature requires a further 8 operations on an identity integral image so that appropriate scaling corrections can be applied. These scaling corrections are needed because of the rounding errors associated with scaling the features. The errors introduced by these rotated features on natural images are small enough to allow rotated classifiers to be implemented using a classifier trained only on vertically aligned images. This is a significant improvement in training time for a classifier that is invariant to the rotations represented in the parallel classifier.

Figure 1. Standard Haar-like features.
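The constant-cost claim above rests on the integral-image trick: the sum of any rectangle can be read off with four lookups, so a two-rectangle Haar-like feature needs eight. The sketch below (Python/NumPy, with illustrative function names) shows only the standard upright integral image; the rotated integral images described in the paper follow the same principle but are not reproduced here.

```python
import numpy as np

def integral_image(img):
    """Integral image with a zero border: ii[y, x] = sum of img[:y, :x]."""
    ii = np.zeros((img.shape[0] + 1, img.shape[1] + 1), dtype=np.int64)
    ii[1:, 1:] = np.cumsum(np.cumsum(img.astype(np.int64), axis=0), axis=1)
    return ii

def rect_sum(ii, x, y, w, h):
    """Sum of the w-by-h rectangle with top-left corner (x, y):
    four lookups regardless of rectangle size."""
    return ii[y + h, x + w] - ii[y, x + w] - ii[y + h, x] + ii[y, x]

def two_rect_feature(ii, x, y, w, h):
    """A simple horizontal two-rectangle Haar-like feature (left half minus
    right half): two rectangle sums, i.e. 8 lookups in total."""
    left = rect_sum(ii, x, y, w // 2, h)
    right = rect_sum(ii, x + w // 2, y, w - w // 2, h)
    return left - right
```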
Big Data analytics for storing, processing, and analyzing large-scale datasets has become an essential tool for industry. Distributed computing frameworks such as Hadoop and Spark offer efficient solutions for analyzing vast amounts of data. Owing to its application programming interface (API) availability and its performance, Spark has become very popular, even more popular than the MapReduce framework. Both frameworks have more than 150 configuration parameters, and the combination of these parameters has a massive impact on cluster performance. The default parameters let system administrators deploy their applications without much effort and measure the performance of their specific cluster with factory-set values. However, an open question remains: can a different parameter selection improve cluster performance for large datasets? To address this, this study investigates the most influential parameters, covering resource utilization, input splits, and shuffle, to compare the performance of Hadoop and Spark on a cluster implemented in our laboratory. We tuned these parameters using a trial-and-error approach based on a large number of experiments. For the comparative analysis, we selected two workloads: WordCount and TeraSort. Performance is evaluated against three criteria: execution time, throughput, and speedup. Our experimental results reveal that the performance of both systems depends heavily on input data size and correct parameter selection. The analysis shows that Spark outperforms Hadoop when datasets are small, achieving up to a 2x speedup in WordCount workloads and up to a 14x speedup in TeraSort workloads when the default parameter values are reconfigured.
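As a concrete illustration of the kind of parameter reconfiguration discussed above, the sketch below sets a handful of commonly tuned Spark parameters and runs a WordCount job with PySpark. The parameter values and the HDFS paths are placeholders for illustration, not the tuned settings or data used in the study.

```python
from pyspark import SparkConf, SparkContext

# Illustrative reconfiguration of a few commonly tuned parameters
# (resource utilization and shuffle behaviour). The values below are
# placeholders, not the tuned settings from the study.
conf = (
    SparkConf()
    .setAppName("wordcount-tuning-sketch")
    .set("spark.executor.memory", "4g")        # resource utilization
    .set("spark.executor.cores", "2")          # resource utilization
    .set("spark.default.parallelism", "64")    # number of shuffle/reduce tasks
    .set("spark.shuffle.compress", "true")     # shuffle behaviour
)
sc = SparkContext(conf=conf)

# WordCount: one of the two benchmark workloads named in the abstract.
counts = (
    sc.textFile("hdfs:///data/input")          # hypothetical input path
    .flatMap(lambda line: line.split())
    .map(lambda word: (word, 1))
    .reduceByKey(lambda a, b: a + b)
)
counts.saveAsTextFile("hdfs:///data/wordcount-output")
```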
We propose an adaptive learning algorithm for cascades of boosted ensembles that is designed to handle the problem of concept drift in nonstationary environments. The goal was to create a real-time adaptive algorithm for dynamic environments that exhibit varying degrees of drift in high-volume streaming data. We achieved this using a hybrid of detect-and-retrain and constant-update approaches. The uniqueness of our method lies in two aspects of our framework. The first is the manner in which individual weak classifiers within each cascade layer of an ensemble are clustered during training and assigned a competence value. The second is the learning of optimal cascade-layer thresholds at runtime, which enables rapid adaptation to dynamic environments. The proposed adaptive learning method was applied to a binary-class problem with rare-event detection characteristics. For this, we chose the domain of face detection and experimentally demonstrate the ability of our algorithm to achieve an effective trade-off between accuracy and speed of adaptation in dense data streams with unknown rates of change.
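The following sketch illustrates one way runtime threshold adaptation in a boosted cascade could look under a constant-update scheme. It is a simplified stand-in, not the authors' algorithm: the clustering of weak classifiers and their competence values are omitted, and the update rule is illustrative only.

```python
import numpy as np

class AdaptiveCascadeSketch:
    """Minimal sketch: a boosted cascade whose per-layer rejection thresholds
    drift toward values estimated from recent streamed examples. It shows
    runtime threshold adaptation only; the weak-classifier clustering and
    competence values described in the abstract are not modelled here."""

    def __init__(self, layers, thresholds, lr=0.05):
        self.layers = layers            # each layer: callable x -> ensemble score
        self.thresholds = list(thresholds)
        self.lr = lr                    # adaptation rate (constant-update style)

    def classify(self, x):
        """Standard cascade evaluation: reject at the first failing layer."""
        for layer, theta in zip(self.layers, self.thresholds):
            if layer(x) < theta:
                return False
        return True

    def update(self, x, is_positive):
        """Nudge each layer threshold so that labelled positives from the
        stream keep passing while negatives keep being rejected early."""
        for i, layer in enumerate(self.layers):
            score = layer(x)
            target = score - 1e-3 if is_positive else score + 1e-3
            self.thresholds[i] += self.lr * (target - self.thresholds[i])
            if is_positive and score < self.thresholds[i]:
                break  # this positive is still rejected here; later layers
                       # would not see it during cascade evaluation
```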