In recent years a new type of tradable assets appeared, generically known as
cryptocurrencies. Among them, the most widespread is Bitcoin. Given its
novelty, this paper investigates some statistical properties of the Bitcoin
market. This study compares Bitcoin and standard currencies dynamics and
focuses on the analysis of returns at different time scales. We test the
presence of long memory in return time series from 2011 to 2017, using
transaction data from one Bitcoin platform. We compute the Hurst exponent by
means of the Detrended Fluctuation Analysis method, using a sliding window in
order to measure long range dependence. We detect that Hurst exponents changes
significantly during the first years of existence of Bitcoin, tending to
stabilize in recent times. Additionally, multiscale analysis shows a similar
behavior of the Hurst exponent, implying a self-similar process.Comment: 17 pages, 6 figures. arXiv admin note: text overlap with
arXiv:1605.0670
Addressing the huge amount of data continuously generated is an important challenge in the Machine Learning field. The need to adapt the traditional techniques or create new ones is evident. To do so, distributed technologies have to be used to deal with the significant scalability constraints due to the Big Data context. In many Big Data applications for classification, there are some classes that are highly underrepresented, leading to what is known as the im balanced classification problem. In this scenario, learning algorithms are often biased towards the majority classes, treating minority ones as out liers or noise. Consequently, preprocessing techniques to balance the class distribution were developed. This can be achieved by suppressing majority instances (undersampling) or by creating minority examples (oversampling). Re garding the oversampling methods, one of the most widespread is the SMOTE algorithm, which creates artificial examples according to the neighborhood of each minority class instance. In this work, our objective is to analyze the SMOTE behavior in Big Data as a function of some key aspects such as the oversampling degree, the neighborhood value and, specially, the type of distributed design (local vs. global).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.