2018
DOI: 10.2139/ssrn.3185342
|View full text |Cite
|
Sign up to set email alerts
|

Meta-Modeling of Data Sources and Ingestion Big Data Layers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(9 citation statements)
references
References 4 publications
0
9
0
Order By: Relevance
“…We have proposed a BD architecture enabling an efficient RS processing. This architecture is composed of six layers which are: the data sources, the ingestion layer, the Hadoop storage, the processing and management layer, and finally, the visualization layer [26]. In this paper, we focus our clarifications on the ingestion layer.…”
Section: The Data Ingestionmentioning
confidence: 99%
“…We have proposed a BD architecture enabling an efficient RS processing. This architecture is composed of six layers which are: the data sources, the ingestion layer, the Hadoop storage, the processing and management layer, and finally, the visualization layer [26]. In this paper, we focus our clarifications on the ingestion layer.…”
Section: The Data Ingestionmentioning
confidence: 99%
“…There are two main data sources internal and external, internal sources which are controlled by the organizations and included data about daily operations of the company that collected and stored in databases, in this case we are discussing about structured data, external sources refer to all the data that retrieved from external sources that are not controlled by the organization. (Bucur, C., 2015) (Erraissi, A., Belangour, A., & Tragha, A., 2018) Big data has different data sources, social media is the most important source, Twitter and Facebook generate very large amount of data such as tweets, profiles and likes, this data can be analyzed and provide important value, for example analysis of social media data that related to new product can provide better understanding about customer satisfaction. Log files are another source of data, for example clicks on specific website can be logged into web log files, and these logs can be analyzed to understand the online user's behavior.…”
Section: Data Sourcementioning
confidence: 99%
“…Geospatial data that generated by cell phones is another source of data that can be used by another application. 0 There are three types of data: (Erraissi, A., Belangour, A., & Tragha, A., 2018) • Structured data: it refers to the data that has fixed format and stored into rows and columns, such as data that stored into relational databases.…”
Section: Data Sourcementioning
confidence: 99%
See 1 more Smart Citation
“…This triplestore uses the Jena Framework for querying RDF data. HBase is a distributed, column-oriented, scalable and fault-tolerant NoSQL [19] database where workload in terms of memory and computation (CPU) as well as storage is distributed on all machines in the HBase cluster. This NoSQL system is inspired by the BigTable's work [12], led by Google.…”
Section: Jena-hbasementioning
confidence: 99%