Nowadays, more and more information is flowing in and is provided on the Web. Large datasets are made available covering many fields and sectors. Open Data (OD) plays an important role in this field. Thanks to the volumes and the variety of the released datasets, OD brings high societal and business potential. In order to realize this potential, the reuse of the datasets (e.g. in internal business processes) becomes primordial. However, if the aim is to reuse OD, it is also necessary to be able of assessing its quality. This paper demonstrates how Information Visualization may help on this task and presents Stacktab chart-a new chart to analyse and assess CSV files in order to understand their structure, identify the location of relevant information and detect possible problems in the datasets.
This short paper focuses on the application of cloud computing principles and solutions to the domain of data integration. After an introduction to the topic, data integration is shortly discussed, and some quality criteria for data integration solutions, including infrastructure and the organizational context, are presented. Afterwards, cloud computing and possible cloud-based data integration scenarios are discussed. The beforementioned quality criteria are revisited especially relative to public cloud deployment scenarios. Finally, a design study for the examination of cloud-based data integration that focuses on open data integration for an environmental data management application is proposed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.