2021
DOI: 10.1111/test.12243
|View full text |Cite
|
Sign up to set email alerts
|

Introductory data science across disciplines, using Python, case studies, and industry consulting projects

Abstract: Data and its applications are increasingly ubiquitous in the rapidly digitizing world and consequently, students across different disciplines face increasing demand to develop skills to answer both academia's and businesses' increasing need to collect, manage, evaluate, apply and extract knowledge from data and critically reflect upon the derived insights. On the basis of recent experiences at the University of Ttingen, Germany, we present a new approach to teach the relevant data science skills as an introduc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 12 publications
0
8
0
1
Order By: Relevance
“…Computational notebooks have become popular programming environments that facilitate collaboration, interactive development and reproducibility and are, in particular, valued for rapid prototyping, data exploration and training [18]. There are dozens of different notebook systems available [19], but the use of Jupyter certainly excels across many disciplines [20][21][22], including bio-and health-informatics [12,23,24], general data science [13,25,26] and, more recently, Earth Observation research [27,28]. In 2021, more than 10 million Jupyter notebooks were available on the code-sharing platform GitHub [17,29].…”
Section: Introductionmentioning
confidence: 99%
“…Computational notebooks have become popular programming environments that facilitate collaboration, interactive development and reproducibility and are, in particular, valued for rapid prototyping, data exploration and training [18]. There are dozens of different notebook systems available [19], but the use of Jupyter certainly excels across many disciplines [20][21][22], including bio-and health-informatics [12,23,24], general data science [13,25,26] and, more recently, Earth Observation research [27,28]. In 2021, more than 10 million Jupyter notebooks were available on the code-sharing platform GitHub [17,29].…”
Section: Introductionmentioning
confidence: 99%
“…Since our focus is on providing students with skills that are useful when they have to practice the profession, we decided to use the Python language as a complement to data analysis (Lasser et al, 2021). Thus, there was a need to export data from Excel to CSV.…”
Section: Methodsmentioning
confidence: 99%
“…The programming language is syntactically simple, sim ple to learn, and straightforward. Additionally, Python is open source, free to use and has rich ecosystem of libraries for scientific computing (Lasser et al 2021;Robinson, 2017;Koulouri et al 2014;Ateeq et al 2014;Jayal et al 2011;Perez et al 2010;Lutz, 2001). The data preprocessing steps taken can be summarized as follows:…”
Section: Data Pre-processing Stage Imentioning
confidence: 99%