2021
DOI: 10.31234/osf.io/w8trm
Preprint

Managing, storing, and sharing long-form recordings and their annotations

Abstract: The technique of long-form recordings via wearables is gaining momentum in different fields of research, notably linguistics and pathology. This technique, however, poses several technical challenges, some of which are amplified by the peculiarities of the data, including their sensitivity and their volume. In this paper, we begin by outlining key problems related to the management, storage, and sharing of the corpora that emerge when using this technique. We continue by proposing a multi-component solution …

Citations: cited by 6 publications (4 citation statements)
References: 29 publications
“…Corpus organization, processing, and preliminary analyses were done with ChildProject (Gautheron, Rochat, & Cristia, 2022). Additionally, transparency was ensured by publicly posting all of our materials (https://osf.io/t8r5j/?view_only=4e6f8a3b37f84da681b414bc058deca4, Anonymized, 2022), including code to reproduce results thanks to RMarkdown (Baumer & Udwin, 2015) on R (R Consortium Team, 2013), as well as DataLad (Wagner et al, 2020) and GIN (https://gin.g-node.org/).…”
Section: Analyses (citation type: mentioning)
confidence: 99%
“…DataLad can be used as an independent tool to access and manage data (see e.g. Wittkuhn & Schuck (2021), Gautheron et al (2021), Gautheron (2021)) or as a core technology behind another tool or a larger platform (e.g. Far et al (2021)).…”
Section: External Uses and Integrations (citation type: mentioning)
confidence: 99%
“…Cychosz et al (2021) created a set of Python scripts for efficiently sampling and annotating daylong audio recordings for various features of natural linguistic input (e.g., addressee, language); these scripts are described in detail in the manuscript and freely shared on GitHub (https://github.com/megseekosh/Categorize_app_v2). In addition to these examples, a variety of guidelines, scripts and packages have been created and shared by researchers to facilitate the collection, coding and analysis of naturalistic input (e.g., Anderson et al, 2021; Casillas & Scaff, 2021; Gautheron et al, 2021; Manning et al, 2020; Sanchez et al, 2019; Woon et al, 2021). All of these products serve as excellent examples of how to increase the openness and transparency of descriptive work, but they are also important resources that researchers can implement themselves when coding naturalistic behaviour.…”
Section: Open Coding Procedures and Materials (citation type: mentioning)
confidence: 99%