2022
DOI: 10.12688/f1000research.109080.2

Recommendations for the formatting of Variant Call Format (VCF) files to make plant genotyping data FAIR

Abstract: In this opinion article, we discuss the formatting of files from (plant) genotyping studies, in particular the formatting of metadata in Variant Call Format (VCF) files. The flexibility of the VCF format specification facilitates its use as a generic interchange format across domains but can lead to inconsistency between files in the presentation of metadata. To enable fully autonomous machine actionable data flow, generic elements need to be further specified. We strongly support the merits of the FAIR princi…

Cited by 4 publications (6 citation statements)
References 29 publications

“…These indeterminacies have made fully automated processing difficult or impossible without manual intervention in terms of interoperability at the level of bioinformatics toolchains. As DivBrowse continues to evolve, we look forward to incorporating recent advances and improvements related to metadata in VCF files to better take advantage of the FAIR data paradigm, such as improved interoperability and reusability [25].…”
Section: Discussion (mentioning)
confidence: 99%
“…For example, the usage of standardized sample-IDs for the genotypes based on BioSamples-IDs in the VCF metadata section would allow DivBrowse to easily read those BioSamples-IDs and use them to automatically interconnect the genotypes in the GUI of DivBrowse with other omics-related web-based information systems [26]. As another example, the appropriate standardized usage of the VCF metadata field "Contig", as recommended in Beier et al [25],…”
Section: Discussion (mentioning)
confidence: 99%
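
The citation statement above describes a tool reading standardized sample IDs and "contig" metadata directly from the VCF header. A minimal Python sketch of that kind of lookup follows. It is not taken from DivBrowse or from Beier et al. The function name `parse_structured_headers`, the `ext_id` key, and the accession `SAMEA0000000` are hypothetical placeholders chosen for illustration. Only the general `##key=<k=v,...>` shape of structured meta-information lines comes from the VCF specification.

```python
import re

# Structured meta-information lines look like: ##contig=<ID=chr1H,length=1000>
_META_LINE = re.compile(r"^##(?P<key>[^=]+)=<(?P<body>.*)>$")
# key=value pairs; a value is either a double-quoted string (which may contain
# commas) or a run of characters up to the next comma
_PAIR = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|[^,]*)')


def parse_structured_headers(lines, wanted=("contig", "SAMPLE")):
    """Collect the key/value fields of selected structured header lines.

    Hypothetical helper: returns {header_key: [field_dict, ...]}.
    """
    found = {key: [] for key in wanted}
    for line in lines:
        line = line.rstrip("\n")
        if not line.startswith("##"):
            break  # the meta-information block ends at the #CHROM line
        match = _META_LINE.match(line)
        if match and match.group("key") in found:
            fields = {
                k: v.strip('"') for k, v in _PAIR.findall(match.group("body"))
            }
            found[match.group("key")].append(fields)
    return found


# Illustrative header; the ext_id key and the accession are made-up placeholders.
header = [
    "##fileformat=VCFv4.3",
    '##contig=<ID=chr1H,length=1000,species="Hordeum vulgare">',
    '##SAMPLE=<ID=sample_A,ext_id="biosample:SAMEA0000000">',
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
]
result = parse_structured_headers(header)
```

With header fields filled in consistently, a downstream system could take the value collected under `SAMPLE` and link each genotype column to an external sample record without manual curation, which is the interoperability benefit the citing authors point to.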