2022
DOI: 10.1101/2022.09.18.508433
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SHEPHARD: a modular and extensible software architecture for analyzing and annotating large protein datasets

Abstract: The emergence of high-throughput experiments and high-resolution computational predictions has led to an explosion in the quality and volume of protein sequence annotations at proteomic scales. Unfortunately, integrating and analyzing complex sequence annotations remains logistically challenging. Here we present SHEPHARD, a software package that makes large-scale integrative protein bioinformatics trivial. SHEPHARD is provided as a stand-alone package and with a pre-compiled set of human annotations in a Googl… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 44 publications
0
7
0
Order By: Relevance
“…The protein sequence analysis was conducted using SHEPHARD, a Python-based framework designed for integrating and analyzing large-scale amino acid sequence properties 124 . IDRs were predicted and annotated using metapredict (V2) 125 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The protein sequence analysis was conducted using SHEPHARD, a Python-based framework designed for integrating and analyzing large-scale amino acid sequence properties 124 . IDRs were predicted and annotated using metapredict (V2) 125 .…”
Section: Methodsmentioning
confidence: 99%
“…Protein abundance analyses (Fig. S7–11) were performed using SHEPHARD 124 and sparrow (https://github.com/idptools/sparrow). Mass spectrometry data were obtained for humans 128 , X. laevis 129 , A. thaliana 130 , E. coli 131 , S. pombe 132 , and S. cerevisiae 133 .…”
Section: Methodsmentioning
confidence: 99%
“…Proteome-wide bioinformatic analyses were performed using SPAR-ROW (https://github.com/idptools/sparrow) and SHEPHARD 96 . SPARROW is an in-development Python package for calculating IDR sequence properties and SHEPHARD is a hierarchical analysis framework for annotating and analyzing large sets of protein sequences.…”
Section: Bioinformaticsmentioning
confidence: 99%
“…Proteome-wide bioinformatics was performed using SPARROW (https://github.com/idptools/sparrow) and SHEPHARD 66 . SPARROW is an in-development Python package for calculating IDR sequence properties, while SHEPHARD is a hierarchical analysis framework for annotating and analyzing large sets of protein sequences.…”
Section: Bioinformaticsmentioning
confidence: 99%
“…4 and Fig. 5 are shared as SHEPHARD-compliant datafiles, and we encourage other groups to explore these predictions in the context of other protein annotations using SHEPHARD and the set of precomputed annotations provided therein 66 . All other data and code used for sequence analysis, training weights, bioinformatic data, the SPARROW implementation, and the Google Colab notebook are linked from this manuscript's main GitHub directory: https://github.com/holehouse-lab/supportingdata/tree/master/2023/ALBATROSS_2023…”
Section: Data and Code Availabilitymentioning
confidence: 99%