2024
DOI: 10.1101/2024.05.14.594226
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ProteinCLIP: enhancing protein language models with natural language

Kevin E. Wu,
Howard Chang,
James Zou

Abstract: Language models have enabled a new era of biological sequence modeling. However, extracting meaningful sequence-level embeddings from these models remains challenging. In this work, we introduce ProteinCLIP, which applies contrastive learning between a protein's amino acid sequence and curated text describing its function. ProteinCLIP thus learns to take a pre-trained protein language model's sequence embedding and refines it produce a function-centric embedding. We show that this embedding space yields sequen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 81 publications
(131 reference statements)
0
0
0
Order By: Relevance