Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.409
|View full text |Cite
|
Sign up to set email alerts
|

Fingerprinting Fine-tuned Language Models in the Wild

Abstract: There are concerns that the ability of language models (LMs) to generate high quality synthetic text can be misused to launch spam, disinformation, or propaganda. Therefore, the research community is actively working on developing approaches to detect whether a given text is organic or synthetic. While this is a useful first step, it is important to be able to further fingerprint the author LM to attribute its origin. Prior work on fingerprinting LMs is limited to attributing synthetic text generated by a hand… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 26 publications
0
1
0
Order By: Relevance
“…Another potential solution is embedding a watermark within the LLMs which could be detected through automatic screening processes but would remain invisible to the reader (Diwan et al, 2021). In the submission process, similar to how some journals scan text for plagiarism, the submissions would be scanned for the presence of this watermark, allowing the identification of the origin LLM and the ratio of machine-generated content.…”
Section: Challenges Of Trust But Verifymentioning
confidence: 99%
“…Another potential solution is embedding a watermark within the LLMs which could be detected through automatic screening processes but would remain invisible to the reader (Diwan et al, 2021). In the submission process, similar to how some journals scan text for plagiarism, the submissions would be scanned for the presence of this watermark, allowing the identification of the origin LLM and the ratio of machine-generated content.…”
Section: Challenges Of Trust But Verifymentioning
confidence: 99%