Double Watermark for Large Language Models

NAGATSUKA Koichi, SOGAWA Yasuhiro

doi:10.11517/pjsai.jsai2024.0_4xin281

【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
Regarding the recording of “Research Data” and “Evidence Data”

Double Watermark for Large Language Models

DOI

NAGATSUKA Koichi

Hitachi, Ltd.
SOGAWA Yasuhiro

Hitachi, Ltd.

Bibliographic Information

Other Title

大規模言語モデルのための二重電子透かし

Description

<p>Detecting text generated by large language models (LLMs) with high accuracy is crucial for preventing the spread of fake news and misinformation caused by LLMs. Recently, digital watermark for auto-regressive language models has gained attention as a means of detecting text derived from LLMs. This approach embeds specific token patterns in text as a watermark by increasing token probabilities in a token group selected based on a single key. However, this approach cannot identify the source of text when the single key is leaked. To address this issue, we propose a double watermark which embeds two different watermarks with two corresponding keys in text so that the author of the text can be identified even after the first key is leaked. Our proposed method demonstrated the ability to detect a double watermark with high accuracy without significantly degrading the quality of the text.</p>

Journal

Proceedings of the Annual Conference of JSAI

Proceedings of the Annual Conference of JSAI JSAI2024 (0), 4Xin281-4Xin281, 2024

The Japanese Society for Artificial Intelligence

Keywords

Details 詳細情報について

CRID

1390018971042579840
DOI

10.11517/pjsai.jsai2024.0_4xin281
ISSN

27587347
Text Lang

ja
Data Source
- JaLC
Abstract License Flag
Disallowed

Double Watermark for Large Language Models

Bibliographic Information

Description

Journal

Keywords

Details 詳細情報について

Export

Report a problem