Google offers its AI watermarking tech as free open source toolkit
SynthID provides a hidden way to mark LLM output as artificial.
While watermarks can be detected in responses as short as three sentences, the process "works best with longer texts," Google acknowledges in the paper, since having more words to score provides "more statistical certainty when making a decision." Google also notes that this kind of watermarking works best when there is a lot of "entropy" in the LLM distribution, meaning multiple valid candidates for each token (e.g., "my favorite tropical fruit is [mango, lychee, papaya, durian]"). To test how any potential watermark distortions might affect the perceived quality and utility of LLM outputs, Google routed "a random fraction" of Gemini queries through the SynthID system and compared them to unwatermarked counterparts.
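The two statistical points above (longer texts give more certainty, and high-entropy token distributions leave more room for a watermark) can be illustrated with a toy sketch. This is not SynthID's actual scoring method. It assumes a hypothetical detector that assigns each token a pseudorandom score in [0, 1] that is uniform for unwatermarked text and skewed upward for watermarked text; the function names `shannon_entropy` and `detection_z_score` are invented for illustration.

```python
import math


def shannon_entropy(probs):
    """Entropy (in bits) of a next-token probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def detection_z_score(scores):
    """z-score of the mean per-token score against the unwatermarked null.

    Assumption (hypothetical detector): unwatermarked tokens score
    uniformly on [0, 1], so the null mean is 0.5 and the null variance
    is 1/12. A watermark nudges the scores upward.
    """
    n = len(scores)
    mean = sum(scores) / n
    std_error = math.sqrt(1.0 / 12.0) / math.sqrt(n)
    return (mean - 0.5) / std_error


# Four equally plausible fruits: plenty of entropy to hide a signal in.
high_entropy = shannon_entropy([0.25, 0.25, 0.25, 0.25])
# One near-certain token: almost no freedom to bias the choice.
low_entropy = shannon_entropy([0.97, 0.01, 0.01, 0.01])

# Same per-token bias (mean score 0.6), different text lengths.
z_short = detection_z_score([0.6] * 30)
z_long = detection_z_score([0.6] * 300)

print(f"entropy: high={high_entropy:.2f} bits, low={low_entropy:.2f} bits")
print(f"z-score: 30 tokens={z_short:.2f}, 300 tokens={z_long:.2f}")
```

Under these assumptions the z-score grows with the square root of the token count, so the same per-token bias that is ambiguous in a short reply becomes a clear signal in a longer one.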