Software engineer exploits Unicode quirks to conceal information within emoji characters


The concept revolves around Unicode's system of representing text as a sequence of codepoints, with each codepoint being a number assigned meaning by the Unicode Consortium. While...

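In a nutshell: A recent blog post by software engineer Paul Butler sheds light on a technique for concealing data within Unicode characters, including emoji. Unicode defines 256 variation selectors, codepoints that modify how the preceding character is displayed and normally have no visible appearance of their own, so each selector can stand for one byte value. Taking this concept further, by concatenating multiple variation selectors, one can represent any arbitrary byte string, effectively attaching an arbitrary amount of data to a single visible character. One possible abuse is watermarking: a sender could embed a different hidden payload in the copy sent to each recipient and, if the text leaked, trace it back to a specific recipient, raising serious privacy and whistleblower protection concerns.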
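
The encoding can be illustrated with a minimal Python sketch. It assumes the 256 variation selectors are mapped to byte values in codepoint order (U+FE00 to U+FE0F for bytes 0 to 15, U+E0100 to U+E01EF for bytes 16 to 255). The function names `hide` and `reveal` are hypothetical, chosen for this illustration, and the sketch is not presented as the exact code from the blog post.

```python
from typing import Optional


def byte_to_selector(b: int) -> str:
    """Map one byte value (0-255) to a variation selector codepoint."""
    if not 0 <= b <= 255:
        raise ValueError("byte value out of range")
    if b < 16:
        return chr(0xFE00 + b)          # VS1..VS16
    return chr(0xE0100 + (b - 16))      # VS17..VS256


def selector_to_byte(ch: str) -> Optional[int]:
    """Inverse mapping; returns None for any non-selector character."""
    cp = ord(ch)
    if 0xFE00 <= cp <= 0xFE0F:
        return cp - 0xFE00
    if 0xE0100 <= cp <= 0xE01EF:
        return cp - 0xE0100 + 16
    return None


def hide(carrier: str, payload: bytes) -> str:
    """Append one variation selector per payload byte to the carrier."""
    return carrier + "".join(byte_to_selector(b) for b in payload)


def reveal(text: str) -> bytes:
    """Collect every variation selector in the text and decode it.

    Assumption: the carrier contains no variation selectors of its own
    (some emoji sequences do include U+FE0F, which would decode as 15).
    """
    decoded = (selector_to_byte(ch) for ch in text)
    return bytes(b for b in decoded if b is not None)


if __name__ == "__main__":
    stamped = hide("\U0001F60A", b"hello")   # smiling-face emoji as carrier
    print(len(stamped))                      # codepoint count: 1 carrier + 5 selectors
    print(reveal(stamped))
```

In a watermarking scenario, the payload would be a short recipient identifier, so that each copy of a message looks the same on screen while carrying a different hidden byte string. Whether the selectors survive copy and paste depends on the application handling the text.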
