Post
Topic
Board Meta
Merits 2 from 2 users
Re: Plagiarism: the difference between "wаllеt" and "wallet"
by
Coin-1
on 19/03/2018, 07:03:58 UTC
⭐ Merited by LoyceV (1) ,Timelord2067 (1)
In ‘wallet’, the second character is a cyrillic ‘a’ and the fifth a cyrillic ‘ie’, encoded in Unicode. This kind of scam is known as a homograph attack. You can find all characters using normal search as long as you’re searching for those exact characters.
I can say about the russian alphabet. It has some cyrillic symbols which can be used in a homograph attack.

Lower case (6 identical symbols):
aбвгдeёжзийклмнoпpcтyфxцчшщъыьэюя
abcdefghijklmnopqrstuvwxyz

Upper case (11 identical symbols):
AБBГДEЁЖЗИЙКЛMHOПPCTУФXЦЧШЩЪЫЬЭЮЯ
ABCDEFGHIGKLMNOPQRSTUVWXYZ

Note that the cyrillic symbols are encoded as 2 bytes in UTF-8, therefore:
1) wallet = 6 unicode symbols = 6 bytes in UTF-8
2) wallet = 6 unicode symbols  = 8 bytes in UTF-8