Does someone have a table of these characters? I can automatically convert non-standard characters to ASCII.
From my other thread:
I can say about the russian alphabet. It has some cyrillic symbols which can be used in a homograph attack.
Lower case (6 identical symbols):
aбвгдeёжзийклмнoпpcтyфxцчшщъыьэюя
abcdefghijklmnopqrstuvwxyz
Upper case (11 identical symbols):
AБBГДEЁЖЗИЙКЛMHOПPCTУФXЦЧШЩЪЫЬЭЮЯ
ABCDEFGHIGKLMNOPQRSTUVWXYZ
Note that the cyrillic symbols are encoded as 2 bytes in UTF-8, therefore:
1) wallet = 6 unicode symbols = 6 bytes in UTF-8
2) wallet = 6 unicode symbols = 8 bytes in UTF-8