我正在寻找一个包含ASCII字符和外观相同的UTF8字符的表格。我知道它也取决于字体是否相同,但是从通用的角度出发,这已经足够了。
>>> # PY3 code:
>>> a='H' # ascii
>>> b='Н' # utf8
>>> a==b
False
>>> ' '.join(format(ord(x), 'b') for x in a)
'1001000'
>>> ' '.join(format(ord(x), 'b') for x in b)
'10000011101'
>>> a='P' # ascii
>>> b='Ρ' # utf8
>>> a==b
False
>>> ' '.join(format(ord(x), 'b') for x in a)
'1010000'
>>> ' '.join(format(ord(x), 'b') for x in b)
'1110100001'
for x in a
迭代Unicode代码点,而不是UTF-8代码单元。(==
比较的是UTF-8代码单元的序列。) - Tom Blodget