How do I replace accented characters?

Question

python python-2.7 non-ascii-characters

25

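My output looks like 'àéêöhello!'. I need to change it to 'aeeohello', that is, replace each accented character with its plain counterpart (à becomes a, and so on).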

- Ganesh Basuvaraj

3

Possible duplicate of What is the best way to remove accents in a Python unicode string? - hestellezg

3 Answers

13

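# -*- coding: utf-8 -*-
# Python 2.7; requires the third-party unidecode package
import unidecode

somestring = "àéêöhello"

# decode the UTF-8 byte string into a unicode object
u = unicode(somestring, "utf-8")
# transliterate the unicode text to its closest ASCII equivalent
print unidecode.unidecode(u)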

Output:

aeeohello

- Alpesh Valaki

6

Alpesh Valaki's answer is the best, but I had to make a few adjustments to get it working on Python 3:

# I changed the import so the function can be called directly
from unidecode import unidecode

somestring = "àéêöhello"

# In Python 3, str is already Unicode, so the unicode(...) decoding step is dropped.
# Note that unidecode() does not take an encoding argument.
print(unidecode(somestring))

- Samuel Lacombe

This worked well for me when converting something like 'Đà Nẵng' to 'Da Nang'. Thanks! - g0h

- Eswara Moorthy · Accepted Answer

Please use the following code:

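import unicodedata

def strip_accents(text):
    try:
        # Python 2: decode a UTF-8 byte string into a unicode object
        text = unicode(text, 'utf-8')
    except NameError:  # Python 3: str is already Unicode and unicode() does not exist
        pass

    # NFD splits each accented letter into a base letter plus combining marks;
    # encoding to ASCII with 'ignore' then drops everything that is not ASCII
    text = unicodedata.normalize('NFD', text)\
           .encode('ascii', 'ignore')\
           .decode("utf-8")

    return str(text)

s = strip_accents('àéêöhello')

print(s)  # aeeohello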
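
One thing to be aware of with the NFD plus encode('ascii', 'ignore') approach: characters that have no decomposition, such as 'Đ', are dropped entirely instead of being mapped to a base letter. The following is a minimal Python 3 sketch of a variant that removes only the combining marks, using only the standard library. The function names strip_accents_keep_base and strip_accents_ascii are made up for this illustration and do not come from the answers above.

```python
import unicodedata

def strip_accents_keep_base(text):
    """Hypothetical helper: remove combining marks, keep everything else."""
    # NFD splits 'à' into 'a' followed by U+0300 (combining grave accent)
    decomposed = unicodedata.normalize('NFD', text)
    # unicodedata.combining() returns 0 for base characters and a
    # non-zero combining class for combining marks
    stripped = ''.join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Recompose whatever is still composable after the marks are gone
    return unicodedata.normalize('NFC', stripped)

def strip_accents_ascii(text):
    """Hypothetical helper: the NFD + ASCII-ignore approach, for comparison."""
    return unicodedata.normalize('NFD', text).encode('ascii', 'ignore').decode('ascii')

print(strip_accents_keep_base('àéêöhello'))  # aeeohello
print(strip_accents_ascii('Đà Nẵng'))        # a Nang  ('Đ' is lost)
print(strip_accents_keep_base('Đà Nẵng'))    # Đa Nang ('Đ' is kept as-is)
```

Neither variant turns 'Đ' into 'D'. If that kind of transliteration is needed, the unidecode package shown in the other answers is probably the simpler choice.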