SyntaxError of Non-ASCII character [duplicate]

Each Answer to this Q is separated by one/two green lines.

I am trying to parse xml which contains the some non ASCII cheracter,

the code looks like below

from lxml import etree
from lxml import objectify
content = u'<?xml version="1.0" encoding="utf-8"?><div>Order date                            : 05/08/2013 12:24:28</div>'
mail.replace('\xa0',' ')
xml = etree.fromstring(mail)

but it shows me error on the line ‘content = …’

syntaxError: Non-ASCII character '\xc2' in file /home/projects/ztest/ on line 3, 
but no encoding declared; see for details

in the terminal it’s working but while running on the eclipse IDE it’s giving me a error.

Don’t know how to overcome..

You should define source code encoding, add this to the top of your script:

# -*- coding: utf-8 -*-

The reason why it works differently in console and in the IDE is, likely, because of different default encodings set. You can check it by running:

import sys
print sys.getdefaultencoding()

Also see:

The answers/resolutions are collected from stackoverflow, are licensed under cc by-sa 2.5 , cc by-sa 3.0 and cc by-sa 4.0 .

Leave a Reply

Your email address will not be published.