3.7. Character sets and Unicode
Character sets are a little bit nasty. The reason for this is that we are working with three (variable) charsets in the Python bindings:
The nice thing about this is that when you parse commandline arguments or when you are printing to the terminal, you never have to do any charset conversions. The drawback is that if you know that you are receiving, say, UTF-8 from some other library (eg. an XML reader), then you can do any of two things: