Python Unicode

From OLPC


Current revision as of 13:10, 5 June 2007


Python has good Unicode support, but it is not always easy to use. Some things to note:

  • Test your application with real Unicode text, meaning text that cannot be encoded as ASCII. Plain ASCII input (a-z, no accents) hides many bugs.
  • Do not confuse 8-bit strings (binary data, type "str") with text (Unicode data, type "unicode"). One can be substituted for the other without any visible problem until non-ASCII text appears, at which point you get a UnicodeEncodeError or UnicodeDecodeError.
  • The codecs module has helpers for reading Unicode from files.
  • You generally cannot write Unicode to a file or the console without first setting up an encoder, either with codecs or with another wrapper.
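
The points above can be illustrated with a small sketch. It is one possible approach, not the only one. The sample text, the temporary file name, and the choice of UTF-8 are assumptions made for illustration. The code is written to run on both Python 2 and Python 3; on Python 3 the two types are called "bytes" and "str" rather than "str" and "unicode".

```python
import codecs
import os
import tempfile

# Sample text with non-ASCII characters (an accented letter and two CJK
# characters). The u"" prefix marks it as text rather than 8-bit data.
text = u"caf\u00e9 \u4f60\u597d"

# Encoding turns text into 8-bit data; decoding turns it back into text.
data = text.encode("utf-8")
decoded = data.decode("utf-8")

# ASCII cannot represent this text, so the mistake that plain a-z input
# would hide shows up here as a UnicodeEncodeError.
try:
    text.encode("ascii")
    ascii_encode_failed = False
except UnicodeEncodeError:
    ascii_encode_failed = True

# The reverse mistake: treating UTF-8 bytes as if they were ASCII.
try:
    data.decode("ascii")
    ascii_decode_failed = False
except UnicodeDecodeError:
    ascii_decode_failed = True

# Writing text to a file: wrap a binary file in a codecs writer so that
# the text is encoded on the way out. The file name is hypothetical.
path = os.path.join(tempfile.mkdtemp(), "sample.txt")
with open(path, "wb") as raw:
    writer = codecs.getwriter("utf-8")(raw)
    writer.write(text)

# Reading it back: wrap the binary file in a codecs reader so that the
# bytes are decoded into text on the way in.
with open(path, "rb") as raw:
    reader = codecs.getreader("utf-8")(raw)
    round_trip = reader.read()
```

Here round_trip should compare equal to text, and both ASCII conversions should fail. Replacing the sample with a plain ASCII string makes both errors disappear, which is why ASCII-only test data is not enough.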

Resources

Some resources to learn about Unicode:
