Hello,
I'm using Python 2.3.4 and I noticed that, when stdout is a terminal, the
'print' statement converts Unicode strings into the encoding defined by
the locales instead of the one returned by sys.getdefaultencoding().
However, I can't find any references to it. Anyone knows where it's
descrbed?
Example:
!/usr/bin/env python
# -*- coding: utf-8 -*-
import sys, locale
print 'Python encoding:', sys.getdefaultencoding()
print 'System encoding:', locale.getpreferredencoding()
print 'Test string: ', u'Olá mundo'
If stdout is a terminal, works fine
$ python x.py
Python encoding: ascii
System encoding: UTF-8
Test string: Olá mundo
If I redirect the output to a file, raises an UnicodeEncodeError exception
$ python x.py > x.txt
Traceback (most recent call last):
File "x.py", line 8, in ?
print 'Test string: ', u'Olá mundo'
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe1' in position 2: ordinal not in range(128)
--
Ricardo 4 2200
Ricardo Bugalho wrote: Hello, I'm using Python 2.3.4 and I noticed that, when stdout is a
terminal, the 'print' statement converts Unicode strings into the encoding defined by the locales instead of the one returned by sys.getdefaultencoding().
Sure. It uses the encoding of you console. Here is explanation why it
uses locale to get the encoding of console: http://www.python.org/moin/PrintFails
However, I can't find any references to it. Anyone knows where it's descrbed?
I've just wrote about it here: http://www.python.org/moin/DefaultEncoding Example:
!/usr/bin/env python # -*- coding: utf-8 -*-
import sys, locale
print 'Python encoding:', sys.getdefaultencoding() print 'System encoding:', locale.getpreferredencoding() print 'Test string: ', u'Olá mundo'
If stdout is a terminal, works fine $ python x.py Python encoding: ascii System encoding: UTF-8 Test string: Olá mundo
If I redirect the output to a file, raises an UnicodeEncodeError
exception $ python x.py > x.txt Traceback (most recent call last): File "x.py", line 8, in ? print 'Test string: ', u'Olá mundo' UnicodeEncodeError: 'ascii' codec can't encode character u'\xe1' in
position 2: ordinal not in range(128) http://www.python.org/moin/ShellRedirectionFails
Feel free to reply here if something is not clear, corrections in wiki
are also welcome.
Serge.
Hi,
thanks for the information. But what I was really looking for was
informaion on when and why Python started doing it (previously, it always
used sys.getdefaultencoding())) and why it was done only for 'print' when
stdout is a terminal instead of always.
On Thu, 13 Jan 2005 14:33:20 -0800, Serge Orlov wrote: Sure. It uses the encoding of you console. Here is explanation why it uses locale to get the encoding of console: http://www.python.org/moin/PrintFails
--
Ricardo
Ricardo Bugalho wrote: Hi, thanks for the information. But what I was really looking for was informaion on when and why Python started doing it (previously, it always used sys.getdefaultencoding()))
I don't have access to any other version except 2.2 at the moment but I
believe it happened between 2.2 and 2.3 for Windows and UNIX terminals.
On other unsupported terminals I suspect sys.getdefaultencoding is
still used. The reason for the change is proper support of unicode
input/output.
and why it was done only for 'print' when stdout is a terminal instead of always.
The real question is why not *never* use sys.getdefaultencoding()
for printing. If you leave sys.getdefaultencoding() at Python default
value ('ascii') you won't need to worry about it <wink>
sys.getdefaultencoding() is a temporary measure for big projects to
use within one Python version.
Serge.
Ricardo Bugalho wrote: thanks for the information. But what I was really looking for was informaion on when and why Python started doing it (previously, it always used sys.getdefaultencoding())) and why it was done only for 'print' when stdout is a terminal instead of always.
It does that since 2.2, in response to many complains that you cannot
print a Unicode string in interactive mode, unless the Unicode string
contains only ASCII characters. It does that only if sys.stdout is
a real terminal, because otherwise it is not possible to determine
what the encoding of sys.stdout is.
Regards,
Martin This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: wolfgang haefelinger |
last post by:
Hi,
I wonder whether someone could explain me a bit what's going on here:
import sys
# I'm running Mandrake 1o and Windows XP.
print sys.version
## 2.3.3 (#2, Feb 17 2004, 11:45:40)
|
by: Spamtrap |
last post by:
I only work in Perl occasionaly, and have been searching for a
solution for a conversion, and everything I found seems much too
complex.
All I need to do is take a simple text file and copy...
|
by: fowlertrainer |
last post by:
Hi !
I want to get the WMI infos from Windows machines.
I use Py from HU (iso-8859-2) charset.
Then I wrote some utility for it, because I want to write it to an XML file.
def...
|
by: Borko |
last post by:
hi
I am having problems getting unicode characters into VB. Using VB6 (sp3) and
Access 2000
Characters are displayed correctly in Access, just when I use ADODB (2.7) to
read them in VB i get ?...
|
by: lorenzo.viscanti |
last post by:
X-No-Archive: yes
Hi, I've found lots of material on the net about unicode html
conversions, but still i'm having many problems converting unicode
characters to html entities. Is there any...
|
by: sonald |
last post by:
Hi,
I am using python2.4.1
I need to pass russian text into python and validate the same.
Can u plz guide me on how to make my existing code support the
russian text.
Is there any module...
|
by: NevilleDNZ |
last post by:
Hi,
Apologies first as I am not a unicode expert.... indeed I the details
probably totally elude me. Not withstanding: how can I convert a
binary string containing UTF-8 binary into a python...
|
by: Jim |
last post by:
Hello,
I'm trying to write exception-handling code that is OK in the
presence
of unicode error messages. I seem to have gotten all mixed up and
I'd
appreciate any un-mixing that anyone can...
|
by: thijs.braem |
last post by:
Hi everyone,
I'm having quite some troubles trying to convert Unicode to String
(for use in psycopg, which apparently doesn't know how to cope with
unicode strings).
The error I keep having...
|
by: CloudSolutions |
last post by:
Introduction:
For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
|
by: Faith0G |
last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome former...
|
by: aa123db |
last post by:
Variable and constants
Use var or let for variables and const fror constants.
Var foo ='bar';
Let foo ='bar';const baz ='bar';
Functions
function $name$ ($parameters$) {
}
...
|
by: ryjfgjl |
last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
|
by: ryjfgjl |
last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
|
by: emmanuelkatto |
last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud.
Please let me know.
Thanks!
Emmanuel
|
by: BarryA |
last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
|
by: Sonnysonu |
last post by:
This is the data of csv file
1 2 3
1 2 3
1 2 3
1 2 3
2 3
2 3
3
the lengths should be different i have to store the data by column-wise with in the specific length.
suppose the i have to...
| |