Commit | Line | Data |
---|---|---|
920dae64 AT |
1 | <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> |
2 | <html> | |
3 | <head> | |
4 | <link rel="STYLESHEET" href="lib.css" type='text/css' /> | |
5 | <link rel="SHORTCUT ICON" href="../icons/pyfav.png" type="image/png" /> | |
6 | <link rel='start' href='../index.html' title='Python Documentation Index' /> | |
7 | <link rel="first" href="lib.html" title='Python Library Reference' /> | |
8 | <link rel='contents' href='contents.html' title="Contents" /> | |
9 | <link rel='index' href='genindex.html' title='Index' /> | |
10 | <link rel='last' href='about.html' title='About this document...' /> | |
11 | <link rel='help' href='about.html' title='About this document...' /> | |
12 | <link rel="next" href="module-encodings.idna.html" /> | |
13 | <link rel="prev" href="node130.html" /> | |
14 | <link rel="parent" href="module-codecs.html" /> | |
15 | <link rel="next" href="module-encodings.idna.html" /> | |
16 | <meta name='aesop' content='information' /> | |
17 | <title>4.9.2 Standard Encodings</title> | |
18 | </head> | |
19 | <body> | |
20 | <DIV CLASS="navigation"> | |
21 | <div id='top-navigation-panel' xml:id='top-navigation-panel'> | |
22 | <table align="center" width="100%" cellpadding="0" cellspacing="2"> | |
23 | <tr> | |
24 | <td class='online-navigation'><a rel="prev" title="4.9.1.5 StreamRecoder Objects" | |
25 | href="stream-recoder-objects.html"><img src='../icons/previous.png' | |
26 | border='0' height='32' alt='Previous Page' width='32' /></A></td> | |
27 | <td class='online-navigation'><a rel="parent" title="4.9 codecs " | |
28 | href="module-codecs.html"><img src='../icons/up.png' | |
29 | border='0' height='32' alt='Up One Level' width='32' /></A></td> | |
30 | <td class='online-navigation'><a rel="next" title="4.9.3 encodings.idna " | |
31 | href="module-encodings.idna.html"><img src='../icons/next.png' | |
32 | border='0' height='32' alt='Next Page' width='32' /></A></td> | |
33 | <td align="center" width="100%">Python Library Reference</td> | |
34 | <td class='online-navigation'><a rel="contents" title="Table of Contents" | |
35 | href="contents.html"><img src='../icons/contents.png' | |
36 | border='0' height='32' alt='Contents' width='32' /></A></td> | |
37 | <td class='online-navigation'><a href="modindex.html" title="Module Index"><img src='../icons/modules.png' | |
38 | border='0' height='32' alt='Module Index' width='32' /></a></td> | |
39 | <td class='online-navigation'><a rel="index" title="Index" | |
40 | href="genindex.html"><img src='../icons/index.png' | |
41 | border='0' height='32' alt='Index' width='32' /></A></td> | |
42 | </tr></table> | |
43 | <div class='online-navigation'> | |
44 | <b class="navlabel">Previous:</b> | |
45 | <a class="sectref" rel="prev" href="stream-recoder-objects.html">4.9.1.5 StreamRecoder Objects</A> | |
46 | <b class="navlabel">Up:</b> | |
47 | <a class="sectref" rel="parent" href="module-codecs.html">4.9 codecs </A> | |
48 | <b class="navlabel">Next:</b> | |
49 | <a class="sectref" rel="next" href="module-encodings.idna.html">4.9.3 encodings.idna </A> | |
50 | </div> | |
51 | <hr /></div> | |
52 | </DIV> | |
53 | <!--End of Navigation Panel--> | |
54 | ||
55 | <H2><A NAME="SECTION006920000000000000000"></A><A NAME="standard-encodings"></A> | |
56 | <BR> | |
57 | 4.9.2 Standard Encodings | |
58 | </H2> | |
59 | ||
60 | <P> | |
61 | Python comes with a number of codecs builtin, either implemented as C | |
62 | functions, or with dictionaries as mapping tables. The following table | |
63 | lists the codecs by name, together with a few common aliases, and the | |
64 | languages for which the encoding is likely used. Neither the list of | |
65 | aliases nor the list of languages is meant to be exhaustive. Notice | |
66 | that spelling alternatives that only differ in case or use a hyphen | |
67 | instead of an underscore are also valid aliases. | |
68 | ||
69 | <P> | |
70 | Many of the character sets support the same languages. They vary in | |
71 | individual characters (e.g. whether the EURO SIGN is supported or | |
72 | not), and in the assignment of characters to code positions. For the | |
73 | European languages in particular, the following variants typically | |
74 | exist: | |
75 | ||
76 | <P> | |
77 | ||
78 | <UL> | |
79 | <LI>an ISO 8859 codeset | |
80 | </LI> | |
81 | <LI>a Microsoft Windows code page, which is typically derived from | |
82 | a 8859 codeset, but replaces control characters with additional | |
83 | graphic characters | |
84 | </LI> | |
85 | <LI>an IBM EBCDIC code page | |
86 | </LI> | |
87 | <LI>an IBM PC code page, which is ASCII compatible | |
88 | </LI> | |
89 | </UL> | |
90 | ||
91 | <P> | |
92 | <div class="center"><table class="realtable"> | |
93 | <thead> | |
94 | <tr> | |
95 | <th class="left" >Codec</th> | |
96 | <th class="left" >Aliases</th> | |
97 | <th class="left" >Languages</th> | |
98 | </tr> | |
99 | </thead> | |
100 | <tbody> | |
101 | <P> | |
102 | ||
103 | <tr><td class="left" valign="baseline">ascii</td> | |
104 | <td class="left" >646, us-ascii</td> | |
105 | <td class="left" >English</td></tr><P> | |
106 | ||
107 | <tr><td class="left" valign="baseline">big5</td> | |
108 | <td class="left" >big5-tw, csbig5</td> | |
109 | <td class="left" >Traditional Chinese</td></tr><P> | |
110 | ||
111 | <tr><td class="left" valign="baseline">big5hkscs</td> | |
112 | <td class="left" >big5-hkscs, hkscs</td> | |
113 | <td class="left" >Traditional Chinese</td></tr><P> | |
114 | ||
115 | <tr><td class="left" valign="baseline">cp037</td> | |
116 | <td class="left" >IBM037, IBM039</td> | |
117 | <td class="left" >English</td></tr><P> | |
118 | ||
119 | <tr><td class="left" valign="baseline">cp424</td> | |
120 | <td class="left" >EBCDIC-CP-HE, IBM424</td> | |
121 | <td class="left" >Hebrew</td></tr><P> | |
122 | ||
123 | <tr><td class="left" valign="baseline">cp437</td> | |
124 | <td class="left" >437, IBM437</td> | |
125 | <td class="left" >English</td></tr><P> | |
126 | ||
127 | <tr><td class="left" valign="baseline">cp500</td> | |
128 | <td class="left" >EBCDIC-CP-BE, EBCDIC-CP-CH, IBM500</td> | |
129 | <td class="left" >Western Europe</td></tr><P> | |
130 | ||
131 | <tr><td class="left" valign="baseline">cp737</td> | |
132 | <td class="left" ></td> | |
133 | <td class="left" >Greek</td></tr><P> | |
134 | ||
135 | <tr><td class="left" valign="baseline">cp775</td> | |
136 | <td class="left" >IBM775</td> | |
137 | <td class="left" >Baltic languages</td></tr><P> | |
138 | ||
139 | <tr><td class="left" valign="baseline">cp850</td> | |
140 | <td class="left" >850, IBM850</td> | |
141 | <td class="left" >Western Europe</td></tr><P> | |
142 | ||
143 | <tr><td class="left" valign="baseline">cp852</td> | |
144 | <td class="left" >852, IBM852</td> | |
145 | <td class="left" >Central and Eastern Europe</td></tr><P> | |
146 | ||
147 | <tr><td class="left" valign="baseline">cp855</td> | |
148 | <td class="left" >855, IBM855</td> | |
149 | <td class="left" >Bulgarian, Byelorussian, Macedonian, Russian, Serbian</td></tr><P> | |
150 | ||
151 | <tr><td class="left" valign="baseline">cp856</td> | |
152 | <td class="left" ></td> | |
153 | <td class="left" >Hebrew</td></tr><P> | |
154 | ||
155 | <tr><td class="left" valign="baseline">cp857</td> | |
156 | <td class="left" >857, IBM857</td> | |
157 | <td class="left" >Turkish</td></tr><P> | |
158 | ||
159 | <tr><td class="left" valign="baseline">cp860</td> | |
160 | <td class="left" >860, IBM860</td> | |
161 | <td class="left" >Portuguese</td></tr><P> | |
162 | ||
163 | <tr><td class="left" valign="baseline">cp861</td> | |
164 | <td class="left" >861, CP-IS, IBM861</td> | |
165 | <td class="left" >Icelandic</td></tr><P> | |
166 | ||
167 | <tr><td class="left" valign="baseline">cp862</td> | |
168 | <td class="left" >862, IBM862</td> | |
169 | <td class="left" >Hebrew</td></tr><P> | |
170 | ||
171 | <tr><td class="left" valign="baseline">cp863</td> | |
172 | <td class="left" >863, IBM863</td> | |
173 | <td class="left" >Canadian</td></tr><P> | |
174 | ||
175 | <tr><td class="left" valign="baseline">cp864</td> | |
176 | <td class="left" >IBM864</td> | |
177 | <td class="left" >Arabic</td></tr><P> | |
178 | ||
179 | <tr><td class="left" valign="baseline">cp865</td> | |
180 | <td class="left" >865, IBM865</td> | |
181 | <td class="left" >Danish, Norwegian</td></tr><P> | |
182 | ||
183 | <tr><td class="left" valign="baseline">cp866</td> | |
184 | <td class="left" >866, IBM866</td> | |
185 | <td class="left" >Russian</td></tr><P> | |
186 | ||
187 | <tr><td class="left" valign="baseline">cp869</td> | |
188 | <td class="left" >869, CP-GR, IBM869</td> | |
189 | <td class="left" >Greek</td></tr><P> | |
190 | ||
191 | <tr><td class="left" valign="baseline">cp874</td> | |
192 | <td class="left" ></td> | |
193 | <td class="left" >Thai</td></tr><P> | |
194 | ||
195 | <tr><td class="left" valign="baseline">cp875</td> | |
196 | <td class="left" ></td> | |
197 | <td class="left" >Greek</td></tr><P> | |
198 | ||
199 | <tr><td class="left" valign="baseline">cp932</td> | |
200 | <td class="left" >932, ms932, mskanji, ms-kanji</td> | |
201 | <td class="left" >Japanese</td></tr><P> | |
202 | ||
203 | <tr><td class="left" valign="baseline">cp949</td> | |
204 | <td class="left" >949, ms949, uhc</td> | |
205 | <td class="left" >Korean</td></tr><P> | |
206 | ||
207 | <tr><td class="left" valign="baseline">cp950</td> | |
208 | <td class="left" >950, ms950</td> | |
209 | <td class="left" >Traditional Chinese</td></tr><P> | |
210 | ||
211 | <tr><td class="left" valign="baseline">cp1006</td> | |
212 | <td class="left" ></td> | |
213 | <td class="left" >Urdu</td></tr><P> | |
214 | ||
215 | <tr><td class="left" valign="baseline">cp1026</td> | |
216 | <td class="left" >ibm1026</td> | |
217 | <td class="left" >Turkish</td></tr><P> | |
218 | ||
219 | <tr><td class="left" valign="baseline">cp1140</td> | |
220 | <td class="left" >ibm1140</td> | |
221 | <td class="left" >Western Europe</td></tr><P> | |
222 | ||
223 | <tr><td class="left" valign="baseline">cp1250</td> | |
224 | <td class="left" >windows-1250</td> | |
225 | <td class="left" >Central and Eastern Europe</td></tr><P> | |
226 | ||
227 | <tr><td class="left" valign="baseline">cp1251</td> | |
228 | <td class="left" >windows-1251</td> | |
229 | <td class="left" >Bulgarian, Byelorussian, Macedonian, Russian, Serbian</td></tr><P> | |
230 | ||
231 | <tr><td class="left" valign="baseline">cp1252</td> | |
232 | <td class="left" >windows-1252</td> | |
233 | <td class="left" >Western Europe</td></tr><P> | |
234 | ||
235 | <tr><td class="left" valign="baseline">cp1253</td> | |
236 | <td class="left" >windows-1253</td> | |
237 | <td class="left" >Greek</td></tr><P> | |
238 | ||
239 | <tr><td class="left" valign="baseline">cp1254</td> | |
240 | <td class="left" >windows-1254</td> | |
241 | <td class="left" >Turkish</td></tr><P> | |
242 | ||
243 | <tr><td class="left" valign="baseline">cp1255</td> | |
244 | <td class="left" >windows-1255</td> | |
245 | <td class="left" >Hebrew</td></tr><P> | |
246 | ||
247 | <tr><td class="left" valign="baseline">cp1256</td> | |
248 | <td class="left" >windows1256</td> | |
249 | <td class="left" >Arabic</td></tr><P> | |
250 | ||
251 | <tr><td class="left" valign="baseline">cp1257</td> | |
252 | <td class="left" >windows-1257</td> | |
253 | <td class="left" >Baltic languages</td></tr><P> | |
254 | ||
255 | <tr><td class="left" valign="baseline">cp1258</td> | |
256 | <td class="left" >windows-1258</td> | |
257 | <td class="left" >Vietnamese</td></tr><P> | |
258 | ||
259 | <tr><td class="left" valign="baseline">euc_jp</td> | |
260 | <td class="left" >eucjp, ujis, u-jis</td> | |
261 | <td class="left" >Japanese</td></tr><P> | |
262 | ||
263 | <tr><td class="left" valign="baseline">euc_jis_2004</td> | |
264 | <td class="left" >jisx0213, eucjis2004</td> | |
265 | <td class="left" >Japanese</td></tr><P> | |
266 | ||
267 | <tr><td class="left" valign="baseline">euc_jisx0213</td> | |
268 | <td class="left" >eucjisx0213</td> | |
269 | <td class="left" >Japanese</td></tr><P> | |
270 | ||
271 | <tr><td class="left" valign="baseline">euc_kr</td> | |
272 | <td class="left" >euckr, korean, ksc5601, ks_c-5601, ks_c-5601-1987, ksx1001, ks_x-1001</td> | |
273 | <td class="left" >Korean</td></tr><P> | |
274 | ||
275 | <tr><td class="left" valign="baseline">gb2312</td> | |
276 | <td class="left" >chinese, csiso58gb231280, euc-cn, euccn, eucgb2312-cn, gb2312-1980, | |
277 | gb2312-80, iso-ir-58</td> | |
278 | <td class="left" >Simplified Chinese</td></tr><P> | |
279 | ||
280 | <tr><td class="left" valign="baseline">gbk</td> | |
281 | <td class="left" >936, cp936, ms936</td> | |
282 | <td class="left" >Unified Chinese</td></tr><P> | |
283 | ||
284 | <tr><td class="left" valign="baseline">gb18030</td> | |
285 | <td class="left" >gb18030-2000</td> | |
286 | <td class="left" >Unified Chinese</td></tr><P> | |
287 | ||
288 | <tr><td class="left" valign="baseline">hz</td> | |
289 | <td class="left" >hzgb, hz-gb, hz-gb-2312</td> | |
290 | <td class="left" >Simplified Chinese</td></tr><P> | |
291 | ||
292 | <tr><td class="left" valign="baseline">iso2022_jp</td> | |
293 | <td class="left" >csiso2022jp, iso2022jp, iso-2022-jp</td> | |
294 | <td class="left" >Japanese</td></tr><P> | |
295 | ||
296 | <tr><td class="left" valign="baseline">iso2022_jp_1</td> | |
297 | <td class="left" >iso2022jp-1, iso-2022-jp-1</td> | |
298 | <td class="left" >Japanese</td></tr><P> | |
299 | ||
300 | <tr><td class="left" valign="baseline">iso2022_jp_2</td> | |
301 | <td class="left" >iso2022jp-2, iso-2022-jp-2</td> | |
302 | <td class="left" >Japanese, Korean, Simplified Chinese, Western Europe, Greek</td></tr><P> | |
303 | ||
304 | <tr><td class="left" valign="baseline">iso2022_jp_2004</td> | |
305 | <td class="left" >iso2022jp-2004, iso-2022-jp-2004</td> | |
306 | <td class="left" >Japanese</td></tr><P> | |
307 | ||
308 | <tr><td class="left" valign="baseline">iso2022_jp_3</td> | |
309 | <td class="left" >iso2022jp-3, iso-2022-jp-3</td> | |
310 | <td class="left" >Japanese</td></tr><P> | |
311 | ||
312 | <tr><td class="left" valign="baseline">iso2022_jp_ext</td> | |
313 | <td class="left" >iso2022jp-ext, iso-2022-jp-ext</td> | |
314 | <td class="left" >Japanese</td></tr><P> | |
315 | ||
316 | <tr><td class="left" valign="baseline">iso2022_kr</td> | |
317 | <td class="left" >csiso2022kr, iso2022kr, iso-2022-kr</td> | |
318 | <td class="left" >Korean</td></tr><P> | |
319 | ||
320 | <tr><td class="left" valign="baseline">latin_1</td> | |
321 | <td class="left" >iso-8859-1, iso8859-1, 8859, cp819, latin, latin1, L1</td> | |
322 | <td class="left" >West Europe</td></tr><P> | |
323 | ||
324 | <tr><td class="left" valign="baseline">iso8859_2</td> | |
325 | <td class="left" >iso-8859-2, latin2, L2</td> | |
326 | <td class="left" >Central and Eastern Europe</td></tr><P> | |
327 | ||
328 | <tr><td class="left" valign="baseline">iso8859_3</td> | |
329 | <td class="left" >iso-8859-3, latin3, L3</td> | |
330 | <td class="left" >Esperanto, Maltese</td></tr><P> | |
331 | ||
332 | <tr><td class="left" valign="baseline">iso8859_4</td> | |
333 | <td class="left" >iso-8859-4, latin4, L4</td> | |
334 | <td class="left" >Baltic languagues</td></tr><P> | |
335 | ||
336 | <tr><td class="left" valign="baseline">iso8859_5</td> | |
337 | <td class="left" >iso-8859-5, cyrillic</td> | |
338 | <td class="left" >Bulgarian, Byelorussian, Macedonian, Russian, Serbian</td></tr><P> | |
339 | ||
340 | <tr><td class="left" valign="baseline">iso8859_6</td> | |
341 | <td class="left" >iso-8859-6, arabic</td> | |
342 | <td class="left" >Arabic</td></tr><P> | |
343 | ||
344 | <tr><td class="left" valign="baseline">iso8859_7</td> | |
345 | <td class="left" >iso-8859-7, greek, greek8</td> | |
346 | <td class="left" >Greek</td></tr><P> | |
347 | ||
348 | <tr><td class="left" valign="baseline">iso8859_8</td> | |
349 | <td class="left" >iso-8859-8, hebrew</td> | |
350 | <td class="left" >Hebrew</td></tr><P> | |
351 | ||
352 | <tr><td class="left" valign="baseline">iso8859_9</td> | |
353 | <td class="left" >iso-8859-9, latin5, L5</td> | |
354 | <td class="left" >Turkish</td></tr><P> | |
355 | ||
356 | <tr><td class="left" valign="baseline">iso8859_10</td> | |
357 | <td class="left" >iso-8859-10, latin6, L6</td> | |
358 | <td class="left" >Nordic languages</td></tr><P> | |
359 | ||
360 | <tr><td class="left" valign="baseline">iso8859_13</td> | |
361 | <td class="left" >iso-8859-13</td> | |
362 | <td class="left" >Baltic languages</td></tr><P> | |
363 | ||
364 | <tr><td class="left" valign="baseline">iso8859_14</td> | |
365 | <td class="left" >iso-8859-14, latin8, L8</td> | |
366 | <td class="left" >Celtic languages</td></tr><P> | |
367 | ||
368 | <tr><td class="left" valign="baseline">iso8859_15</td> | |
369 | <td class="left" >iso-8859-15</td> | |
370 | <td class="left" >Western Europe</td></tr><P> | |
371 | ||
372 | <tr><td class="left" valign="baseline">johab</td> | |
373 | <td class="left" >cp1361, ms1361</td> | |
374 | <td class="left" >Korean</td></tr><P> | |
375 | ||
376 | <tr><td class="left" valign="baseline">koi8_r</td> | |
377 | <td class="left" ></td> | |
378 | <td class="left" >Russian</td></tr><P> | |
379 | ||
380 | <tr><td class="left" valign="baseline">koi8_u</td> | |
381 | <td class="left" ></td> | |
382 | <td class="left" >Ukrainian</td></tr><P> | |
383 | ||
384 | <tr><td class="left" valign="baseline">mac_cyrillic</td> | |
385 | <td class="left" >maccyrillic</td> | |
386 | <td class="left" >Bulgarian, Byelorussian, Macedonian, Russian, Serbian</td></tr><P> | |
387 | ||
388 | <tr><td class="left" valign="baseline">mac_greek</td> | |
389 | <td class="left" >macgreek</td> | |
390 | <td class="left" >Greek</td></tr><P> | |
391 | ||
392 | <tr><td class="left" valign="baseline">mac_iceland</td> | |
393 | <td class="left" >maciceland</td> | |
394 | <td class="left" >Icelandic</td></tr><P> | |
395 | ||
396 | <tr><td class="left" valign="baseline">mac_latin2</td> | |
397 | <td class="left" >maclatin2, maccentraleurope</td> | |
398 | <td class="left" >Central and Eastern Europe</td></tr><P> | |
399 | ||
400 | <tr><td class="left" valign="baseline">mac_roman</td> | |
401 | <td class="left" >macroman</td> | |
402 | <td class="left" >Western Europe</td></tr><P> | |
403 | ||
404 | <tr><td class="left" valign="baseline">mac_turkish</td> | |
405 | <td class="left" >macturkish</td> | |
406 | <td class="left" >Turkish</td></tr><P> | |
407 | ||
408 | <tr><td class="left" valign="baseline">ptcp154</td> | |
409 | <td class="left" >csptcp154, pt154, cp154, cyrillic-asian</td> | |
410 | <td class="left" >Kazakh</td></tr><P> | |
411 | ||
412 | <tr><td class="left" valign="baseline">shift_jis</td> | |
413 | <td class="left" >csshiftjis, shiftjis, sjis, s_jis</td> | |
414 | <td class="left" >Japanese</td></tr><P> | |
415 | ||
416 | <tr><td class="left" valign="baseline">shift_jis_2004</td> | |
417 | <td class="left" >shiftjis2004, sjis_2004, sjis2004</td> | |
418 | <td class="left" >Japanese</td></tr><P> | |
419 | ||
420 | <tr><td class="left" valign="baseline">shift_jisx0213</td> | |
421 | <td class="left" >shiftjisx0213, sjisx0213, s_jisx0213</td> | |
422 | <td class="left" >Japanese</td></tr><P> | |
423 | ||
424 | <tr><td class="left" valign="baseline">utf_16</td> | |
425 | <td class="left" >U16, utf16</td> | |
426 | <td class="left" >all languages</td></tr><P> | |
427 | ||
428 | <tr><td class="left" valign="baseline">utf_16_be</td> | |
429 | <td class="left" >UTF-16BE</td> | |
430 | <td class="left" >all languages (BMP only)</td></tr><P> | |
431 | ||
432 | <tr><td class="left" valign="baseline">utf_16_le</td> | |
433 | <td class="left" >UTF-16LE</td> | |
434 | <td class="left" >all languages (BMP only)</td></tr><P> | |
435 | ||
436 | <tr><td class="left" valign="baseline">utf_7</td> | |
437 | <td class="left" >U7</td> | |
438 | <td class="left" >all languages</td></tr><P> | |
439 | ||
440 | <tr><td class="left" valign="baseline">utf_8</td> | |
441 | <td class="left" >U8, UTF, utf8</td> | |
442 | <td class="left" >all languages</td></tr><P> | |
443 | ||
444 | </tbody> | |
445 | </table></div> | |
446 | ||
447 | <P> | |
448 | A number of codecs are specific to Python, so their codec names have | |
449 | no meaning outside Python. Some of them don't convert from Unicode | |
450 | strings to byte strings, but instead use the property of the Python | |
451 | codecs machinery that any bijective function with one argument can be | |
452 | considered as an encoding. | |
453 | ||
454 | <P> | |
455 | For the codecs listed below, the result in the ``encoding'' direction | |
456 | is always a byte string. The result of the ``decoding'' direction is | |
457 | listed as operand type in the table. | |
458 | ||
459 | <P> | |
460 | <div class="center"><table class="realtable"> | |
461 | <thead> | |
462 | <tr> | |
463 | <th class="left" >Codec</th> | |
464 | <th class="left" >Aliases</th> | |
465 | <th class="left" >Operand type</th> | |
466 | <th class="left" >Purpose</th> | |
467 | </tr> | |
468 | </thead> | |
469 | <tbody> | |
470 | <P> | |
471 | ||
472 | <tr><td class="left" valign="baseline">base64_codec</td> | |
473 | <td class="left" >base64, base-64</td> | |
474 | <td class="left" >byte string</td> | |
475 | <td class="left" >Convert operand to MIME base64</td></tr><P> | |
476 | ||
477 | <tr><td class="left" valign="baseline">bz2_codec</td> | |
478 | <td class="left" >bz2</td> | |
479 | <td class="left" >byte string</td> | |
480 | <td class="left" >Compress the operand using bz2</td></tr><P> | |
481 | ||
482 | <tr><td class="left" valign="baseline">hex_codec</td> | |
483 | <td class="left" >hex</td> | |
484 | <td class="left" >byte string</td> | |
485 | <td class="left" >Convert operand to hexadecimal representation, with two | |
486 | digits per byte</td></tr><P> | |
487 | ||
488 | <tr><td class="left" valign="baseline">idna</td> | |
489 | <td class="left" ></td> | |
490 | <td class="left" >Unicode string</td> | |
491 | <td class="left" >Implements <a class="rfc" id='rfcref-87082' xml:id='rfcref-87082' | |
492 | href="http://www.faqs.org/rfcs/rfc3490.html">RFC 3490</a>. | |
493 | ||
494 | <span class="versionnote">New in version 2.3.</span> | |
495 | ||
496 | See also <tt class="module"><a href="module-encodings.idna.html">encodings.idna</a></tt></td></tr><P> | |
497 | ||
498 | <tr><td class="left" valign="baseline">mbcs</td> | |
499 | <td class="left" >dbcs</td> | |
500 | <td class="left" >Unicode string</td> | |
501 | <td class="left" >Windows only: Encode operand according to the ANSI codepage (CP_ACP)</td></tr><P> | |
502 | ||
503 | <tr><td class="left" valign="baseline">palmos</td> | |
504 | <td class="left" ></td> | |
505 | <td class="left" >Unicode string</td> | |
506 | <td class="left" >Encoding of PalmOS 3.5</td></tr><P> | |
507 | ||
508 | <tr><td class="left" valign="baseline">punycode</td> | |
509 | <td class="left" ></td> | |
510 | <td class="left" >Unicode string</td> | |
511 | <td class="left" >Implements <a class="rfc" id='rfcref-87084' xml:id='rfcref-87084' | |
512 | href="http://www.faqs.org/rfcs/rfc3492.html">RFC 3492</a>. | |
513 | ||
514 | <span class="versionnote">New in version 2.3.</span> | |
515 | </td></tr><P> | |
516 | ||
517 | <tr><td class="left" valign="baseline">quopri_codec</td> | |
518 | <td class="left" >quopri, quoted-printable, quotedprintable</td> | |
519 | <td class="left" >byte string</td> | |
520 | <td class="left" >Convert operand to MIME quoted printable</td></tr><P> | |
521 | ||
522 | <tr><td class="left" valign="baseline">raw_unicode_escape</td> | |
523 | <td class="left" ></td> | |
524 | <td class="left" >Unicode string</td> | |
525 | <td class="left" >Produce a string that is suitable as raw Unicode literal in | |
526 | Python source code</td></tr><P> | |
527 | ||
528 | <tr><td class="left" valign="baseline">rot_13</td> | |
529 | <td class="left" >rot13</td> | |
530 | <td class="left" >byte string</td> | |
531 | <td class="left" >Returns the Caesar-cypher encryption of the operand</td></tr><P> | |
532 | ||
533 | <tr><td class="left" valign="baseline">string_escape</td> | |
534 | <td class="left" ></td> | |
535 | <td class="left" >byte string</td> | |
536 | <td class="left" >Produce a string that is suitable as string literal in | |
537 | Python source code</td></tr><P> | |
538 | ||
539 | <tr><td class="left" valign="baseline">undefined</td> | |
540 | <td class="left" ></td> | |
541 | <td class="left" >any</td> | |
542 | <td class="left" >Raise an exception for all conversion. Can be used as the | |
543 | system encoding if no automatic coercion between byte and | |
544 | Unicode strings is desired.</td></tr><P> | |
545 | ||
546 | <tr><td class="left" valign="baseline">unicode_escape</td> | |
547 | <td class="left" ></td> | |
548 | <td class="left" >Unicode string</td> | |
549 | <td class="left" >Produce a string that is suitable as Unicode literal in | |
550 | Python source code</td></tr><P> | |
551 | ||
552 | <tr><td class="left" valign="baseline">unicode_internal</td> | |
553 | <td class="left" ></td> | |
554 | <td class="left" >Unicode string</td> | |
555 | <td class="left" >Return the internal representation of the operand</td></tr><P> | |
556 | ||
557 | <tr><td class="left" valign="baseline">uu_codec</td> | |
558 | <td class="left" >uu</td> | |
559 | <td class="left" >byte string</td> | |
560 | <td class="left" >Convert the operand using uuencode</td></tr><P> | |
561 | ||
562 | <tr><td class="left" valign="baseline">zlib_codec</td> | |
563 | <td class="left" >zip, zlib</td> | |
564 | <td class="left" >byte string</td> | |
565 | <td class="left" >Compress the operand using gzip</td></tr><P> | |
566 | ||
567 | </tbody> | |
568 | </table></div> | |
569 | ||
570 | <P> | |
571 | ||
572 | <DIV CLASS="navigation"> | |
573 | <div class='online-navigation'> | |
574 | <p></p><hr /> | |
575 | <table align="center" width="100%" cellpadding="0" cellspacing="2"> | |
576 | <tr> | |
577 | <td class='online-navigation'><a rel="prev" title="4.9.1.5 StreamRecoder Objects" | |
578 | href="stream-recoder-objects.html"><img src='../icons/previous.png' | |
579 | border='0' height='32' alt='Previous Page' width='32' /></A></td> | |
580 | <td class='online-navigation'><a rel="parent" title="4.9 codecs " | |
581 | href="module-codecs.html"><img src='../icons/up.png' | |
582 | border='0' height='32' alt='Up One Level' width='32' /></A></td> | |
583 | <td class='online-navigation'><a rel="next" title="4.9.3 encodings.idna " | |
584 | href="module-encodings.idna.html"><img src='../icons/next.png' | |
585 | border='0' height='32' alt='Next Page' width='32' /></A></td> | |
586 | <td align="center" width="100%">Python Library Reference</td> | |
587 | <td class='online-navigation'><a rel="contents" title="Table of Contents" | |
588 | href="contents.html"><img src='../icons/contents.png' | |
589 | border='0' height='32' alt='Contents' width='32' /></A></td> | |
590 | <td class='online-navigation'><a href="modindex.html" title="Module Index"><img src='../icons/modules.png' | |
591 | border='0' height='32' alt='Module Index' width='32' /></a></td> | |
592 | <td class='online-navigation'><a rel="index" title="Index" | |
593 | href="genindex.html"><img src='../icons/index.png' | |
594 | border='0' height='32' alt='Index' width='32' /></A></td> | |
595 | </tr></table> | |
596 | <div class='online-navigation'> | |
597 | <b class="navlabel">Previous:</b> | |
598 | <a class="sectref" rel="prev" href="stream-recoder-objects.html">4.9.1.5 StreamRecoder Objects</A> | |
599 | <b class="navlabel">Up:</b> | |
600 | <a class="sectref" rel="parent" href="module-codecs.html">4.9 codecs </A> | |
601 | <b class="navlabel">Next:</b> | |
602 | <a class="sectref" rel="next" href="module-encodings.idna.html">4.9.3 encodings.idna </A> | |
603 | </div> | |
604 | </div> | |
605 | <hr /> | |
606 | <span class="release-info">Release 2.4.2, documentation updated on 28 September 2005.</span> | |
607 | </DIV> | |
608 | <!--End of Navigation Panel--> | |
609 | <ADDRESS> | |
610 | See <i><a href="about.html">About this document...</a></i> for information on suggesting changes. | |
611 | </ADDRESS> | |
612 | </BODY> | |
613 | </HTML> |