¸÷λ´óÏÀ,Python ±àÂëÎÊÌâ
Àý×ÓÊÇÕâÑùµÄ:
>>> str1 = 'С¹·'
>>> str1
'С¹·'
>>> str1.encode('utf-8')
b'\xe5\xb0\x8f\xe7\x8b\x97'
>>> str2 = str1.encode('utf-8')
>>> str2
b'\xe5\xb0\x8f\xe7\x8b\x97'
ÎÒÒª»¹ÔΪ'С¹·'ÓÃÄÄÖÖ±àÂë¸ñʽ
×¢:python 2.4ºÍpython 3.1
>>>str1.encode('gbk')
Traceback (most recent call last):
File "<pyshell#6>", line 1, in <module>
str2.encode('gbk')
AttributeError: 'bytes' object has no attribute 'encode'
º¹,лл
ÎÒ¸Õ·¢ÍêÌû,ͻȻÏëÆðÊǺ¯ÊýÓôíÁË.
ºÇºÇ
Ïà¹ØÎÊ´ð£º
ËÍÆ¼öÒ»±¾Ñ§Ï°PYTHONµÄÊ飬лл
¡¶python¼òÃ÷½Ì³Ì¡·£¬¡¶pythonºËÐıà³Ì¡·
×÷Ϊϵͳ¹ÜÀí·½Ãæ£¬¡¶Python UNIXºÍLinuxϵͳ¹ÜÀíÖ¸ÄÏ¡·ÊDZ¾·Ç³£²»´íµÄÊé¡£
http://club.book.csdn.net/pic3/255142.jpg
ÒýÓÃ
×÷Î ......
s='aaa111aaa,bbb222,333ccc,444ddd444,555eee666,fff777ggg'
ÓÃÕýÔò±í´ïʽȡ³ö ǰºó×ÖĸÏàͬµÄÊý¾Ý ½á¹ûÈçÏÂ:
111 ddd
лл~
Python code:
import re
s='aaa111aaa,bbb222,333ccc,444ddd444,555eee666,ff ......
ÎÒÿ´ÎÉÏ´«µÄÎļþ¶Áµ½µÄÊý¾Ý¶¼²»ÕýÈ·¡£2M µÄͼƬ¶ÁµÃ10¶àK ¡£¡£ÄÄλ´óÏÀ¿ÉÒÔ°ï°ïÎÒ°¡¡£
#!D:\ProgrammerTools\python26\python.exe
#encoding=utf-8
import cgitb
import os
cgitb.enable()
import cgi,urllib ......
#½«GB2312¸ñʽתΪUTF-8¸ñʽ
f = codecs.open('e:\TestResult.xml', "rb", "gb2312")
text = f.read().encode("utf-8")
  ......
²ËÊÖÇë½ÌÖîλÀÏÄñ£º
ÎÒÏ£Íû±à¼zipѹËõ°üÀïµÄÒ»¸öÎı¾Îļþ¡£Ôõô½â¾ö±È½ÏºÏÀí£¿
ÎÒµÄÏë·¨£ºÏȰÑZIP½âѹ£¬ÔÙ±à¼Îı¾£¬È»ºóÔÙѹËõ³ÉZIP²¢°Ñ½âѹ¹ýµÄÎļþɾ³ý¡£²»¹ý¸Ð¾õÓÐЩ·±Ëö£¬¿É²»¿ÉÒÔÖ±½Ó¶ÁZIPÈ»ºóÐÞ¸ÄÄØ£¿ ......