ÀûÓÃPythonץȡºÍ½âÎöÍøÒ³(ÉÏ)
¶ÔËÑË÷ÒýÇæ¡¢ÎļþË÷Òý¡¢Îĵµ×ª»»¡¢Êý¾Ý¼ìË÷¡¢Õ¾µã±¸·Ý»òÇ¨ÒÆµÈÓ¦ÓóÌÐòÀ´Ëµ£¬¾³£Óõ½¶ÔÍøÒ³(¼´HTMLÎļþ)µÄ½âÎö´¦Àí¡£ÊÂʵÉÏ£¬Í¨¹ýPythonÓïÑÔÌṩµÄ¸÷ÖÖÄ£¿é£¬ÎÒÃÇÎÞÐè½èÖúWeb·þÎñÆ÷
»ò
ÕßWebä¯ÀÀÆ÷¾ÍÄܹ»½âÎöºÍ´¦ÀíHTMLÎĵµ¡£±¾ÎĽ«Ïêϸ½éÉÜÈçºÎÀûÓÃPythonץȡºÍ½âÎöÍøÒ³¡£Ê×ÏÈ£¬ÎÒÃǽéÉÜÒ»¸ö¿ÉÒÔ°ïÖú¼ò»¯´ò¿ªÎ»ÓÚ±¾µØºÍWeb
ÉϵÄHTMLÎĵµµÄPythonÄ£¿é£¬È»ºó£¬ÎÒÃÇÂÛÊöÈçºÎʹÓÃPythonÄ£¿éÀ´Ñ¸ËÙ½âÎöÔÚHTMLÎļþÖеÄÊý¾Ý£¬´Ó¶ø´¦ÀíÌØ¶¨µÄÄÚÈÝ£¬ÈçÁ´½Ó¡¢Í¼ÏñºÍ
CookieµÈ¡£×îºó£¬ÎÒÃÇ»á¸ø³öÒ»¸ö¹æÕûHTMLÎļþµÄ¸ñʽ±êÇ©µÄÀý×Ó£¬Í¨¹ýÕâ¸öÀý×ÓÄú»á·¢ÏÖʹÓÃpython´¦ÀíHTMLÎļþµÄÄÚÈÝÊǷdz£¼òµ¥µÄÒ»¼þ
ÊÂÇé¡£
¡¡¡¡Ò»¡¢½âÎöURL
¡¡¡¡Í¨¹ýPythonËù´øµÄurlparseÄ£¿é£¬ÎÒÃÇÄܹ»ÇáËɵذÑURL·Ö½â³ÉÔª¼þ£¬Ö®ºó£¬»¹Äܽ«ÕâЩԪ¼þÖØÐÂ×é×°³ÉÒ»¸öURL¡£µ±ÎÒÃÇ´¦ÀíHTML ÎĵµµÄʱºò£¬ÕâÏÄÜÊǷdz£·½±ãµÄ¡£
¡¡¡¡
import
urlparse
¡¡¡¡parsedTuple
=
urlparse.urlparse(
¡¡¡¡
"
http://www.google.com/search?
¡¡¡¡hl
=
en
&
q
=
urlparse
&
btnG
=
Google
+
Search
"
)
¡¡¡¡unparsedURL
=
urlparse.urlunparse((URLscheme, \
¡¡¡¡URLlocation, URLpath,
''
,
''
,
''
))
¡¡¡¡newURL
=
urlparse.urljoin(unparsedURL,
¡¡¡¡
"
/module-urllib2/request-objects.html
"
)
¡¡
¡¡º¯Êýurlparse(urlstring [, default_scheme [,
allow_fragments]])µÄ×÷ÓÃÊǽ«URL·Ö½â³É²»Í¬µÄ×é³É²¿·Ö£¬Ëü´ÓurlstringÖÐÈ¡µÃURL£¬²¢·µ»ØÔª×é (scheme,
netloc, path, parameters, query, fragment)¡£×¢Ò⣬·µ»ØµÄÕâ¸öÔª×é·Ç³£ÓÐÓã¬ÀýÈç¿ÉÒÔÓÃÀ´È·¶¨ÍøÂç
ÐÒé(HTTP¡¢FTPµÈµÈ )¡¢·þÎñÆ÷
µØÖ·¡¢Îļþ·¾¶£¬µÈµÈ¡£
¡¡
¡¡º¯Êýurlunparse(tuple)µÄ×÷ÓÃÊǽ«URLµÄ×é¼þ×°Åä³ÉÒ»¸öURL£¬Ëü½ÓÊÕÔª×é(scheme, netloc, path,
parameters, query, fragment)ºó£¬»áÖØÐÂ×é³ÉÒ»¸ö¾ßÓÐÕýÈ·¸ñʽµÄURL£¬ÒԱ㹩PythonµÄÆäËûHTML½âÎöÄ£¿éʹÓá£
¡¡
¡¡º¯Êýurljoin(base, url [, allow_fragments])
µÄ×÷ÓÃÊÇÆ´½ÓURL£¬ËüÒÔµÚÒ»¸ö²ÎÊý×÷ΪÆä»ùµØÖ·£¬È»ºóÓëµÚ¶þ¸ö²ÎÊýÖеÄÏà¶ÔµØÖ·Ïà½áºÏ×é³ÉÒ»¸ö¾ø¶ÔURLµØÖ·¡£º¯ÊýurljoinÔÚͨ¹ýΪURL»ùµØÖ·
¸½¼ÓеÄÎļþÃûµÄ·½Ê½À´´¦ÀíͬһλÖô¦µÄÈô¸ÉÎļþµÄʱºò¸ñÍâÓÐÓá£ÐèҪעÒâµÄÊÇ£¬Èç¹û»ùµØÖ·²¢·ÇÒÔ×Ö·û/½áβµÄ»°£¬ÄÇôURL»ùµØÖ·×îÓұ߲¿·Ö¾Í»á±»Õâ¸ö
Ïà¶Ô·¾¶ËùÌæ»»¡£±ÈÈ磬URLµÄ»ùµ
Ïà¹ØÎĵµ£º
PythonÖÐ×Ö·û´®±»¶¨ÒåΪÒýºÅÖ®¼äµÄ×Ö·û¼¯ºÏ¡£PythonÖ§³ÖʹÓóɶԵĵ¥ÒýºÅ»òË«ÒýºÅ£¬ÈýÒýºÅ°üº¬µÄ×Ö·û´®¡£
ʹÓÃË÷Òý²Ù×÷·û([])ºÍÇÐÆ¬²Ù×÷·û([:])¿ÉÒԵõ½×Ó×Ö·û´®¡£×Ö·û´®ÓÐÆäÌØÓеÄË÷Òý¹æÔò£ºµÚÒ»¸ö×Ö·ûµÄË÷ÒýÊÇ£°
£¬×îºóÒ»¸ö×Ö·ûµÄË÷ÒýÊÇ-1¡£
¼ÓºÅ(+)ÓÃÓÚ×Ö·û´®Á¬½ÓÔËË㣬ÐǺÅ(*)ÔòÓÃÓÚ×Ö·û´®Öظ´¡£ÈçÏÂÀý£º
pystr = " ......
http://blog.csdn.net/myan/archive/2008/01/07/2028545.aspx
http://blog.csdn.net/gashero/archive/2007/06/03/1636030.aspx
ÎҸоõ»¹ÊÇpythonÓ¦Óøü¹ãһЩ£¬RubyµÄRoR×öWeb¿ò¼ÜºÃһЩ°É£¬ÖÁÓÚperl£¬ÏÖÔڸоõʵÔÚÓм¸·Ö¿àɬ…… ......
PythonºÍRubyµÄ¶Ô±È£¬¾ÀÕýһЩÎó½â
ÏÂÃæÊÇÎÒÔÚ¿´Á½Æª¹ØÓÚPythonºÍRuby¶Ô±ÈµÄÎÄÕÂʱ£¬Ëù×÷µÄ¾ÀÕý£¬ÔÎͼÊǹ㷺Á÷Ðеģ¬±È½ÏºÃÕÒ¡£
------------------------------------------------------
¡¶rubyºÍpythonµÄ±È½Ï¡·¸üÕýÒ»µãÊÂÇé
1¡¢Îĵµ¡¢¿ªÔ´ÏîÄ¿¡¢¿âÖ§³Ö£¬ÕâЩ¶«Î÷Ruby²»Òª¸úPython±È£¬²»ÊǼ¸¸öÊýÁ¿¼¶µÄÎÊÌ⣬ºÎ±ØÃ²Ë ......
Ò»¡¢Ê²Ã´ÊÇPython£¿
¶ÔÓÚÕâÖÖÎÊÌâÎÒÃÇÀ´°Ù¶ÈһϾͿÉÒÔÁË¡£
“PythonÊÇÒ»ÖÖ¿ª·ÅÔ´´úÂëµÄ½Å±¾±à³ÌÓïÑÔ£¬ÕâÖֽű¾ÓïÑÔÌØ±ðÇ¿µ÷¿ª·¢ËٶȺʹúÂëµÄÇåÎú³Ì¶È¡£Ëü¿ÉÒÔÓÃÀ´¿ª·¢¸÷ÖÖ³ÌÐò£¬´Ó¼òµ¥µÄ½Å±¾ÈÎÎñµ½¸´Ôӵġ¢ÃæÏò¶ÔÏóµÄÓ¦ÓóÌÐò¶¼ÓдóÏÔÉíÊֵĵط½¡£Python»¹±»µ±×÷Ò»ÖÖÈëÃųÌÐòÔ±×îÊʺÏÕÆÎÕµÄÓÅÐãÓïÑÔ£¬ÒòΪËüÃâ·Ñ¡¢Ã ......
¹«Ë¾µÄ´úÀí¿ÉÒÔÖ±½Ó´©Ç½£¬×ÔÓÉ·ÃÎÊTwitter¡¢FacebookµÈÍøÕ¾£¬ÕâÁ½ÌìÑо¿ÁËÒ»ÏÂTwitterÌṩµÄAPI£¬ÓÃpythonдÁËÒ»¸ötwitter client£¬Ö»ÊµÏÖÁË»ù±¾¹¦ÄÜ£¬²é¿´×Ô¼ºµÄtwitterÏûÏ¢£¬Ò²¿ÉÒÔ²»ÑéÖ¤£¬²é¿´publicµÄtwitterÏûÏ¢¡£ÆäËû¹¦ÄÜʵÏÖÀàËÆ¡£Ö÷Òªº¯ÊýÈçÏ£º
def fetch_with_proxy(proxy, username, password, url):
&n ......