Ò׽ؽØÍ¼Èí¼þ¡¢µ¥Îļþ¡¢Ãâ°²×°¡¢´¿ÂÌÉ«¡¢½ö160KB

HTML ¸½Â¼£¨3£©URL·¾¶


 --------------
Ò»¡¢Ö÷»ú£¨HOST£©/·þÎñÆ÷£¨Server£©
һ̨´æÔÚÓÚÍøÂçÉϵļÆËã»ú£¬Èç¹ûͨ¹ýijÖÖÍøÂçЭÒ飨ÈçTCP/IPЭÒ飩½«×ÔÉíµÄ×ÊÔ´±©Â¶¸øÍøÂçÉÏµÄÆäËü»úÆ÷·ÃÎÊ£¬ÄÇôÕâЩ»úÆ÷¾Í×é³ÉÁËÖ÷»ú/¿Í»§»úģʽ»ò·þÎñÆ÷/¿Í»§»ú£¨C/S£©Ä£Ê½£¬ÕâÊÇĿǰ×î³£¼ûµÄÍøÂç·þÎñÌṩ·½Ê½¡£
 
IPµØÖ·
IPµØÖ·ÊÇËÄ×é10½øÖÆÊý×ÖÒÔ . ·Ö¸ô×é³ÉµÄÒ»¸ö±àºÅ£¬Ã¿×éÊý×ÖµÄȡֵ·¶Î§Îª0-255£¬ÀýÈç192.168.0.12¡£
IPµØÖ·µÄ×÷ÓÃÊÇÓÃÀ´±êÊ¶ÍøÂçÉϵÄһ̨É豸£¨Èçһ̨¼ÆËã»ú£©¡£Êý¾Ý¿ÉÒÔ¸ù¾ÝÕâ¸öµØÖ·ÕýÈ·µÄÔÚÉ豸֮¼äÏ໥´«µÝ¡£
ͼ1 IPµØÖ·Ê¾Òâͼ
×¢£º¹ØÓÚIPµØÖ·µÄÏêϸÄÚÈÝ£¨ÈçIPµØÖ·µÄ·ÖÀàºÍ¼ÆË㣩£¬Çë²Î¼ûרҵµÄ¼ÆËã»úÍøÂçÊé¼®
 --------------
¶þ¡¢ÓòÃû¡¢ÓòÃûÓ³É䣨DNS£©
ÓòÃû£ºIPÖ»ÊÇÒ»¸ö±àºÅ£¬±¾ÉíûÓÐÌØ±ðµÄÒâÒ壬±È½ÏÄÑÒÔ¼ÇÒ䣬ËùÒÔ»¥ÁªÍøÉÏÒ»°ã²»Ö±½ÓʹÓÃIPµØÖ·À´±êʶÖ÷»ú£¬¶ø²ÉÓÃÓòÃû×öÒ»´ÎÓ³Éä¡£
Ò»°ãÀ´Ëµ£¬ÓòÃûÓÉÈçϼ¸¸ö²¿·Ö×é³É£º
ÓòÃûÒÔwww¿ªÍ·£¬±íʾ¸ÃÓòÃûÓ³ÉäµÄÊÇһ̨ÍòÎ¬Íø£¨WorldWideWeb£©Ö÷»ú£»
ÓòÃûÒÔÒ»¸ö±êʶ¹ú¼Ò£¨µØÇø£©»ò×éÖ¯½á¹¹µÄºó׺½áÊø£¬ÀýÈ磺com±íʾÉÌÒµ»ú¹¹£¬org±íʾ¹Ù·½»ú¹¹£¬edu±íʾ½ÌÓý»ú¹¹£¬gov±íʾÕþ¸®»ú¹¹µÈ£¬ÀýÈ磺www.baidu.comΪ°Ù¶ÈÉÌÒµÍøÕ¾£¬¶øwww.whitehouse.govÔòÊǰ׹¬µÄÕþ¸®ÍøÕ¾£»
ÓòÃû×îºó»¹¿ÉÒÔ×·¼ÓÒ»¸ö±íʾ¹ú¼ÒµÄºó׺£¬ÀýÈ磺cn±íʾÖйú£¬jp±íʾÈÕ±¾£¬tw±íʾ̨ÍåµØÇø¡£Öм䲿·ÖÊÇÒ»¸öÈÎÒâ×Ö·û´®£¬¿ÉÒÔ´ú±íÈκκ¬Òå¡£ÕâÈý²¿·ÖʹÓÃ.·Ö¸ô¿ª£¬ÀýÈ磺www.sina.com.cn¾Í
±íʾÐÂÀË
ÍøÖйúÍøÕ¾£»
Ò»°ã°ÑÎÞ¹ú¼Òºó׺µÄÓòÃû£¨Èý²¿·Ö×é³É£©³ÆÎª¹ú¼Ê¶¥¼¶ÓòÃû£¬´øÓйú¼Òºó׺µÄÓòÃû£¨ËIJ¿·Ö×é³É£©³ÆÎª´Î¼¶ÓòÃû»ò¹ú¼ÒÓòÃû¡£ÔÚÎÒ¹ú£¬»¹ÓÐÒ»ÖÖÖ±½ÓÒÔcn½áÊøµÄÓòÃû£¬³ÆÎª¹ú¼Ò¶¥¼¶ÓòÃû£¬ÀýÈ磺www.taobao.cn£»
Ò»°ãÔÚÓòÃûǰ£¬»¹Òª¼ÓÉÏ·ÃÎʸÃÓòÃûËù±íʾÖ÷»úµÄͨѶЭÒéÃû£¬´Ó¶ø×é³ÉÁËÍêÕûµÄÓòÃû¸ñʽ£¬ÀýÈ磺http://www.google.cn±íʾÒÔhttpЭÒé·ÃÎÊwww.google.cnËù±íʾµÄÖ÷»ú£»
ÓòÃûÓ³É䣺ÔÚ»¥ÁªÍøÉÏ£¬ÓÐÒ»ÀàÌØÊâÓÃ;µÄÖ÷»ú£¬³ÆÎªDNS·þÎñÆ÷£¬ËüÃǵÄ×÷ÓþÍÊǰïÖúÍøÂçÉ豸ͨ¹ýÓòÃû²éÕÒIPµØÖ·¡£ÔÚDNSÖ÷»úÉÏ£¬¼Ç¼ÁËÊýÒÔÍò¼ÆµÄÓòÃûºÍIPµØÖ·µÄ¶ÔÓ¦£¬µ±Ò»Ì¨ÍøÂçÉ豸ÇëÇó²éѯһ¸öÓòÃûʱ£¬DNSÖ÷»ú¾Í»áÏòÕą̂É豸·µ»ØÒ»¸öIPµØÖ·£¬´Ó¶ø¿ÉÒÔͨ¹ýÓòÃû·ÃÎʵ½Õâ¸öIPµØÖ·Ëù¶ÔÓ¦µÄÖ÷»ú¡£
 --------------
Èý¡¢URL£¬Ïà¶Ô·¾¶¡¢¾ø¶Ô·¾¶
ΪÁËÈÃÍøÂçÉϵÄÉ豸¿ÉÒÔ·ÃÎÊÖ÷»úµÄ×ÊÔ´£¬Ö÷»úÒ»°ã¶¼¿ª·Å²¿·Ö×ÊÔ´×÷Ϊ¹«¹²×ÊÔ´£¬×


Ïà¹ØÎĵµ£º

ʹÓÃpython»ñÈ¡htmlÒ³ÃæµÄÄÚÈÝ

import urllib
from HTMLParser import HTMLParser
class TitleParser(HTMLParser):
def __init__(self):
self.title = ''
self.divcontent = ''
self.readingtitle = 0
self.readingdiv = 0
HTMLParser.__init__(self)
def handle_starttag(self, tag, attrs): ......

HTML ÖÐnoscriptµÄÓ÷¨

noscript ÔªËØÓÃÀ´¶¨ÒåÔڽű¾Î´±»Ö´ÐÐʱµÄÌæ´úÄÚÈÝ£¨Îı¾£©¡£´Ë±êÇ©¿É±»ÓÃÓÚ¿Éʶ±ð <script> ÔªËØÓÃÀ´¶¨ÒåÔڽű¾Î´±»Ö´ÐÐʱµÄÌæ´úÄÚÈÝ£¨Îı¾£©¡£ ±êÇ©µ«ÎÞ·¨Ö§³ÖÆäÖеĽű¾µÄä¯ÀÀÆ÷¡£Èç¹ûä¯ÀÀÆ÷Ö§³Ö½Å±¾£¬Ôò²»»áÏÔʾnoscript ±êÇ©µÄÄÚÈÝ¡£
noscript±êǩʹÓÃʾÀý£º
<html>
<head>
<meta http-equiv ......

Web¿ª·¢ µÚÒ»²¿·Ö HTML½Ì³Ì»ù´¡£¨¶þ£© head²¿·Ö

head±êÇ©ÑÝʾ´úÂ룺
×¢£º<!-- ºÍ -->Ö®¼äµÄÄÚÈÝΪHTML×¢ÊÍ¡£
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<!-- ÉèÖÃÒ³ÃæÎÄ×Ö±àÂë -->
<me ......

Web¿ª·¢ µÚÒ»²¿·Ö HTML½Ì³Ì»ù´¡£¨Èý£© body²¿·Ö

ÏÈ¿´ÈçÏ´úÂ룺
index.html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=utf-8" />
<title>bodyÔªËØ</title> ......
© 2009 ej38.com All Rights Reserved. ¹ØÓÚE½¡ÍøÁªÏµÎÒÃÇ | Õ¾µãµØÍ¼ | ¸ÓICP±¸09004571ºÅ