Sinosoft Co. LTD
Website: www.sinosoftgroup.com
Address: No. 13, postal code 100080
Tel/Fax: 010 62638881, 010 62636710
Email: fangyi@sinosoftgroup.com
2.1. Internet Internet
2.2.
None 1 16 12032 KB 27200 KB 304896 KB
Calis 0 16 119896 KB 209360 KB 304896 KB
Running
/ : 2.75 (8.44)
KB/ : 38 (84)
4 min. and 11 sec.
50 of 50
20 MB / 12% (2051 of 16540)
0 16 2309 7, 50

2.3.
( ): 19 0
(24 ): 8
(24 ): 0

2.4. URL
Calis Abnormal exit from crawling URL
news-sina-com-cn Abnormal exit from crawling URL
: Calis
: calis
URL: http://www.sohu.com ( URL, URL, IP URL)
URL
http://www.kepu.net.cn/gb/basic/scigate/start/sta24.html
http://www.kepu.net.cn/gb/basic/scigate/nature/ntr005.html
http://www.kepu.net.cn/gb/basic/scigate/bio/bio005.html
http://192.168.1.66
http://www.kepu.net.cn/gb/basic/scigate/hitech/hit011.html
httpa;;a;sdfa
URL URL IP URL URL URL URL

2.5. Internet
2.6.
google 100 google2 100-200 google

2.7.
70 70 70 70 70 70 70 70 70 70
1 4205 :

2.8.

3.1.
3.2.
3.2.1
3.2.1.1
1
2 ( )
2 URL
3.2.1.2
1
2
3.2.1.3
1
2
3
4
5
6
7
3.2.1.4
1
2
3
4
5
3.2.2
3.2.2.1
1
2
3 URL
3.2.2.2
1
2
3
4
5
6
3.2.3
3.2.3.1
1
2
3
4
5
6
3.2.4
3.2.4.1
1
2
3
4
5
3.2.4.2
1 zip
2
3.2.4.3
1
4.1.
4.2
4.2.1
4.2.2
4.2.2.1
4.2.2.2
4.2.2.3
4.2.2.4
4.2.2.5
4.2.3
4.2.3.1
4.2.3.2
4.2.3.3