yihua.huang
|
6c11718566
|
Clean project structure #70
|
11 years ago |
yihua.huang
|
9606a173cd
|
fix ZipCodePageProcessor
|
11 years ago |
yihua.huang
|
757cc9b942
|
[maven-release-plugin] prepare for next development iteration
|
11 years ago |
yihua.huang
|
63ffb5c792
|
[maven-release-plugin] prepare release webmaigc-0.4.3
|
11 years ago |
yihua.huang
|
66d4d3c192
|
Merge branch 'master' into 0.4.x
|
11 years ago |
yihua.huang
|
af07280176
|
remove defensive code for httpclient 4.3.1 because it is fixed in 4.3.3 #59
|
11 years ago |
yihua.huang
|
d5a978e00f
|
update version back to 0.4.3
|
11 years ago |
yihua.huang
|
55368919df
|
add attribute 'text' support for CssSelector #66
|
11 years ago |
yihua.huang
|
88b50d4182
|
bugfix: cycleTry will not work when spawnUrl is set to false #62
|
11 years ago |
yihua.huang
|
2768a1cae4
|
add test for cycleTriedTimes and fix cycleTriedTimes inc error #60
|
11 years ago |
yihua.huang
|
bbd0d7e600
|
update httpclient version to 4.3.3 #59
|
11 years ago |
yihua.huang
|
571061454a
|
#58 add CYCLE_TRIED_TIMES support to QueueScheduler and PriorityScheduler
|
11 years ago |
yihua.huang
|
0e98183f74
|
Change log4j to slf4j #55
|
11 years ago |
yihua.huang
|
fa33b15843
|
property loader
|
11 years ago |
yihua.huang
|
af809c4d55
|
update version to 0.5.0-snapshot
|
11 years ago |
Almark Ming
|
2b46b11e55
|
Update RegexSelector.java
Optimize regex format check
Conflicts:
webmagic-core/src/main/java/us/codecraft/webmagic/selector/RegexSelector.java
|
11 years ago |
yihua.huang
|
b51fb2696b
|
update ut for cookie
|
11 years ago |
yihua.huang
|
ff2f588c41
|
#48 nullpointer exception
|
11 years ago |
yihua.huang
|
d274310cb2
|
[maven-release-plugin] prepare for next development iteration
|
11 years ago |
yihua.huang
|
e8c32a32dc
|
[maven-release-plugin] prepare release webmagic-0.4.2
|
11 years ago |
yihua.huang
|
6a828e923c
|
#46 Downloader thread hangs on timeout
|
11 years ago |
shijinping
|
9a524aa364
|
fetch the httpClient again inside the double-check
|
11 years ago |
yihua.huang
|
e7083dc39d
|
[maven-release-plugin] prepare for next development iteration
|
11 years ago |
yihua.huang
|
ae623567b3
|
[maven-release-plugin] prepare release webmagic-0.4.1
|
11 years ago |
yihua.huang
|
59ad4cad27
|
#42 Add jsonpath in annotation mode for json result
|
11 years ago |
yihua.huang
|
c2d6d495b3
|
#41 add getThreadAlive(), getStatus, getPageCount() to spider
|
11 years ago |
yihua.huang
|
cf62d707e0
|
#36 Spider does not exit when success
|
11 years ago |
yihua.huang
|
a01312930a
|
#39 Parsing html after page.getHtml()
|
11 years ago |
yihua.huang
|
f63d33b457
|
update some comments
|
11 years ago |
yihua.huang
|
04fcf3193f
|
#38 Change algorithm of SmartContentSelector
|
11 years ago |
yihua.huang
|
296a68920e
|
fix javadoc and add setPipelines() for spider
|
11 years ago |
yihua.huang
|
47a0360783
|
#35 add status code to page
|
12 years ago |
yihua.huang
|
bc5c30de17
|
update scripts
|
12 years ago |
yihua.huang
|
f9daae39cf
|
[maven-release-plugin] prepare for next development iteration
|
12 years ago |
yihua.huang
|
fdb9441519
|
[maven-release-plugin] prepare release webmagic-0.4.0
|
12 years ago |
yihua.huang
|
1d75ae7f5b
|
roll back version to 0.4.0 because the deploy did not succeed
|
12 years ago |
yihua.huang
|
df8ca8ad09
|
add scripts
|
12 years ago |
yihua.huang
|
775eb9732f
|
[maven-release-plugin] prepare for next development iteration
|
12 years ago |
yihua.huang
|
0b4fadc24d
|
[maven-release-plugin] prepare release webmagic-0.4.0
|
12 years ago |
yihua.huang
|
fe6d9bb2e2
|
get keep-alive rework
|
12 years ago |
yihua.huang
|
fd6d2fd6f8
|
try to keepalive TCP connection
|
12 years ago |
yihua.huang
|
425df08523
|
update version to 0.4.0
|
12 years ago |
yihua.huang
|
e046bb0723
|
remove useless code
|
12 years ago |
yihua.huang
|
6e32a19f80
|
update api for direct download
|
12 years ago |
yihua.huang
|
807aefe9df
|
change EntityUtil to IOUtil because of some encoding errors
|
12 years ago |
yihua.huang
|
00b0a751b4
|
#33 ignore 'content-encoding' when redirect
|
12 years ago |
yihua.huang
|
8f774afc84
|
add direct download
|
12 years ago |
yihua.huang
|
c18b603399
|
optimize long compare
|
12 years ago |
yihua.huang
|
ed3f3583cc
|
downloader refactor
|
12 years ago |
yihua.huang
|
a37f40e6e6
|
add cookie support
|
12 years ago |