270 Commits (17e95f2a7feb713fcacd61f9ade7b140d9587f87)

Author SHA1 Message Date
yihua.huang 17e95f2a7f comments 11 years ago
yihua.huang 05eb7831b6 refactor and comments #110 11 years ago
yihua.huang 375e64e845 more monitor status 11 years ago
yihua.huang 018061d2cd fix error in thread pool 11 years ago
yihua.huang cdc423f2bf log 11 years ago
yihua.huang c6661899fd new thread pool #110 11 years ago
yihua.huang 179baa7a22 return when page is null 11 years ago
yihua.huang 0336f4cdb4 remove IllegalStateException when download error for less error log 11 years ago
yihua.huang 11ba5beb42 [refactor]move monitor to webmagic-extension #98 11 years ago
yihua.huang d61f65cef8 update mbean to mxbean #98 11 years ago
yihua.huang ad6a273b12 update test url 11 years ago
yihua.huang 30af23d003 split monitor to server and client mode #98 11 years ago
yihua.huang ced79630d3 specify jndi and jmx #98 11 years ago
yihua.huang 95d3802e77 add formdata support for post request #108 11 years ago
yihua.huang f49bb877c8 clean some code #109 11 years ago
yihua.huang e1aaf1dd11 fix mistake of guava Table #109 11 years ago
yihua.huang 8ba2da146c request method #108 and more cookie #109 config 11 years ago
yihua.huang b06aa489fb [BugFix]Only one url from sourceRegion can be extracted #107 11 years ago
Bo LIANG 08fa3b01c1 when download error, throw an exception instead of calling onError and returning peacefully. #105 11 years ago
yihua.huang 27b37e8164 extension point and sample for JMX support #98 11 years ago
yihua.huang a5db6cf292 some monitor and JMX support #98 11 years ago
yihua.huang f39aa435cf add null check #104 11 years ago
yihua.huang 42bbe40a37 [Bugfix]Urls will be lost when call setScheduler() #104 11 years ago
Bo LIANG 163773af6b combine two try-catch block into one, make it cleaner. 11 years ago
yihua.huang ec446277b1 some refactor in httpclientdownloader 11 years ago
yihua.huang 4a035e729a extension point for LocalDuplicatedRemovedScheduler #95 11 years ago
yihua.huang b249e49748 [Bugfix]loop error when add TargetRequest #99 11 years ago
Yihua Huang da2f023c12 Merge pull request #96 from ouyanghuangzheng/master
Revised several comments in Spider and Site
11 years ago
yihua.huang f7950ebcab fix tests 11 years ago
愤怒的番茄 32ba1b8889 Fix several comment issues 11 years ago
yihua.huang 84b897f83b update AngularJSProcessor 11 years ago
yihua.huang 03c251237b add Json parse support 11 years ago
愤怒的番茄 644e8d1f72 Sync with the official source code 11 years ago
yihua.huang 969ad1766b change logger style to slf4j for cleaner code 11 years ago
yihua.huang 9b2cb43f47 ConfigurablePageProcessor #91 11 years ago
Bo LIANG b043ac76d6 change the formatter of log.
To use slf4j, we should insert {} into the formatter string.
11 years ago
yihua.huang 7aaf837e15 change logger to slf4j style for performance #84 11 years ago
yihua.huang f9b157951d Merge branch 'master' of github.com:code4craft/webmagic 11 years ago
yihua.huang 22c394e629 [doc] 11 years ago
Bo LIANG 762a3973fd Modify the log levels of LocalDuplicatedRemovedScheduler.java
The old version prints a debug-level log each time the push method is
called. So when an HTML page has multiple links to the same page, the
debug log appears more than once. It also prints a log whenever it meets
a duplicate URL, which is confusing.
I changed that level to trace. A debug-level log is now printed only when
a URL is actually pushed into the queue.
11 years ago
yihua.huang a1c7e826f7 fix dep of slf4j-log4j12 11 years ago
yihua.huang 01848301d4 encode illegal characters in url #80 11 years ago
yihua.huang 2780423e60 enable blank space in quotes in UrlUtils.fixAllRelativeHrefs #80 11 years ago
yihua.huang 97b6f46280 Bugfix: break loop in addTargetRequests #81 11 years ago
yihua.huang 8d8194bee4 Change HashMap to LinkedHashMap in ResultItems for same order of input and output #76 11 years ago
yihua.huang 8b35d79569 Do not cache document in Selectable for selected Html element #73 11 years ago
yihua.huang 6201fd6966 add worker as container 11 years ago
yihua.huang 6c11718566 Clean project structure #70 11 years ago
yihua.huang 9606a173cd fix ZipCodePageProcessor 11 years ago
yihua.huang 757cc9b942 [maven-release-plugin] prepare for next development iteration 11 years ago