Commit Graph

556 Commits (f49bb877c858617fa74cfd2c464adbfe616e45de)
 

Author SHA1 Message Date
yihua.huang f49bb877c8 clean some code
yihua.huang e1aaf1dd11 fix mistake of guava Table
yihua.huang 8ba2da146c request method and more cookie config
yihua.huang b06aa489fb [BugFix]Only one url from sourceRegion can be extracted
Bo LIANG 08fa3b01c1 when download error, throw an exception instead of calling onError and returning peacefully.
yihua.huang 023c2ac84e spider config draft
yihua.huang 27b37e8164 extension point and sample for JMX support
yihua.huang a5db6cf292 some monitor and JMX support
yihua.huang f39aa435cf add null check
yihua.huang 42bbe40a37 [Bugfix]Urls will be lost when call setScheduler()
yihua.huang aae1ab2cd6 fix compile error
yihua.huang bc8d0220eb Merge branch 'master' of github.com:code4craft/webmagic
yihua.huang 1fbfc92de2 Inherit support of Field annotation in Model
Yihua Huang 93c4a2afb7 Merge pull request from ccliangbo/waitNewUrl
combine two try-catch block into one, make it cleaner.
Bo LIANG 163773af6b combine two try-catch block into one, make it cleaner.
yihua.huang c8014a9ae6 update readme
yihua.huang ec446277b1 some refactor in httpclientdownloader
yihua.huang 4a035e729a extension point for LocalDuplicatedRemovedScheduler
yihua.huang b249e49748 [Bugfix]loop error when add TargetRequest
yihua.huang 3a79b1b64a [Bugfix]formatter property does not work when field is String#100
Yihua Huang cc9d319fd9 Merge pull request from sebastian1118/master
update:PatternHandler
Yihua Huang da2f023c12 Merge pull request from ouyanghuangzheng/master
修改了Spider 和site  几处注释
yihua.huang f7950ebcab fix tests
yihua.huang b14f0ee479 fix jsonpath in AngularJSProcessor
愤怒的番茄 32ba1b8889 修复几处注释问题
yihua.huang 84b897f83b update AngularJSProcessor
yihua.huang 03c251237b add Json parse support
Tian 99e12aafaa update:PatternHandler
愤怒的番茄 53184f0390 test
愤怒的番茄 644e8d1f72 同步官方源码
愤怒的番茄 610ac42c07 更新
愤怒的番茄 5b254e446b 更新
yihua.huang 843e928c2c comments on sinablogprocessor sample
yihua.huang be37d8b216 sinablogprocessor sample
yihua.huang 094f9d1552 rename assets for spell mistake
yihua.huang 2b023c95c2 qqmeishi demo
yihua.huang db65dfafb8 add baidunews sample
yihua.huang 3669e73e4a update News163: use Xsoup 0.2.0 syntax instead of ComboExtract
yihua.huang 02b441ad38 disable NativeObject in Rhino because it is a hotspot internal api and compile error in OpenJDK
yihua.huang 9f5a6494a0 add support for JDK6
yihua.huang c6c56ad511 Merge branch 'master' of github.com:code4craft/webmagic
yihua.huang c2873928c8 [prototype] extractrule
Yihua Huang 7cb4e37812 Merge pull request from friddle/master
update the script
friddle 933800147b update ruby
friddle 37666a7151 update the script
yihua.huang c1e7207869 add FileCacheQueueScheduler support for cycleRetryTimes
yihua.huang 969ad1766b change logger style to slf4j for cleaner code
yihua.huang 9b2cb43f47 ConfigurablePageProcessor
Yihua Huang 1090d070d9 Merge pull request from ccliangbo/removeUnusedLines
Remove unused variable to make the project cleaner.
Bo LIANG 159eeea2f5 Remove unused variable to make the project cleaner.