Commit Graph

527 Commits (b14f0ee47992e0456f05dd22363fb216e014a246)
 

Author SHA1 Message Date
yihua.huang b14f0ee479 fix jsonpath in AngularJSProcessor 11 years ago
yihua.huang 84b897f83b update AngularJSProcessor 11 years ago
yihua.huang 03c251237b add Json parse support 11 years ago
yihua.huang 843e928c2c comments on sinablogprocessor sample 11 years ago
yihua.huang be37d8b216 sinablogprocessor sample 11 years ago
yihua.huang 094f9d1552 rename assets for spell mistake 11 years ago
yihua.huang 2b023c95c2 qqmeishi demo 11 years ago
yihua.huang db65dfafb8 add baidunews sample 11 years ago
yihua.huang 3669e73e4a update News163: use Xsoup 0.2.0 syntax instead of ComboExtract 11 years ago
yihua.huang 02b441ad38 disable NativeObject in Rhino because it is a hotspot internal api and compile error in OpenJDK #93 11 years ago
yihua.huang 9f5a6494a0 add support for JDK6 #93 11 years ago
yihua.huang c6c56ad511 Merge branch 'master' of github.com:code4craft/webmagic 11 years ago
yihua.huang c2873928c8 [prototype] extractrule 11 years ago
Yihua Huang 7cb4e37812 Merge pull request #93 from friddle/master
update the script
11 years ago
friddle 933800147b update ruby 11 years ago
friddle 37666a7151 update the script 11 years ago
yihua.huang c1e7207869 add FileCacheQueueScheduler support for cycleRetryTimes 11 years ago
yihua.huang 969ad1766b change logger style to slf4j for cleaner code 11 years ago
yihua.huang 9b2cb43f47 ConfigurablePageProcessor #91 11 years ago
Yihua Huang 1090d070d9 Merge pull request #90 from ccliangbo/removeUnusedLines
Remove unused variable to make the project cleaner.
11 years ago
Bo LIANG 159eeea2f5 Remove unused variable to make the project cleaner. 11 years ago
yihua.huang c143fc662c add SubPageProcessor #86 11 years ago
Yihua Huang 2b2ce9ce13 Merge pull request #89 from ccliangbo/slf4jFormat
change the formatter of log.
11 years ago
Bo LIANG b043ac76d6 change the formatter of log.
To use slf4j, we should insert {} into the formatter string.
11 years ago
Yihua Huang 474f785dab Merge pull request #86 from sebastian1118/master
new feature: PatternProcessor
11 years ago
yihua.huang 8fe967ba8d [BugFix]exclude log4j.xml from maven jar plugin #82 11 years ago
Tian 38a12f8641 new feature: PatternProcessor 11 years ago
yihua.huang dafd0b5875 [BugFix]multi model in one pageprocessor will be skipped #85 11 years ago
yihua.huang 7aaf837e15 change logger to slf4j style for performance #84 11 years ago
yihua.huang f9b157951d Merge branch 'master' of github.com:code4craft/webmagic 11 years ago
yihua.huang 22c394e629 [doc] 11 years ago
Yihua Huang 3efa774191 Merge pull request #84 from ccliangbo/logInScheduler
Modify the log levels of LocalDuplicatedRemovedScheduler.java
11 years ago
Bo LIANG 762a3973fd Modify the log levels of LocalDuplicatedRemovedScheduler.java
The old version will print a debug level log each time the push method is
called. So sometimes, when a html page have multiple links for the same
page, the debug log will appears more than once. Also, when we meet a
duplicate URL, it will also print a log, which will be confusing.
I change the level of it to trace. And each time a URL is really push into
queue, print a debug level log.
11 years ago
yihua.huang 44293cd894 [doc]add qq group in readme 11 years ago
yihua.huang 9a0a4051ed [doc] ch3 part1 11 years ago
yihua.huang 7ca644cdd9 format readme 11 years ago
yihua.huang a1c7e826f7 fix dep of slf4j-log4j12 11 years ago
yihua.huang a34e92d11a fix huabanprocessor 11 years ago
yihua.huang 50cee4c7bb [doc] complete docs2.0 ch1 11 years ago
yihua.huang 9ec0ca02c6 doc2.0 ch1 11 years ago
yihua.huang 7e0e5b0969 clean ui 11 years ago
yihua.huang 94f97da4dc [Avalon] fix spring config for static and ignore google fonts for better loading speed 11 years ago
yihua.huang 22e8697671 add forger to folder 11 years ago
yihua.huang 05abd566a4 remove submodule 11 years ago
yihua.huang 01848301d4 encode illegal charactors in url #80 11 years ago
yihua.huang 2780423e60 enable blank space in quotes in UrlUtils.fixAllRelativeHrefs #80 11 years ago
yihua.huang 97b6f46280 Bugfix: break loop in addTargetRequests #81 11 years ago
yihua.huang d1563da33b add contributor 11 years ago
yihua.huang b13f1da039 reformat 11 years ago
yihua.huang 7038c00a9a reformat 11 years ago