org.sonatype.ossoss-parent7us.codecraft0.5.0-SNAPSHOT4.0.0pomUTF-8UTF-84.0.0.RELEASEwebmagic-parentwebmagic-parent
A crawler framework. It covers the whole lifecycle of crawler: downloading, url management, content
extraction and persistent. It can simply the development of a specific crawler.
https://github.com/code4craft/webmagic/code4craftYihua huangcode4crafer@gmail.comyuanyLigang Yaoligang.yao@answers.comscm:git:git@github.com:code4craft/webmagic.gitscm:git:git@github.com:code4craft/webmagic.gitgit@github.com:code4craft/webmagic.gitHEADApache License, Version 2.0http://www.apache.org/licenses/LICENSE-2.0webmagic-corewebmagic-extension/webmagic-scripts/webmagic-seleniumwebmagic-saxonwebmagic-sampleswebmagic-avalonjunitjunit4.11testorg.apache.httpcomponentshttpclient4.3.3com.google.guavaguava15.0org.slf4jslf4j-api1.7.6org.slf4jslf4j-log4j121.7.6us.codecraftxsoup0.2.2com.alibabafastjson1.1.37log4jlog4j1.2.17org.assertjassertj-core1.5.0org.apache.commonscommons-lang33.1commons-collectionscommons-collections3.2.1org.apache.commonscommons-io1.3.2org.jsoupjsoup1.7.2org.mockitomockito-all1.9.5testorg.apache.maven.pluginsmaven-surefire-pluginpertest-Xms1024m -Xmx1024m -Xss1m org.apache.maven.pluginsmaven-compiler-plugin3.11.6UTF-8org.apache.maven.pluginsmaven-dependency-plugin2.8copy-dependenciespackagecopy-dependencies${project.build.directory}/libfalsefalsetrueorg.apache.maven.pluginsmaven-resources-plugin2.6UTF-8org.apache.maven.pluginsmaven-jar-pluginlog4j.xmlorg.apache.maven.pluginsmaven-source-plugin2.2.1attach-sourcesjarorg.apache.maven.pluginsmaven-javadoc-plugin2.9.1UTF-8attach-javadocsjarorg.apache.maven.pluginsmaven-release-plugin2.4.1release-sign-artifactsperformReleasetrueorg.apache.maven.pluginsmaven-gpg-plugin1.1sign-artifactsverifysign