us.codecraft 0.2.0 4.0.0 pom webmagic webmagic A crawler framework. It covers the whole lifecycle of crawler: downloading, url management, content extraction and persistent. It can simply the development of a specific crawler. https://github.com/code4craft/webmagic/ code4craft Yihua huang code4crafer@gmail.com scm:git:git@github.com:code4craft/webmagic.git scm:git:git@github.com:code4craft/webmagic.git git@github.com:code4craft/webmagic.git Apache License, Version 2.0 http://www.apache.org/licenses/LICENSE-2.0 webmagic-core webmagic-extension/ webmagic-samples junit junit 4.7 test org.apache.httpcomponents httpclient 4.2.4 net.sf.saxon Saxon-HE 9.5.1-1 log4j log4j 1.2.17 org.apache.commons commons-lang3 3.1 commons-collections commons-collections 3.2.1 net.sourceforge.htmlcleaner htmlcleaner 2.5 org.apache.commons commons-io 1.3.2 org.jsoup jsoup 1.7.2 org.apache.maven.plugins maven-compiler-plugin 3.1 1.6 1.6 UTF-8 org.apache.maven.plugins maven-dependency-plugin 2.8 copy-dependencies package copy-dependencies ${project.build.directory}/lib false false true org.apache.maven.plugins maven-resources-plugin 2.6 UTF-8 org.apache.maven.plugins maven-source-plugin 2.2.1 attach-sources jar org.apache.maven.plugins maven-javadoc-plugin 2.9.1 UTF-8 attach-javadocs jar org.apache.maven.plugins maven-release-plugin 2.4.1