You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
webmagic/zh_docs/us/codecraft/webmagic/Request-cmnt.xml

57 lines
1.7 KiB
XML

This file contains ambiguous Unicode characters!

This file contains ambiguous Unicode characters that may be confused with others in your current locale. If your use case is intentional and legitimate, you can safely ignore this warning. Use the Escape button to highlight these characters.

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<javadoc>
<meta>
<date-generated>Sat Aug 17 14:14:45 CST 2013</date-generated>
</meta>
<comment>
<key><![CDATA[us.codecraft.webmagic.Request]]></key>
<data><![CDATA[ <div class="zh">
Request对象封装了待抓取的url信息。<br/>
在PageProcessor中Request对象可以通过{@link us.codecraft.webmagic.Page#getRequest()} 获取。<br/>
<br/>
Request对象包含一个extra属性可以写入一些必须的上下文这个特性在某些场合会有用。<br/>
<pre>
Example:
抓取<a href="${link}">${linktext}</a>时希望提取链接link并保存linktext的信息。
在上一个页面:
public void process(Page page){
Request request = new Request(link,linktext);
page.addTargetRequest(request)
}
在下一个页面:
public void process(Page page){
String linktext = (String)page.getRequest().getExtra()[0];
}
</pre>
</div>
@author code4crafter@gmail.com <br>
Date: 13-4-21
Time: 上午11:37
]]></data>
</comment>
<comment>
<key><![CDATA[us.codecraft.webmagic.Request(java.lang.String)]]></key>
<data><![CDATA[ 构建一个request对象
@param url 必须参数待抓取的url
]]></data>
</comment>
<comment>
<key><![CDATA[us.codecraft.webmagic.Request.setPriority(double)]]></key>
<data><![CDATA[ 设置优先级用于URL队列排序<br>
需扩展Scheduler<br>
目前还没有对应支持优先级的Scheduler实现 =。= <br>
@param priority 优先级,越大则越靠前
@return this
]]></data>
</comment>
<comment>
<key><![CDATA[us.codecraft.webmagic.Request.getUrl()]]></key>
<data><![CDATA[ 获取待抓取的url
@return url 待抓取的url
]]></data>
</comment>
</javadoc>