网络爬虫种子长什么样

boonya

浏览: 801791 次
性别:
来自: 成都

最近访客更多访客>>

ileme

zhuhai189

燃烧丶胸毛

limengna845567

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

spider seed

因为网络爬虫是SEO的一部分故归类到SEO，以下是一些有用的网络爬虫种子，当然你也可以去找一些；
这是以前同事找的，感觉很有用跟大家分享一下：
1、天气内容
全天预报：http://www.weather.com.cn/data/cityinfo/{101020100}.html
实时天气：http://www.weather.com.cn/data/sk/{101270101}.html
6 天预报：http://m.weather.com.cn/data/{101210701}.html
注：{…….}部分为行政编码，如101270101为成都。以上url返回内容都是json格式

2、中国天气网城市编码
省级行政单位编码：http://www.weather.com.cn/data/citydata/china.html
地市级行政单位编码：http://www.weather.com.cn/data/citydata/district/{10101}.html
区县级行政单位编码：http://www.weather.com.cn/data/citydata/city/{1010100}.html
注：{…….}部分为行政编码，如101270101为成都。以上url返回内容都是json格式

3、新浪新闻
焦点新闻：http://rss.sina.com.cn/news/allnews/auto.xml
购车指导：http://rss.sina.com.cn/auto/guide/index.xml
行业动态：http://rss.sina.com.cn/auto/news/t/index.xml
汽车保养：http://rss.sina.com.cn/auto/servicing/index.xml
汽车用品：http://rss.sina.com.cn/auto/automotive/index.xml
注：以上url返回内容都是xml格式，它们并不是真正的新闻，而是RSS，是新闻列表。通过解析Rss内容，获取真正的新闻地址。

4、手机归属地
http://vip.showji.com/locating/?m={13550360786}&outfmt=json
注：{13550360786}部分为手机号码，outfmt参数用于指明返回的内容格式，此处是json

5、飞机票信息
http://jipiao.9588.com/Flight/FlightInfo?MoreTrip[0].fromcity=%s&MoreTrip[0].tocity=%s&MoreTrip[0].from=%s&MoreTrip[0].to=%s&MoreTrip[0].date=%s
注： %s部分为查询参数，依次对应为1)出发地中文名称、2)目的地中文名称、3)出发机场代码、4)目的地机场代码、5)出发日期。

分享到：