爬虫 用java实现一个简易爬取网页超链接的程序

在这里插入图片描述
`在这里插入图片描述
爬取结果截取部分

<a href="http://news.163.com/special/2019qglh/" class="zt_link" target="_blank" title="2019全国两会">2019全国两会_网易新闻_网易网</a>
<a class="ntes-nav-index-title ntes-nav-entry-wide c-fl" href="http://www.163.com/" title="网易首页">网易首页</a>
<a href="http://www.163.com/#f=topnav" class="ntes-nav-select-title ntes-nav-entry-bgblack JS_NTES_LOG_FE" data-module-name="n_topnavapp">应用<em class="ntes-nav-select-arr"></em></a>
<a href="http://m.163.com/newsapp/#f=topnav"><span><em class="ntes-nav-app-newsapp">网易新闻</em></span></a>
<a href="http://open.163.com/#f=topnav"><span><em class="ntes-nav-app-open">网易公开课</em></span></a>
<a href="http://live.ent.163.com/download?f=163.homeNavApp"><span><em class="ntes-nav-app-bohe">薄荷直播</em></span></a>
<a href="https://hongcai.163.com/?from=pcsy-button"><span><em class="ntes-nav-app-hongcai">网易红彩</em></span></a>
<a href="http://u.163.com/aosoutbdbd8"><span><em class="ntes-nav-app-yanxuan">网易严选</em></span></a>
<a href="http://mail.163.com/client/dl.html?from=mail46"><span><em class="ntes-nav-app-mail">邮箱大师</em></span></a>
<a href="http://study.163.com/client/download.htm?from=163app&utm_source=163.com&utm_medium=web_app&utm_campaign=business"><span><em class="ntes-nav-app-study">网易云课堂</em></span></a>
<a href="https://app.kaola.com"><span><em class="ntes-nav-app-kaola-hg">网易考拉</em></span></a>
<a href="https://news.163.com">新闻</a>
<a href="http://news.163.com/domestic">国内</a>
<a href="http://news.163.com/world">国际</a>
<a href="http://news.163.com/photo">图片</a>
<a href="http://view.163.com">评论</a>
<a href="http://discovery.163.com">探索</a>
<a href="http://war.163.com">军事</a>
<a href="http://news.163.com/localnews/">本地新闻</a>
<a href="http://news.163.com/special/wangsansanhome/">王三三</a>
<a href="http://sports.163.com">体育</a>
<a href="http://sports.163.com/nba">NBA</a>
<a href="http://sports.163.com/cba">CBA</a>
<a href="http://sports.163.com/allsports">综合</a>

猜你喜欢

转载自blog.csdn.net/weixin_43752167/article/details/88230151