使用Jsoup爬取互联网信息 - 代码天地

使用Jsoup爬取互联网信息

编程语言 2018-05-14 10:36:04 阅读次数: 2

public static void parserURLsByPost(){
       try {
           Document doc = Jsoup.connect("http://search.51job.com/jobsearch/search_result.php?fromJs=1&jobarea=0000&district=0000&funtype=0000&industrytype=00&issuedate=9&providesalary=99&keyword=java&keywordtype=2&curr_page=1&lang=c&stype=1&postchannel=0000&workyear=99&cotype=99&degreefrom=99&jobterm=01&lonlat=0%2C0&radius=-1&ord_field=0&list_type=0&fromType=14").data("query", "Java")
           .userAgent("Mozilla")
           .cookie("auth", "token")
           .timeout(30000)
           .post();
           Elements link = doc.select("a");
           for (Element element : link) {
               Elements s=element.getElementsByAttributeValue("class", "jobname");
               for (Element element2 : s) {
                  String relHref= element2.attr("href");
                   System.out.println(element2.text());
                   System.out.println(relHref);
            }
              /* Element relSrc = element.attr("class", "jobname"); // == "/"
               if(relSrc.hasClass("jobname")){
               System.out.println(element.text());
               }
              // String linkHref = element.attr("href");
*/               //System.out.println(linkHref);
           }
           //String title = doc.title(); // == "/"
          // String absHref = link.attr("abs:href"); // "http://jsoup.org/"
           //System.out.println(title);
          
       } catch (IOException e) {
           // TODO Auto-generated catch block
           e.printStackTrace();
       }
   }

猜你喜欢

转载自yangfuchao418.iteye.com/blog/763074

使用Jsoup爬取互联网信息

爬虫第三课：互联网中网页信息的爬取

Jsoup爬取简单信息

scrapy 爬取豆瓣互联网图书

【Java爬虫】使用Jsoup爬取网页表格的分页信息

使用jsoup爬取网页信息，保存到txt中

使用Jsoup进行疫情数据爬取

使用Jsoup爬虫爬取相关图片

python爬取拉勾网互联网大数据职业情况

Python 爬取 4027 条脉脉职言，解读互联网人的苦与难！

如何爬取互联网地图矢量数据-POI、建筑、道路和路况

Java使用Jsoup包批量爬取智联招聘上招聘信息

【JAVA-爬虫】使用 Jsoup+HttpClient 爬取网页信息

使用HttpClient和Jsoup爬取京东手机信息案例

爬取图片 jsoup

jsoup 爬取电影

jsoup爬取图片

Jsoup 爬取文章

spider-java (Jsoup) (媒体信息的爬取)

Jsoup实现爬取多个网页的多条固定信息

jsoup爬取网站信息之《冰与火之歌》

jsoup爬取网站信息之《庆余年》

移动互联网是信息传播的载体

Google是如何搜集互联网信息

地球互联网的信息交易的度量

互联网设备信息：Censys

互联网时代无信息

java爬虫爬取互联网上的各大影视网站---360影视（附源码下载）

互联网信息服务（仅限互联网信息服务）

使用jsoup爬取所有成语

今日推荐

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

周排行

rbac——界面、权限

Apache CXF + SpringMVC 整合发布WebService

so插件化

Vue.js实战系列---图标字体制作（svg格式）

PAT乙级 1007 素数对猜想(孪生素数对) (20分) ---（C语言 + 详细注释）

被IRM保护的文档，打开失败

Calendar和Date计算日期差的小问题

win10子系统ubuntu18.4安装docker

利用Wrap Shell Script定位Android Native内存泄漏

MySQL: Transaction (Part I - Basic Concept)

每日归档

更多

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)