Getting Started with Java Web Crawlers - Code World

Others 2021-11-28 20:09:39

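Compared with C# or Java, Python makes writing crawlers more convenient and concise. First, Python's urllib2 package (urllib.request in Python 3) provides a fairly complete API for fetching web documents. Second, Python's BeautifulSoup library offers concise tools for processing the pages that are pulled down. Together these give Python its edge for crawling.
That said, today I'd like to share a Java crawler series, which I am fond of even though it is less convenient to use.
I: Add the dependency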

    <dependency>
        <groupId>org.apache.httpcomponents</groupId>
        <artifactId>httpclient</artifactId>
        <version>4.5.11</version>
    </dependency>

II: An entry-level demo in just four steps
1) Create the HttpClient object:

 CloseableHttpClient httpClient = HttpClients.createDefault();

2) Prepare the GET request by creating an HttpGet object:

 HttpGet httpGet = new HttpGet("https://www.zhipin.com/wuhan/");

3) Use the HttpClient object to send the request:

 CloseableHttpResponse response = httpClient.execute(httpGet);

4) Parse the response and return the data. A status code of 200 indicates success:

 if (response.getStatusLine().getStatusCode() == 200) {
     // Read the response body as a UTF-8 string and print it
     HttpEntity httpEntity = response.getEntity();
     String content = EntityUtils.toString(httpEntity, "UTF-8");
     System.out.println(content);
 }
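
The four steps above can be sketched end to end as a single class. Note that this sketch swaps in the JDK's built-in java.net.http.HttpClient (Java 11 or later) instead of Apache HttpClient, so that it runs without any extra dependency. The class name StaticPageFetcher and the method fetch are hypothetical names chosen for illustration, and treating every non-200 status as an error is an assumption that mirrors the check in step 4.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class: the same four steps, using the JDK HTTP client
public class StaticPageFetcher {

    /**
     * Fetches a page and returns its body decoded as UTF-8.
     * Throws IOException for any status code other than 200.
     */
    public static String fetch(String url) throws IOException, InterruptedException {
        // 1) Create the client (JDK counterpart of HttpClients.createDefault())
        HttpClient client = HttpClient.newHttpClient();

        // 2) Build the GET request (JDK counterpart of new HttpGet(url))
        HttpRequest request = HttpRequest.newBuilder(URI.create(url)).GET().build();

        // 3) Send the request and read the body as UTF-8 text
        HttpResponse<String> response =
                client.send(request, HttpResponse.BodyHandlers.ofString(StandardCharsets.UTF_8));

        // 4) Only a 200 status is treated as success, as in the demo above
        if (response.statusCode() != 200) {
            throw new IOException("Unexpected status code: " + response.statusCode());
        }
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        // URL taken from the demo above; any static page would do
        System.out.println(fetch("https://www.zhipin.com/wuhan/"));
    }
}
```

With the Apache library, the same structure applies; in that case the client and the response are both Closeable, so wrapping them in try-with-resources is a reasonable way to release connections.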

III: As the figure shows, the content of the static page has been fetched.
[Figure: output of the fetched static page]
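
Once the HTML is available as a string, a natural next step is to pull something out of it. The following is a minimal sketch, assuming the goal is just the page title; the class TitleExtractor and the method extractTitle are hypothetical names, and the sample HTML is made up for illustration. A regular expression is enough for this one tag, though a dedicated HTML parser such as jsoup is likely to be more robust on real pages.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: extracts the <title> text from an HTML string
public class TitleExtractor {

    // CASE_INSENSITIVE matches <TITLE> too; DOTALL lets the title span several lines
    private static final Pattern TITLE =
            Pattern.compile("<title[^>]*>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    /** Returns the trimmed title text, or an empty Optional if no title tag is found. */
    public static Optional<String> extractTitle(String html) {
        Matcher m = TITLE.matcher(html);
        return m.find() ? Optional.of(m.group(1).trim()) : Optional.empty();
    }

    public static void main(String[] args) {
        // Made-up sample input standing in for the fetched page content
        String html = "<html><head><title> Demo Page </title></head><body></body></html>";
        System.out.println(extractTitle(html).orElse("(no title)")); // prints "Demo Page"
    }
}
```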

Origin blog.csdn.net/qq_35529931/article/details/104869312
