java爬取网站信息(js,css,html)

新建一个普通的maven项目 加入maven依赖即可

org.apache.httpcomponents httpclient 4.5.8

public class JavaProject {
public static void main(String[] args) {
//1.生成httpclient,相当于该打开一个浏览器
CloseableHttpClient httpClient = HttpClients.createDefault();
CloseableHttpResponse response = null;
//2.创建get请求,相当于在浏览器地址栏输入 网址https://cn.made-in-china.com/showroom/gonyedmm
HttpGet request = new HttpGet(“https://cn.made-in-china.com/script/pop/jquery.micpop.js”);
try {
//3.执行get请求,相当于在输入地址栏后敲回车键
response = httpClient.execute(request);

        //4.判断响应状态为200,进行处理
        if(response.getStatusLine().getStatusCode() == HttpStatus.SC_OK) {
            //5.获取响应内容
            HttpEntity httpEntity = response.getEntity();
            String html = EntityUtils.toString(httpEntity, "utf-8");
            System.out.println(html);
        } else {
            //如果返回状态不是200,比如404(页面不存在)等,根据情况做处理,这里略
            System.out.println("返回状态不是200");
            System.out.println(EntityUtils.toString(response.getEntity(), "utf-8"));
        }
    } catch (ClientProtocolException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    } finally {
        //6.关闭
        HttpClientUtils.closeQuietly(response);
        HttpClientUtils.closeQuietly(httpClient);
    }
}

}

发布了104 篇原创文章 · 获赞 13 · 访问量 8万+

猜你喜欢

转载自blog.csdn.net/Liutt55/article/details/102538785
今日推荐