用js做爬虫帮朋友爬取图片。 - 代码天地

用js做爬虫帮朋友爬取图片。

编程语言 2023-04-07 13:31:30 阅读次数: 0

这个教程通俗易懂。

根据满哥学爬虫

需要下载axios yarn add axios -s
需要下载 cheerio yarn add cheerio -s
需要下载 express
更改 axios.get() 里面的链接就可以。
本文代码可以直接复制运行。

整体的逻辑基于这个页面

先用apifox 测试了一下接口，拿到了整体也页面的数据。
然后用cheerio去看也面的分页情况 .pagination 找到下面的a标签。
迭代获取a标签的内容然后存储起来判断有没有下一页递归调用函数。。
up主讲的详细。


const axios = require("axios");

const cheerio = require("cheerio");
const fs = require("fs");
const path = require("path")
// console.log(axios);
const urls = [];
const baseUrl = "https://www.jpmn5.com"
const nextText = "下一页"
let index = 0;
const getCosplay = async () => {
    
    
    console.log(index);
    const body = await axios.get(`https://www.jpmn5.com/Cosplay/Cosplay18126${
      
      index ? "_" + index : ""}.html`).then(async res => res.data);
    const $ = cheerio.load(body)

    const page = $(".pagination").eq(0).find("a");

    const pageArr = page.map(function () {
    
    
        return $(this).text()
    }).toArray()
    if (pageArr.includes(nextText)) {
    
    
        $(".article-content p img").each(function () {
    
    
            urls.push(baseUrl + $(this).attr("src"))
        })
        index++;
        await getCosplay()
    }
    // console.log(urls);
    writeFile(urls)
}

const writeFile = function (urls) {
    
    
    urls.forEach(async url => {
    
    
        console.log(url);
        const buffer = await axios.get(url, {
    
     responseType: "arraybuffer" }).then(res => res.data);
        const ws = fs.createWriteStream(path.join(__dirname, '../cos' + new Date().getTime() + ".jpg"))
        ws.write(buffer)
    });
}
getCosplay()
// console.log();

猜你喜欢

转载自blog.csdn.net/qq_43198727/article/details/127449371

用js做爬虫帮朋友爬取图片。

【快速上手】用Node.js做简单的图片爬取

用爬虫爬取某妹子图片网站图片

用kettle做爬虫(一)get请求爬取日期

用Python 爬虫爬取贴吧图片

python用爬虫爬取一张图片

【Python爬虫】之爬取页面内容、图片以及用selenium爬取

python爬虫－爬取图片

python爬虫爬取图片

爬虫--爬取图片（1）

Python爬虫：爬取图片

python爬虫 - 爬取图片

【爬虫】爬取网页图片

爬虫爬取图片练习

【python爬虫】—图片爬取

用Node.js爬取图片的踩坑日志

用HttpClient和用HttpURLConnection做爬虫发现爬取的代码少了的问题

用requests爬取图片

用Python爬取图片

用Scrapy帮妹子爬取王者皮肤海报~

帮朋友写一个爬取地区信息的脚本

网络爬虫之爬取图片

Python编程（一）--爬虫爬取图片

Python爬虫——爬取网站的图片

Python爬虫爬取相关图片

简易爬虫--360图片爬取

python爬虫的图片信息爬取

python网络爬虫，爬取图片信息

Python爬虫之——爬取妹子图片

node：爬虫爬取网页图片

今日推荐

Linus “吃狗粮”最积极！

开源日报 | Winamp播放器即将开源；生成式AI之战升级第二轮；Linus“吃狗粮”最积极；AI进入泡沫前期；吴泳铭为阿里云带来了什么？

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

周排行

SVN服务端安装在阿里云

实战 | 相机标定

webpack核心概念

note20——》只要肯低头吃苦，人生就会有救

PAT甲级 1062 Talent and Virtue （25 分）排序

NG Toolset开发笔记--5GNR Resource Grid（26）

如何对待上司

oracle命令

第9章 STL迭代器

logstash使用es映射模板

每日归档

更多

2024-05-20(36)

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)