Writing an image crawler in Swift with ScrapeKit

The following is an image crawler written in Swift using ScrapeKit. It also takes the address of a proxy API from which a proxy could be obtained.

import Foundation
import ScrapeKit

// Crawls image URLs from a page. Note: the HTML-parsing calls below
// (HTML(html:encoding:), css(_:), attr(_:)) follow a Kanna-style API;
// adjust them to whatever parser your ScrapeKit version actually exposes.
class PeopleImageCrawler: NSObject, ScrapeKit.Crawler {
    let url: URL
    let proxyUrl: URL

    init(url: URL, proxyUrl: URL) {
        self.url = url
        self.proxyUrl = proxyUrl
        super.init()
    }

    func crawl() -> [String: Any] {
        var images = [String]()

        // try? yields optionals, so unwrap the page source before
        // handing it to the parser instead of passing a String? through.
        guard let html = try? String(contentsOf: url, encoding: .utf8),
              let doc = try? HTML(html: html, encoding: .utf8) else {
            return ["images": images]
        }

        // Collect the lazy-loading data-src attribute of every <img> tag.
        for imgElem in doc.css("img") {
            if let imgUrl = imgElem.attr("data-src") {
                images.append(imgUrl)
            }
        }

        return ["images": images]
    }
}

let targetUrl = URL(string: "https://www.people.com.cn")!
// Proxy API endpoint; note that crawl() does not route traffic through it yet.
let proxyUrl = URL(string: "https://www.duoip.cn/get_proxy")!

let crawler = PeopleImageCrawler(url: targetUrl, proxyUrl: proxyUrl)
let result = crawler.crawl()

print(result)

This program first imports the ScrapeKit library and then defines a class called PeopleImageCrawler, which conforms to the ScrapeKit.Crawler protocol. An initializer takes the target URL and the proxy URL. In the crawl method, we parse the HTML document and find all <img> tags; for each one found, we append the value of its data-src attribute to the images array. Finally, we return the images array under the "images" key of a dictionary.
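The extraction step can also be illustrated without any HTML-parsing library. The sketch below uses only Foundation's NSRegularExpression to pull data-src values out of a raw HTML string; it is a simplified stand-in for what crawl() does, not production-grade HTML parsing (a regex will miss attributes quoted differently or split across lines):

```swift
import Foundation

// Extract data-src attribute values from <img> tags with a regular
// expression -- a parser-free sketch of the extraction step in crawl().
func imageUrls(in html: String) -> [String] {
    let pattern = "<img[^>]*data-src=\"([^\"]*)\""
    guard let regex = try? NSRegularExpression(pattern: pattern) else {
        return []
    }
    let range = NSRange(html.startIndex..., in: html)
    return regex.matches(in: html, range: range).compactMap { match in
        // Capture group 1 holds the attribute value.
        Range(match.range(at: 1), in: html).map { String(html[$0]) }
    }
}

let sample = "<p><img data-src=\"https://example.com/a.jpg\"></p>"
print(imageUrls(in: sample))  // ["https://example.com/a.jpg"]
```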

At the top level of the script, we create a targetUrl and a proxyUrl and instantiate a PeopleImageCrawler with them. We then call crawler.crawl() to start crawling and print the result. Note that the sample stores proxyUrl but never actually routes the request through the proxy.

Origin blog.csdn.net/weixin_73725158/article/details/133981306