01 UIPath crawl the web data and export Excel (non-Table Forms)

Last reprint an article "UIPath crawl the web data and export Excel" article, because that is the exported data table tag, it is relatively simple. Reality pages, there are many not by table showing the label, and then how to deal with it, and today I will introduce how to export this data page. This time we selected a site is: https: //www.51job.com/.

First, the preparation phase:

1, open website

2, in the search, enter the keyword you want to find, I find the conditions here are "Java Development Engineer + Chengdu" to enter the page as follows:

3, through F12, we can see that there is not data to show a table form.

So we want to change a way to get this type of data.

4, open uipath, create a new project Get51JobData.

Second, the core steps:

1. Click on the menu "data scraping" icon.

2. Click on "Next", marquee selection map portion.

Note: because it is not the form, it can not directly choose his party. Select a row and also may be a problem.

3, then enter a new interface:

In this step, we continue to click "Next."

4, another selection of the same row, note that in addition can not select another.

At this point the case is selected as follows:

5, "crawling" screen as follows:

Here modify the information according to their needs:

6, and then click "Next" to preview the data interface.

In this interface, if in doubt, you can click "Back" to modify it.

In this interface, the maximum number of results, can modify.

7, if only to extract these data, you can click "Finish" on it, here we have the relevant data are extracted, click the "present relevant data," continued extraction, the extract is the company name .

8, select the name of the company

When selected, there will be selected with a "job name" like interface,

We follow the above operation, click Next;

 

9, the second selection

10, the extraction wizard appears and choose a similar interface,

11, to amend the relevant information in the interface

12. Click "Next" to preview the interface again:

We can see, the company name has been extracted, because did not choose to extract URL, so there is no data related to the URL.

 

13, follow the steps above, continue to select "place of work", "salary", "Name" column and so on.

14, the preview data finally selected as follows:

Here, I extract 100. Then "OK."

15, pop-up prompts to obtain multi-faceted data:

Click to select "Yes", the next there is so much data in the page, the extracts enough data for you.

16, after the interface performs the following:

17, for gripping and several other related parameters, can be set to the right as shown below.

Third, other operations

1, write excel file, drag a "write" and controls

2, the relevant settings:

3, when setting excel write, write header set.

4, just finished running the project

 

5, the last generation of results:

 

Conclusion:

If everyone in the course of the operation, any problems, you can give me a message in the comments area. I can also focus on the micro-channel public number: IT human growth

Published 36 original articles · won praise 0 · views 90000 +

Guess you like

Origin blog.csdn.net/binjuny/article/details/104891121