【New function】Document Text Extraction: extract text from multiple document types

Document Text Extraction Function Introduction

Document Text Extraction is a built-in application of Jijian Cloud that extracts the text from a document of a given type. It supports extracted text of up to 10 MB and handles a range of document types, including PDF, DOCX, TXT, PPTX, HTML, and XLSX.

Scenario Example of Document Text Extraction Function

To find suitable candidates, companies usually recruit through several channels at once, such as BOSS Zhipin, 51Job, and Zhilian. After receiving an applicant's resume, the human resources department has to summarize and analyze it, give a hiring recommendation, and enter the information into a form system such as Mingdao Cloud, so that the departments that are hiring can view and use it.

An automated flow for this can be built in Jijian Cloud: the Jijian Cloud email trigger retrieves the files attached to an incoming email, ChatGPT summarizes and evaluates them, and the result is synchronized to Mingdao Cloud. The specific flow chart is as follows:

However, in this flow the step of converting files to text still has to be done manually. With a large number of resumes, transferring the information by hand is time-consuming, laborious, and error-prone. Whenever a piece of extracted information is inconsistent, it has to be checked again, which adds a great deal of work and causes unnecessary trouble.

To address this, Jijian Cloud has developed the Document Text Extraction function, which can extract the text from a large number of documents in a short time, improving efficiency, reducing error rates, and saving time and effort. The specific process is as follows:

Document Text Extraction Function Operation Steps

  • Step 1: Mail trigger - when mail is received

  • Step 2: Document Text Extraction - Extract the text in the document

  • Step 3: ChatGPT - chat with ChatGPT (native ChatGPT 3.5 Turbo model)

  • Step 4: Mingdao Cloud application - add data to the specified worksheet
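The four steps above are configured in the Jijian Cloud interface, not written as code. Purely as an illustration of how data moves through them, the sketch below models the flow in Python. Every name in it (`Email`, `run_resume_flow`, and the three callables) is hypothetical and stands in for the corresponding step; none of it is a Jijian Cloud, ChatGPT, or Mingdao Cloud API.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Email:
    """Hypothetical stand-in for the email that fires the trigger (step 1)."""
    sender: str
    attachment_urls: List[str]


def run_resume_flow(
    email: Email,
    extract_text: Callable[[str], str],         # step 2: file URL -> text
    summarize: Callable[[str], str],            # step 3: text -> summary
    add_row: Callable[[Dict[str, str]], None],  # step 4: write one worksheet row
) -> int:
    """Run steps 2 to 4 once per attachment and return how many were processed."""
    processed = 0
    for url in email.attachment_urls:
        text = extract_text(url)
        summary = summarize(text)
        add_row({"sender": email.sender, "file_url": url, "summary": summary})
        processed += 1
    return processed


if __name__ == "__main__":
    rows: List[Dict[str, str]] = []
    demo = Email("applicant@example.com", ["https://test.com/1.pdf"])
    count = run_resume_flow(
        demo,
        extract_text=lambda url: f"text of {url}",  # placeholder for step 2
        summarize=lambda text: text.upper(),        # placeholder for step 3
        add_row=rows.append,                        # placeholder for step 4
    )
    print(count, rows)
```

Passing the three steps in as callables keeps the sketch independent of any particular service: each placeholder can be swapped for a real integration without changing the loop.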

Step 2 (Document Text Extraction) is configured as follows:

(1) Select the application and action to execute: Document Text Extraction - Extract the text in the document

(2) Field matching

  • Document type: select one of PDF, DOCX, TXT, PPTX, HTML, XLSX, etc. Example: PDF

  • File URL: insert a variable. (A file URL usually contains the file name and path and ends with the file type's extension. Example: https://test.com/1.pdf)

(3) Click to send the data

(4) A piece of sample data is returned
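Because a file URL normally ends with the file type's extension, the "Document type" field can in principle be derived from the "File URL" field. The Python sketch below shows one way to do that with the standard library. The function names, the dictionary keys, and the `SUPPORTED_TYPES` set are assumptions made for illustration; the set lists only the types named above, and the application may accept others.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Only the types named in this article; treat the set as an assumption.
SUPPORTED_TYPES = {"PDF", "DOCX", "TXT", "PPTX", "HTML", "XLSX"}


def document_type_from_url(file_url: str) -> str:
    """Return the upper-case document type implied by the URL's extension."""
    path = urlparse(file_url).path            # drops any query string or fragment
    suffix = PurePosixPath(path).suffix       # e.g. ".pdf", or "" if there is none
    doc_type = suffix.lstrip(".").upper()
    if doc_type not in SUPPORTED_TYPES:
        raise ValueError(f"unsupported or missing extension in URL: {file_url!r}")
    return doc_type


def build_fields(file_url: str) -> dict:
    """Assemble the two matched fields (hypothetical key names)."""
    return {
        "document_type": document_type_from_url(file_url),
        "file_url": file_url,
    }


if __name__ == "__main__":
    print(build_fields("https://test.com/1.pdf"))
    # {'document_type': 'PDF', 'file_url': 'https://test.com/1.pdf'}
```

A URL without a recognizable extension raises `ValueError` instead of guessing, so a mismatched file is caught before the extraction step runs.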

More scenario examples for the Document Text Extraction function

● Email system + document text extraction + OA system: Whenever an employee receives an email with an attached document, the Document Text Extraction application automatically extracts the required text from the document, and a message notification is sent through an OA system such as WeChat or DingTalk.

● Form system + document text extraction + form system: An enterprise form system often stores a large number of resumes in PDF, DOCX, and other formats. The Document Text Extraction application can automatically extract the candidates' text information and write it back to the form system as a summary, which makes it easier for staff to review.


Origin blog.csdn.net/weixin_49220773/article/details/129957757