[python crawler] Batch recognition of English in pdf, automatic translation into Chinese

Whether you are going to school or going to work, sometimes it is unavoidable to read English articles, especially when writing a graduation thesis. The more troublesome thing is to translate highly professional English pdf articles into Chinese. I remember when I was in school, I copied paragraph by paragraph, or Baidu translated words I didn’t know, which was very time-consuming. This article provides a method for batch recognition of pdf in Chinese and English, and the follow-up article will automatically convert pdf from English to Chinese, so stay tuned.


  

1. Install the pdfplumber library

  
To identify the content in pdf, you need to use the pdfplumber library, so first install the pdfplumber library, the general code is as follows:

pip install pdfplumber

But the following error will be reported during the installation process:
  
picture

I searched for information on the Internet and found that I can use the domestic mirror image for installation

国内的一些pip源:
阿里云 http://mirrors.aliyun.com/pypi/simpl

Guess you like

Origin blog.csdn.net/qq_32532663/article/details/132653169