Python编程：pypdf2和pdfplumber获取pdf文件的页数

其他 2019-01-03 00:29:31 阅读次数: 0

版权声明：本文为博主原创文章，欢迎转载，请注明出处 https://blog.csdn.net/mouday/article/details/85269745

pypdf2

安装

pip install pypdf2

代码实例

from PyPDF2 import PdfFileReader

filename = "test.pdf"
reader = PdfFileReader(filename)

# 不解密可能会报错：PyPDF2.utils.PdfReadError: File has not been decrypted
if reader.isEncrypted:
    reader.decrypt('')

page = reader.getNumPages()
print(page)

"""
如果加密是高版本的（3, 4），可能会报错
NotImplementedError: only algorithm code 1 and 2 are supported

原因是：
代码中有版本判断
if not (encrypt['/V'] in (1, 2)):
    raise NotImplementedError("only algorithm code 1 and 2 are supported")
"""

参考：
https://github.com/mstamy2/PyPDF2/issues/51#issuecomment-437839902

pdfplumber

安装

pip install pdfplumber

代码示例

import pdfplumber

filename = "test.pdf"
f = pdfplumber.open(filename)
print(len(f.pages))

就是那么简单，没有过多的繁琐操作，暂时没有发现其他莫名问题

参考
https://github.com/jsvine/pdfplumber

猜你喜欢

转载自blog.csdn.net/mouday/article/details/85269745

Python编程：pypdf2和pdfplumber获取pdf文件的页数

Python利用PyPDF2库获取PDF文件总页码

Python：使用pypdf2合并、分割、加密pdf文件。

通过Python的PyPDF2库合并多个pdf文件

Python应用【PDF处理-pypdf2】

pdf各种处理 PDF 的实用代码：PyPDF2、PDFMiner、pdfplumber

利用PyPDF2删除PDF文件首页

Python 深入浅出 - PyPDF2 处理 PDF 文件

实用代码Python（二）：使用PyPDF2融合多个PDF文件

python常用库自动化办公类 —— PyPDF2（处理pdf文件）

python将签名自动插入到PDF文件(PyPDF2)

【Python军火库】PyPDF2：操纵PDF的利器

通过Python的PyPDF2库提取pdf中的文字

通过Python的PyPDF2库提取pdf中的图片

python之PyPDF2:操作PDF文档示例详解

PyPDF2 合并PDF文档

python.pdf 利用python PyPDF2 实现pdf操作全集

Python之PyPDF2模块的使用

python3 集成PyPDF2

PyPDF2 pdf 文件写入提示如下错误:PyPDF2.utils.PdfReadError: Illegal character in Name Object

PyPDF2读取PDF文件内容保存到本地TXT

PyPDF2的使用

Python—遇到的问题，使用PyPDF2转化pdf时候遇到的各种问题。

利用 Python PyPDF2库轻松提取PDF文本（及其他高级操作）

PyPDF2读取文件只能得到‘\n’的问题

使用PyPDF2结合pdfminer拆分PDF，并提取关键字重命名拆分出来的文件

[转]PyPDF2详解

python 之 pip、pypdf2 安装与卸载

python常用库简单使用（ PyPDF2 ）

[python3] pypdf2 处理书签

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)