爬虫提取非结构化数据

pdf:pdfBox解析pdf文档

word:poi

rtf:rtfconverter4j

excel:jxl,poi,数据库访问jsqlparser

powerpoint:poi

图片:javax.imageio.Imageio

        二值化:

猜你喜欢

转载自www.cnblogs.com/davidwang456/p/8709351.html