【转】SEE: Towards Semi-Supervised End-to-End Scene Text Recognition - 代码天地

【转】SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

其他 2019-03-01 23:14:36 阅读次数: 0

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

@jxlxt 推荐

#Object Recognition

本文设计了一个端到端的半监督文本检测和识别模型，通过在 SVNH 和 FSNS 数据集上验证了该模型的 work。文章的模型不需要提供文本检测的 bounding box 只需要提供正确的 label，然后通过预测误差反向传播修正文本检测结果。

端到端的模型 loss 设计困难，通常识别只专注于文本检测或文本识别，但本文使用了 STN 来进行文本检测结合 ResNet 进行识别。先通过 STN 检测文本位置，输出特定区域的文本图片后再通过 CNN 识别文本。

▲ 论文模型：点击查看大图

640

论文链接

https://www.paperweekly.site/papers/2113

源码链接

https://github.com/Bartzi/see
---------------------
作者：Paper_weekly
来源：CSDN
原文：https://blog.csdn.net/c9yv2cf9i06k2a9e/article/details/81255972
版权声明：本文为博主原创文章，转载请附上博文链接！

猜你喜欢

转载自blog.csdn.net/Maisie_Nan/article/details/86506446

【转】SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

深度学习论文翻译解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Towards End-to-end

车牌识别--Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline

车牌识别--Towards End-to-End License Plate Detection and Recognition: 提供强大的数据集

ICCV 2017 《Towards End-to-End Text Spotting with Convolutional Recurrent Neural Network》论文笔记

【个人开源】论文复现SRN：Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

SRN: Towards Accurate Scene Text Recognition with Semantic Reasoning Networks ---论文阅读笔记

论文精读:End-to-End Semi-Supervised Object Detection with Soft Teacher

Tacotron: Towards End-to-End Speech Synthesis

E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text(论文解读)

《E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text》论文笔记

2021 目标检测知识蒸馏 SOTA：End-to-End Semi-Supervised Object Detection with Soft Teacher

EraseNet:End-to-End Text Removal in the wild

CBHG 模块来自TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS

Towards End-to-End Lane Detection: an Instance Segmentation Approach

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

【USE】《An End-to-End System for Automatic Urinary Particle Recognition with CNN》

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

（ICASSP 19）Streaming End-to-end Speech Recognition for Mobile Devices

Semi-supervised learning for Text Classification by Layer Partitioning

A semi-supervised graph-based approach for text classification and inference

ICCV2021-Soft Teacher-End-to-End Semi-Supervised Object Detection with Soft Teacher

Paddle的场景文字识别 (STR, Scene Text Recognition)

【文字识别】Scene Text Recognition With Finer Grid Rectification论文阅读

CVPR 2020-Scene Text Detection&Recognition

ReadLikeHumans: Autonomous,Bidirectional and Iterative Language Modeling for Scene Text Recognition

DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

论文笔记|Towards End-to-End Lane Detection: an Instance Segmentation

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)