• 検索結果がありません。

情景内文字の検出と認識

N/A
N/A
Protected

Academic year: 2021

シェア "情景内文字の検出と認識"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)

九州大学学術情報リポジトリ

Kyushu University Institutional Repository

情景内文字の検出と認識

黄, 栄

http://hdl.handle.net/2324/1398391

出版情報:Kyushu University, 2013, 博士(学術), 課程博士 バージョン:

権利関係:Public access to the fulltext file is restricted for unavoidable reason (2)

(2)

(別紙様式2)

氏 名 :黄 栄

論文題名 :

Character Localization and Recognition in Natural Scenes

情景内文字の検出と認識

区 分 :甲

論 文 内 容 の 要 旨

This thesis focuses on developing methods geared towards the applications of camera-based Optical Character Recognition (OCR), specifically, addressing problems of scene character localization and recognition. The realization of a high-performance camera-based OCR, which is the technology to read the texts captured by camera, will not only extend a new area of OCR but can also develop applications related to vision.

We survey the existing scene text localization and recognize works according to different features or methodologies in Chapter 2. Specifically, scene text localization methods are classified into seven categories: color/intensity-based, edge-based, stroke-based, keypoints-based, texture-based, other clues-based and hybrid. On the other hand, scene text recognition methods contain two categories:

OCR-based and machine learning-based. Moreover, benchmark datasets and evaluation criteria are also introduced.

In Chapter 3, we propose a cooperative multiple-hypothesis framework which consists of an image operator set module, an OCR module and an integration module. Multiple image operators activated by multiple parameters probe suspected character regions. The OCR module is then applied to each suspected region and returns multiple candidates with weight values for future integration. Without the aid of the heuristic rules which impose constraints on segmentation area, aspect ratio, color consistency, text line orientations, etc., the integration module automatically prunes the redundant localizations/recognitions and compensates the missing localizations/recognitions. The proposed framework bridges the gap between scene character localization and recognition, in the sense that a practical OCR engine is effectively leveraged for result refinement. In addition, the proposed method achieves the localization and recognition at the character level, which enables dealing with special

(3)

layouts such as single character, text along arbitrary orientations or text along curves.

In Chapter 4, a new edge-based method, called edge-ray filter, is proposed to localize scene characters.

Edges are extracted by a combination of Canny and Edge Preserving Smoothing Filter (EPSF). To effectively improve the performance of filtering out false alarms, we develop a new Edge Quasi-Connectivity Analysis (EQCA) to unify complex edges and broken contours of characters. Label Histogram Analysis (LHA) module, which can preserve stroke ray and remove redundant ones, is designed to localize scene characters. In the edge-ray filter, only two frequently-used heuristic rules, namely aspect ratio and occupation, are exploited to wipe off distinct false alarms. In addition to have the ability to handle special layouts, the proposed method can accommodate dark-on-bright and bright-on-dark characters simultaneously, and provides accurate character segmentation masks.

Conclusion and future works are drawn in Chapter 5.

参照

関連したドキュメント

This paper is devoted to the investigation of the global asymptotic stability properties of switched systems subject to internal constant point delays, while the matrices defining

In this paper, we focus on the existence and some properties of disease-free and endemic equilibrium points of a SVEIRS model subject to an eventual constant regular vaccination

This paper develops a recursion formula for the conditional moments of the area under the absolute value of Brownian bridge given the local time at 0.. The method of power series

Then it follows immediately from a suitable version of “Hensel’s Lemma” [cf., e.g., the argument of [4], Lemma 2.1] that S may be obtained, as the notation suggests, as the m A

Our method of proof can also be used to recover the rational homotopy of L K(2) S 0 as well as the chromatic splitting conjecture at primes p > 3 [16]; we only need to use the

Yin, “Global existence and blow-up phenomena for an integrable two-component Camassa-Holm shallow water system,” Journal of Differential Equations, vol.. Yin, “Global weak

We study the classical invariant theory of the B´ ezoutiant R(A, B) of a pair of binary forms A, B.. We also describe a ‘generic reduc- tion formula’ which recovers B from R(A, B)

For X-valued vector functions the Dinculeanu integral with respect to a σ-additive scalar measure on P (see Note 1) is the same as the Bochner integral and hence the Dinculeanu