Machine Learning and Applications; Proceedings: International Conference on Machine Learning and Applications (6th: 2007: Cincinnati, Ohio)

IEEE Xplore (IEEE/IET Electronic Library - IEL) Available online

Format:
Book
Author/Creator:
Wani, M. Arif, author.
Language:
English
Subjects (All):
Machine learning--Congresses.
Machine learning.
Physical Description:
1 online resource
Place of Publication:
[Place of publication not identified] : IEEE Computer Society Press, 2007
Language Note:
English
Summary:
An optical character recognition (OCR) system with a high recognition rate is challenging to develop. One of the major contributors to OCR errors is smeared characters. Several factors, such as poor scanning quality and a poor binarization technique, lead to the smearing of characters. Typical approaches to character segmentation fall into three major categories: image-based, recognition-based, and holistic. Within each of these approaches, the segmentation path can be linear or non-linear. Our paper proposes a non-linear approach to segmenting characters in grayscale document images. Our method first uses general character features to determine whether characters are smeared together. The correct segmentation path is then found using a shortest-path approach. We achieved a segmentation accuracy of 95% over a set of about 2,000 smeared characters.
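The summary does not give the details of the shortest-path step, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a grayscale patch stored as a list of rows with values from 0 (ink) to 255 (background), a per-pixel cost of 255 minus the intensity, and a path that moves one row down at a time to the same or an adjacent column. The function name segmentation_path and all of these choices are hypothetical.

```python
def segmentation_path(gray):
    """Return one column index per row for a cheapest top-to-bottom cut.

    Illustrative assumptions (not taken from the paper):
    - gray is a list of rows of ints in 0..255, where 0 is black ink;
    - crossing a pixel costs 255 - intensity, so ink is expensive;
    - each step goes one row down to column c - 1, c, or c + 1.
    """
    rows, cols = len(gray), len(gray[0])
    cost = [[255 - v for v in row] for row in gray]

    dist = cost[0][:]   # cheapest cost to reach each pixel of the current row
    back = []           # back[r - 1][c] = column used in row r - 1 to reach (r, c)
    for r in range(1, rows):
        cur, ptr = [], []
        for c in range(cols):
            # Pick the cheapest reachable predecessor in the previous row.
            best, best_c = min(
                (dist[pc], pc) for pc in (c - 1, c, c + 1) if 0 <= pc < cols
            )
            cur.append(best + cost[r][c])
            ptr.append(best_c)
        dist = cur
        back.append(ptr)

    # Start from the cheapest pixel in the last row and walk back to the top.
    c = min(range(cols), key=lambda j: dist[j])
    path = [c]
    for ptr in reversed(back):
        c = ptr[c]
        path.append(c)
    path.reverse()
    return path


# A tiny patch in which the only ink-free cut bends around a smear.
patch = [
    [0, 255, 0, 0],
    [0, 0, 255, 0],
    [0, 255, 0, 0],
]
print(segmentation_path(patch))
```

Because the cut may shift by one column per row, it can bend around touching strokes, which is what distinguishes a non-linear segmentation path from a straight vertical one.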
Notes:
Bibliographic Level Mode of Issuance: Monograph
ISBN:
9781509089468
1509089462
