Preview

Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in Ocr

Powerful Essays
Open Document
Open Document
2858 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in Ocr
Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in OCR

Srivardhini Mandipati, Gottumukkala Asisha, Preethi Raj S, and Chitrakala S

Department of Computer Science and Engineering, Easwari Engineering College, Chennai, India

Abstract. Optical Character Recognition converts text in images into a form that the computer can manipulate. The need for faster OCRs stems from the abundance of such text. This paper presents a Two-Stage Rejection Algorithm for reducing the search space of an OCR. It is tacit that the reduction in search space expedites an OCR. Preprocessing operations are applied on the input and features are extracted from them. These feature vectors are clustered and the Two-Stage Rejection Algorithm is applied for character recognition. With about the same character recognition rate as other OCRs, an OCR reinforced with the Two-Stage Rejection Algorithm is considerably faster.

Keywords: Optical Character Recognition, Feature Extraction, K-means.
1 Introduction Optical character recognition has been an active area of research for many decades. The fact that OCRs have the potential to simplify data entry in the future adds value to research in this area. OCRs use various pattern matching techniques for character recognition. Most OCRs typically use classifiers like SVM or neural networks for character recognition. The training process for these classifiers is time consuming. Moreover, with an increase in the number of classes, the comparisons made increases and consequently the time taken for character recognition increases. Hence, they cannot be easily extended to recognize characters from additional languages. The proposed system uses a structural approach as opposed to statistical approach for feature extraction. The strength of the structural method over the statistical one is its representation of a pattern that is similar to the way human perceive it. The structural features help



References: [1]GWeijie Su, Xin Jin, “Hidden Markov Model with Parameter-Optimized K-means Clustering for Handwriting Recognition”, International Conference on Internet Computing and Information Services, pp:435-438, 2011 [2]Karthik Sheshadri, Pavan Kumar T Ambekar, Deeksha Padma Prasad and Dr.Ramakanth P Kumar, “An OCR system for Printed Kannada using K-means clustering”, International Conference on Industrial Technology ,pp:183-187, 2010 [3]Mu-King Tsay, Keh-Hwashyu, Pao-Chung Chang, “Feature Transformation with Generalized Learning Vector Quantization for Hand-Written Chinese Character Recognition”, IEICE Transactions on Information & System, Vol.E82-D, 1992 [4]B. Vijay Kumar, A. G. Ramakrishnan, “Radial Basis Function And Subspace Approach For Printed Kannada Text Recognition”, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp: V-321-4 vol.5, 2004 [5]Premnath Dubey, Wasin Sinthupinyo, “New Approach on Structural Feature Extraction for Character Recognition”, International Symposium on Communications and Information Technologies, pp:946-949, 2010 [6]Igor Kleiner, Daniel Keren, Llan Newman, Oren Ben-Zwi,“Applying property testing to an image partitioning problem”, IEEE Transactions On Pattern Analysis And Machine Intelligence, Vol. 33, No.2, 2011 [7]Sanghamitra Mohanty, Himadri Nandini Dasbebartta, Tarun Kumar Behera, “An Efficient Bilingual Optical Character Recognition(English-Oriya) System for Printed Documents”, Seventh International Conference on Advances in Pattern Recognition, pp: 398 – 401, 2009 [8]Oivind Due Trier, Anil K Jain, and Torfinn Taxt ,“Feature Extraction Methods For Character Recognition–A Survey ”, Pattern Recognition, Vol 29, pp 641-662, 1995 [9]Vuokko Vuori, Jorma Laaksonen , “A Comparison of Techniques for Automatic Clustering of Handwritten Characters”, 16th International Conference on Pattern Recognition, Vol 3, pp:168-171, 2002

You May Also Find These Documents Helpful

  • Powerful Essays

    Why Did You Kill Me?

    • 2033 Words
    • 9 Pages

    Layton, Julia. "How Handwriting Analysis Works." HowStuffWorks. Discovery Communications, Oct. 2008. Web. 07 Aug. 2012.…

    • 2033 Words
    • 9 Pages
    Powerful Essays
  • Good Essays

    Biometrics technology aims at utilizing major and distinctive characteristics such as behavioral or biological, for the sake of positively indentifying people. With the help of a combination of hardware and specific identifying sets of rules, a basic human attribute, automated biometric recognition mimics to distinguish and categorize other people as individual and unique. But the challenges surrounding biometrics are great as well.…

    • 1008 Words
    • 5 Pages
    Good Essays
  • Good Essays

    There is a massive disagreement in the present day concerning the biometric identification technology which is used to boost the security through travel. The research inside these technologies has been used to extend ways in how the individuals identify faces for detection and develop the similar strategy in a replicated mechanical system that will scan faces and conclude their likeness with those in a database.…

    • 990 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Spatial Analysis

    • 453 Words
    • 2 Pages

    Text/graphics separation works very smoothly in this case (Fig. 18), as do vectorization and arcs detection (Fig. 19) and loop extraction.…

    • 453 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    Gcvmdbkjvdhf

    • 3380 Words
    • 14 Pages

    OCR Advanced Subsidiary GCE in Applied ICT: H115/H315 Candidate A: Unit G040: Using ICT to communicate…

    • 3380 Words
    • 14 Pages
    Powerful Essays
  • Satisfactory Essays

    (optical character recognition), and specify the language setting for OCR. Click OK to close the…

    • 305 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    Decoding. Recognizing the pronunciation of printed words by applying the many correspondences between particular letters and phonemes (Neuman & Dickinson, 2003).…

    • 2364 Words
    • 10 Pages
    Powerful Essays
  • Good Essays

    The first automated speech recognition system the author will analyze is produced by a company called Application Technology, or AppTek. AppTek is located in McLean, Virginia, and has been in the Human Language Technology field for over 20 years. AppTek’s ASR product is called PlainSpeech, and is used for speech dictation, broadcast and telephony. This program can do anything from a simple chain of numbers to vocabularies of up to 100,000 words. PlainSpeech recognizes continuous speech, offers gender-independent speech recognition, as well as speaker dependent and speaker independent modes. PlainSpeech also offers a scalable vocabulary as well as a scalable number of recognized languages. At this time however, the author of this paper was unable to locate a price for this product on the manufacturers website. (apptek.com, 2009)…

    • 606 Words
    • 2 Pages
    Good Essays
  • Powerful Essays

    An optical scanner is a hardware input device that allows a user to take an image or text and convert it into a digital file, allowing the computer to read or display the scanned object. Features of the optical scanner include-its design, software compatibility, type of feeder, speed, resolution, etc. Some advantages of the optical scanner include :…

    • 2633 Words
    • 11 Pages
    Powerful Essays
  • Powerful Essays

    In this document we analyse the two programs that are presently in operation in the Textprint…

    • 3135 Words
    • 22 Pages
    Powerful Essays
  • Satisfactory Essays

    R.O.G Sdn Bhd aims to become the market leader in hand-held text scanning consumer electronic products. The product and its future derivatives are aimed for rich variety of market niches including students, researchers, professionals, manager etc. providing innovation through product and service enhancements; including wireless capabilities and smartphone applications.…

    • 306 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Image Segmentation

    • 683 Words
    • 3 Pages

    1. Introduction - Image pattern recognition ..............................................................................1 - Image segmentation .............................................................................. 2 2. Relative work to region-based approach on image segmentation 2.1 Feature extraction approach ........................................................................... 2 2.2 Classification approach and segmentation results .........................................4 2.2.1 Supervised classification .............................................5 2.2.2 Unsupervised classification .........................................6 3. Conclusion ...........................................................................................................12 4. References ............................................................................................................12…

    • 683 Words
    • 3 Pages
    Good Essays
  • Good Essays

    kas timur

    • 926 Words
    • 4 Pages

    Magnetic-ink character recognition (распознавание символов, нанесённых магнитными чернилами) (MICR, pronounced "mike-er") involves using a machine to read characters made of magnetized particles (намагниченные частицы). The…

    • 926 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    cs 322

    • 492 Words
    • 2 Pages

    An algorithm is a large-scale continuous study and research for the most time-convenient and resource-efficient mode of systematically doing things accurately. It predates the existence of computers. As such, algorithms arise more significantly as compared to computing technology.…

    • 492 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Question: Compare and contrast spelling (orthography) and phonetic transcriptions. Your terms should be well defined and your discussion should be well illustrated. Write full sentences in Standard English.…

    • 1393 Words
    • 6 Pages
    Good Essays