Preview

Speech Recognition Technology System: Development and Applications

Better Essays
Open Document
Open Document
1283 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Speech Recognition Technology System: Development and Applications
SPEECH RECOGNITION TECHNOLOGY Final Research Project

Health Care Information System 649 December 02, 2012

Abstract
This research paper presents an overview of speech recognition technology system development and applications. My research begins with the definition of the speech recognition system and continues on to exploring its usages, benefits, costs and the level of accuracy that one should expect from using this system. The benefits and problems users are facing from utilizing this innovative technology are the highlights of this research. An application of speech recognition technology in the healthcare industry is another section that this paper explores briefly. Please note that speech recognition technology in general recognizes speech and speaker, but this paper is concerned with speech recognition rather than speaker recognition.
Keywords: SR, ASR, STT,

Introduction
In computer science, speech recognition (SR) is the translation of spoken words to text. It is also known as automated speech recognition (ASR), computer speech recognition, speech to text or just (STT) (Kirriemuir, 2003, Para. 1). Some SR systems use training where an individual speaker reads section of text into the SR system. These SR systems are analyze the person’s specific voice and use it to fine tune the recognition of that person’s speech, resulting in more accurate transcription. Systems that do not use training are called “Speaker Independent” systems. Systems that use training are called “Speaker Dependent” systems (Kirriemuir, 2003, Para. 4). A speech recognition system consists of the following: a microphone for the person to speak into, speech recognition software, a computer to take and interpret the speech, and a good quality soundcard for input and /or output. How does it



References: University of Edinburgh. (n.d.). Mobiusing advanced technologies for care at home. Retrieved on November 30, 2012, from http://www.cs.stir.ac.uk CNN (2000, May 12). Technology is voice recognition dangerous for your health. Retrieved on November 13, 2012, from http://http://articles.cnn.com/2000-05-23/tech/voice. saving.tips.idg_1_speach-recognition-dragon-systmes Kirriemuir, John. (2003, March 30). Speech Recognition Technologies. Retrieved on November 30, 2012, from www.Jisc.ac.uk

You May Also Find These Documents Helpful

  • Satisfactory Essays

    speech generating devices work by helping an individual communicate verbally. ACC is so important because it helps individuals produce or comprehend written or spoken language.…

    • 438 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Hcs 245-Week 5

    • 1224 Words
    • 5 Pages

    Net’s Solution – A provider may find communicating with someone who is hearing impaired very difficult to deal with at times. Although, it may be difficult one should always know that there is several people who can assist them when addressing a hearing impaired person. Some people who can assist a provider could be an interpreter. An interpreter is a person who converts a thought or expression in a source language into an expression with a comparable meaning in a target language either simultaneously in "real time" or consecutively after one party has finished speaking. Interpreting is "a form of translation (in the wider sense) in which (a) the source-language text is presented only once and thus cannot be reviewed or replayed, and (b) the target-language text is produced under time pressure, with little chance for correction and revision" (Munday 2009, p.133).The interpreter's function is to convey every semantic element or to express tone and register every intention and feeling of the message that the source-language speaker is directing to target-language recipients. Depending on the situation one is facing it could require a speech, sign or oral language interpreter. Speech interpreters help people understand a specific way to correctly say or use words. Speech interpreters also can help someone who doesn’t fluently speak a specific language. Sign language,…

    • 1224 Words
    • 5 Pages
    Good Essays
  • Better Essays

    The human service profession involves various obstacles to overcome when working with a variety of clients. Obstacles are seen in all phases of human services in areas providing services, planning programs, and funding troubles. However, the elimination of some of these barriers can be done with the use of proper technology. Providing services to the aging population can be challenging, when providing services to this particular group because of the rising elderly population needing help and the decline of mental and physical aging individual. The following sections of this assignment will attempt to identify some of the technological applications that can be of use to overcome these barriers.…

    • 1181 Words
    • 5 Pages
    Better Essays
  • Good Essays

    Text to Speech Engine

    • 432 Words
    • 2 Pages

    A Text-To-Speech (TTS) synthesizer is a computer-based system that should be able to read any text aloud, whether it was directly introduced in the computer by an operator or scanned and submitted to an Optical Character Recognition (OCR) system. Let us try to be clear. There is a fundamental difference between the system we are about to discuss here and any other talking machine (as a cassette-player for example) in the sense that we are interested in the automatic production of new sentences. This definition still needs some refinements. Systems that simply concatenate isolated words or parts of sentences, denoted as Voice Response Systems, are only applicable when a limited vocabulary is required (typically a few one hundreds of words), and when the sentences to be pronounced respect a very restricted structure, as is the case for the announcement of arrivals in train stations for instance. In the context of TTS synthesis, it is impossible (and luckily useless) to record and store all the words of the language. It is thus more suitable to define Text-To-Speech as the automatic production of speech, through a grapheme-to-phoneme transcription of the sentences to utter.…

    • 432 Words
    • 2 Pages
    Good Essays
  • Good Essays

    Such information should be available only to the physician of record and other health care and insurance personnel as necessary. Privacy is an individual’s constitutional right to be left alone, to be free from unwarranted publicity, and to conduct his or her life without its being made public.…

    • 999 Words
    • 4 Pages
    Good Essays
  • Good Essays

    SPEECH Is the vocalised sounds made by a human of their learned language, to communicate to others.…

    • 962 Words
    • 4 Pages
    Good Essays
  • Satisfactory Essays

    In 2010, in the Yerba Buena Center for the Arts in San Francisco, Apple co-founder Steve Jobs announced the iPad.…

    • 529 Words
    • 3 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Healthcare Professionals

    • 981 Words
    • 4 Pages

    Handbook of Informatics for Nurses & Healthcare Professionals Hebda 5th Edition Test Bank Handbook of Informatics for Nurses & Healthcare Professionals Hebda 5th Edition Test Bank…

    • 981 Words
    • 4 Pages
    Satisfactory Essays
  • Powerful Essays

    Peter B. Southard, Soongoo Hong, Keng Siau Department of Management College of Business Administration University of Nebraska-Lincoln Lincoln, NE 68588-0491 USA…

    • 6281 Words
    • 26 Pages
    Powerful Essays
  • Satisfactory Essays

    The PMP has a computerized scheduling feature used to schedule patient appointments. The program has a feature which will search for an available time for a doctor where the next appointment can be entered. The appointments for the day can then be printed allowing the staff to have medical records ready when the patient checks in. The PMP is used not only to record the patient information but also create and transmit electronic claims, receive electronic payments, bill patients, create financial reports, and collect on past due accounts. Medical offices just like any other business need to get paid to continue servicing their patients. The PMP assists with receiving insurance payments without a long delay because claims are generated and then can be sent directly to a clearinghouse or health plan. Claims sent to a clearinghouse will check the claims accuracy, if anything needs to be corrected the claim will be sent back to the medical office. The claim will need to be corrected before being processed. The PMP can also be used to post payments to the patient’s accounts once received by a remittance…

    • 805 Words
    • 4 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Lip Reading

    • 386 Words
    • 2 Pages

    According to Roland Goecke (1998), current speech recognition systems use powerful statistical models of the audio components of spoken language but can fail unpredictably in non-ideal acoustic conditions. This in addition to the short period of time caused us to limit our experiment on basic digits only (from 0-9), presuming ideal conditions.…

    • 386 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    This paper will evaluate several different types automatic speech recognition software packages. The author will address the following questions as it relates to ASR systems: price point of each software program; Whether or not these systems are speaker independent or speaker dependent; Whether or not they support continuous speech recognition or discreet speech recognition; Do the programs offer add-on vocabularies for purchase. In addition, the author will evaluate his level of comfort in speaking the contents of term paper as opposed to typing one. And lastly, the level of organization required to use speech recognition as opposed to typing.…

    • 606 Words
    • 2 Pages
    Good Essays
  • Better Essays

    Speech is the fundamental and common medium, hence important for us, to communicate. In general, there exists a need for voice based communications,human-machine/machine-machine interfaces, and automatic speech recognition systems to increase the reliability of these systems in noisy environments. In many cases, these systems work well in nearly noise-free conditions, but their performance deteriorates rapidly in noisy conditions. Therefore, improvement in existing pre-processing algorithms or introducing entire new class of algorithm for speech enhancement is always the…

    • 3824 Words
    • 16 Pages
    Better Essays
  • Powerful Essays

    Linear Predictive Coding

    • 6950 Words
    • 28 Pages

    References: [1] [2] V. Hardman and O. Hodson. Internet/Mbone Audio (2000) 5-7. Scott C. Douglas. Introduction to Adaptive Filters, Digital Signal Processing Handbook (1999) 7-12. Poor, H. V., Looney, C. G., Marks II, R. J., Verdú, S., Thomas, J. A., Cover, T. M. Information Theory. The Electrical Engineering Handbook (2000) 56-57. R. Sproat, and J. Olive. Text-to-Speech Synthesis, Digital Signal Processing Handbook (1999) 9-11 . Richard C. Dorf, et. al.. Broadcasting (2000) 44-47. Richard V. Cox. Speech Coding (1999) 5-8. Randy Goldberg and Lance Riek. A Practical Handbook of Speech Coders (1999) Chapter 2:1-28, Chapter 4: 1-14, Chapter 9: 1-9, Chapter 10:1-18. Mark Nelson and Jean-Loup Gailly. Speech Compression, The Data Compression Book (1995) 289-319. Khalid Sayood. Introduction to Data Compression (2000) 497-509. Richard Wolfson, Jay Pasachoff. Physics for Scientists and Engineers (1995) 376-377.…

    • 6950 Words
    • 28 Pages
    Powerful Essays
  • Satisfactory Essays

    Reading Skills Chart

    • 351 Words
    • 2 Pages

    Word Recognition (definition only here) - The ability to go from the printed form of a work to the spoken form.…

    • 351 Words
    • 2 Pages
    Satisfactory Essays

Related Topics