SPEECH RECOGNITION

SPEECH RECOGNITION

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). It incorporates knowledge and research in the computer science, linguistics and computer engineering fields. The reverse process is speech synthesis.

Some speech recognition systems require "solly" (also called "enrollment") where an individual speaker reads text or isolated vocabulary into the system. The system analyzes the person's specific voice and uses it to fine-tune the recognition of that person's speech, resulting in increased accuracy. Systems that do not use training are called "speaker-independent" systems. Systems that use training are called "speaker dependent".

Speech recognition applications include voice user interfaces such as voice dialing (e.g. "call home"), call routing (e.g. "I would like to make a collect call"), domotic appliance control, search key words (e.g. find a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g. a radiology report), determining speaker characteristics,speech-to-text processing (e.g., word processors or emails), and aircraft (usually termed direct voice input).

The term voice recognition monitor speaker identification refers to identifying the speaker, rather than what they are saying. Recognizing the speaker can simplify the task of translating speech in systems that have been trained on a specific person's voice or it can be used to authenticate or verify the identity of a speaker as part of a security process.

From the technology perspective, speech recognition has a long history with several waves of major innovations. Most recently, the field has benefited from advances in deep learning and big data. The advances are evidenced not only by the surge of academic papers published in the field, but more importantly by the worldwide industry adoption of a variety of deep learning methods in designing and deploying speech recognition systems.

To view or add a comment, sign in

More articles by Mohammed Shafi

  • CLOTH SPINNING MILLS

    Cloth spinning mills are industrial facilities that convert raw fibers, such as cotton, wool, or synthetic materials…

    1 Comment
  • THE ROLE OF ELECTRONICS IN BIOMEDICAL ENGINEERING

    The integration of electronics into biomedical engineering has transformed the healthcare industry, enabling…

  • QUANTUM COMPUTING

    Quantum computing is a transformative technology that leverages principles from quantum mechanics—such as superposition…

    1 Comment
  • THE FUTURE OF OLED TECHNOLOGY

    OLED (Organic Light Emitting Diode) technology is one of the most significant advancements in display technology…

  • THE EVOLUTION OF SATELLITE COMMUNICATION

    The evolution of satellite communication spans over six decades of innovation and advancement, transforming the way we…

  • REMOTE PATIENT MONITORING FOR CHRONIC HEART FAILURE

    Chronic heart failure (CHF) is a long-term condition where the heart struggles to pump blood effectively…

  • Tableau

    Tableau is a powerful data visualization tool used by businesses and analysts to explore and present data in an…

  • Introduction to Robotics

    Robotics is an incredibly diverse field that encompasses various disciplines such as mechanical engineering, electrical…

  • Machine Learning

    Machine learning (ML) is a type of artificial intelligence (AI) focused on building computer systems that learn from…

  • Computer Vision

    Computer vision is a field of computer science that focuses on enabling computers to identify and understand objects…

Others also viewed

Explore content categories