Vakilsearch LogoIs NowZolvit Logo
close icon
image
image
user-login
Patent search/

DEEP LEARNING MODEL FOR SPEECH-TO-TEXT CONVERSION

search

Patent Search in India

  • tick

    Extensive patent search conducted by a registered patent agent

  • tick

    Patent search done by experts in under 48hrs

₹999

₹399

Talk to expert

DEEP LEARNING MODEL FOR SPEECH-TO-TEXT CONVERSION

ORDINARY APPLICATION

Published

date

Filed on 10 November 2024

Abstract

The present invention provides a deep learning-based system for converting speech to text, utilizing a transformer-based neural network architecture to achieve high transcription accuracy across diverse languages, accents, and noisy environments. The system comprises an audio pre-processing module that enhances input audio quality by reducing noise and segmenting audio into feature-rich representations. A transformer model with self-attention and cross-attention layers captures phonetic, syntactic, and semantic context, ensuring continuity and precision in transcription. A post-processing module corrects grammatical errors and refines output text for readability. An adaptive learning module enables continuous improvement by updating the model based on user interactions, accommodating accent variations, domain-specific vocabulary, and new linguistic patterns. The system is optimized for real-time applications and can be deployed on resource-constrained devices, providing an efficient, versatile, and adaptable

Patent Information

Application ID202441086572
Invention FieldELECTRONICS
Date of Application10/11/2024
Publication Number46/2024

Inventors

NameAddressCountryNationality
Dr. N. PenchalaiahAssistant Professor, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
M. Harsha VardhansaiFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Maruputi BabyFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Mathakala SubramanyamFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Mattepu SathvikaFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Miriyala ThejaswiFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Mohammad Althaf BashaFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
M. Sujith Kumar ReddyFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Munagala SravanthiFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia
Muppala Anil KumarFinal Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India.IndiaIndia

Applicants

NameAddressCountryNationality
Audisankara College of Engineering & TechnologyAudisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist, Andhra Pradesh, India-524101, India.IndiaIndia

Specification

Description:The embodiments of the present invention generally relates to the field of automatic speech recognition (ASR), particularly to a deep learning-based method and system for converting spoken language into text. This invention employs advanced neural network architectures to accurately transcribe speech in real-time, designed to handle complex language processing challenges such as accent variations, background noise, and domain-specific vocabulary, thus enhancing the effectiveness of speech-to-text applications across a range of industries.
BACKGROUND OF THE INVENTION
The following description of related art is intended to provide background information pertaining to the field of the disclosure. This section may include certain aspects of the art that may be related to various features of the present disclosure. However, it should be appreciated that this section be used only to enhance the understanding of the reader with respect to the present disclosure, and not as admissions of prior art.

Converting spoken , Claims:1. A system for converting speech to text, comprising:
an audio pre-processing module configured to receive and process audio signals by reducing noise, enhancing speech quality, and converting the audio into feature-rich representations;
a transformer-based neural network configured to encode the processed audio into intermediate feature representations using multiple attention layers to capture phonetic, syntactic, and semantic context;
a post-processing module configured to refine and correct the initial text transcription by applying linguistic rules and contextual error correction; and
an adaptive learning module that continuously updates the transformer model to recognize accents, dialects, and domain-specific vocabulary based on user interactions and feedback.

2. The system of claim 1, wherein the audio pre-processing module further includes a spectral analysis unit that converts the audio input into spectrograms to enhance the feature representation for the neural network.

3. The system of claim 1,

Documents

NameDate
202441086572-COMPLETE SPECIFICATION [10-11-2024(online)].pdf10/11/2024
202441086572-DECLARATION OF INVENTORSHIP (FORM 5) [10-11-2024(online)].pdf10/11/2024
202441086572-DRAWINGS [10-11-2024(online)].pdf10/11/2024
202441086572-FORM 1 [10-11-2024(online)].pdf10/11/2024
202441086572-FORM-9 [10-11-2024(online)].pdf10/11/2024
202441086572-REQUEST FOR EARLY PUBLICATION(FORM-9) [10-11-2024(online)].pdf10/11/2024

footer-service

By continuing past this page, you agree to our Terms of Service,Cookie PolicyPrivacy Policy  and  Refund Policy  © - Uber9 Business Process Services Private Limited. All rights reserved.

Uber9 Business Process Services Private Limited, CIN - U74900TN2014PTC098414, GSTIN - 33AABCU7650C1ZM, Registered Office Address - F-97, Newry Shreya Apartments Anna Nagar East, Chennai, Tamil Nadu 600102, India.

Please note that we are a facilitating platform enabling access to reliable professionals. We are not a law firm and do not provide legal services ourselves. The information on this website is for the purpose of knowledge only and should not be relied upon as legal advice or opinion.