Consult an Expert
Trademark
Design Registration
Consult an Expert
Trademark
Copyright
Patent
Infringement
Design Registration
More
Consult an Expert
Consult an Expert
Trademark
Design Registration
Login
DEEP LEARNING MODEL FOR SPEECH-TO-TEXT CONVERSION
Extensive patent search conducted by a registered patent agent
Patent search done by experts in under 48hrs
₹999
₹399
Abstract
Information
Inventors
Applicants
Specification
Documents
ORDINARY APPLICATION
Published
Filed on 10 November 2024
Abstract
The present invention provides a deep learning-based system for converting speech to text, utilizing a transformer-based neural network architecture to achieve high transcription accuracy across diverse languages, accents, and noisy environments. The system comprises an audio pre-processing module that enhances input audio quality by reducing noise and segmenting audio into feature-rich representations. A transformer model with self-attention and cross-attention layers captures phonetic, syntactic, and semantic context, ensuring continuity and precision in transcription. A post-processing module corrects grammatical errors and refines output text for readability. An adaptive learning module enables continuous improvement by updating the model based on user interactions, accommodating accent variations, domain-specific vocabulary, and new linguistic patterns. The system is optimized for real-time applications and can be deployed on resource-constrained devices, providing an efficient, versatile, and adaptable
Patent Information
Application ID | 202441086572 |
Invention Field | ELECTRONICS |
Date of Application | 10/11/2024 |
Publication Number | 46/2024 |
Inventors
Name | Address | Country | Nationality |
---|---|---|---|
Dr. N. Penchalaiah | Assistant Professor, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
M. Harsha Vardhansai | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Maruputi Baby | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Mathakala Subramanyam | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Mattepu Sathvika | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Miriyala Thejaswi | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Mohammad Althaf Basha | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
M. Sujith Kumar Reddy | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Munagala Sravanthi | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Muppala Anil Kumar | Final Year B.Tech Student, Department of Computer Science & Engineering, Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist., Andhra Pradesh, India-524101, India. | India | India |
Applicants
Name | Address | Country | Nationality |
---|---|---|---|
Audisankara College of Engineering & Technology | Audisankara College of Engineering & Technology, NH-16, By-Pass Road, Gudur, Tirupati Dist, Andhra Pradesh, India-524101, India. | India | India |
Specification
Description:The embodiments of the present invention generally relates to the field of automatic speech recognition (ASR), particularly to a deep learning-based method and system for converting spoken language into text. This invention employs advanced neural network architectures to accurately transcribe speech in real-time, designed to handle complex language processing challenges such as accent variations, background noise, and domain-specific vocabulary, thus enhancing the effectiveness of speech-to-text applications across a range of industries.
BACKGROUND OF THE INVENTION
The following description of related art is intended to provide background information pertaining to the field of the disclosure. This section may include certain aspects of the art that may be related to various features of the present disclosure. However, it should be appreciated that this section be used only to enhance the understanding of the reader with respect to the present disclosure, and not as admissions of prior art.
Converting spoken , Claims:1. A system for converting speech to text, comprising:
an audio pre-processing module configured to receive and process audio signals by reducing noise, enhancing speech quality, and converting the audio into feature-rich representations;
a transformer-based neural network configured to encode the processed audio into intermediate feature representations using multiple attention layers to capture phonetic, syntactic, and semantic context;
a post-processing module configured to refine and correct the initial text transcription by applying linguistic rules and contextual error correction; and
an adaptive learning module that continuously updates the transformer model to recognize accents, dialects, and domain-specific vocabulary based on user interactions and feedback.
2. The system of claim 1, wherein the audio pre-processing module further includes a spectral analysis unit that converts the audio input into spectrograms to enhance the feature representation for the neural network.
3. The system of claim 1,
Documents
Name | Date |
---|---|
202441086572-COMPLETE SPECIFICATION [10-11-2024(online)].pdf | 10/11/2024 |
202441086572-DECLARATION OF INVENTORSHIP (FORM 5) [10-11-2024(online)].pdf | 10/11/2024 |
202441086572-DRAWINGS [10-11-2024(online)].pdf | 10/11/2024 |
202441086572-FORM 1 [10-11-2024(online)].pdf | 10/11/2024 |
202441086572-FORM-9 [10-11-2024(online)].pdf | 10/11/2024 |
202441086572-REQUEST FOR EARLY PUBLICATION(FORM-9) [10-11-2024(online)].pdf | 10/11/2024 |
Talk To Experts
Calculators
Downloads
By continuing past this page, you agree to our Terms of Service,, Cookie Policy, Privacy Policy and Refund Policy © - Uber9 Business Process Services Private Limited. All rights reserved.
Uber9 Business Process Services Private Limited, CIN - U74900TN2014PTC098414, GSTIN - 33AABCU7650C1ZM, Registered Office Address - F-97, Newry Shreya Apartments Anna Nagar East, Chennai, Tamil Nadu 600102, India.
Please note that we are a facilitating platform enabling access to reliable professionals. We are not a law firm and do not provide legal services ourselves. The information on this website is for the purpose of knowledge only and should not be relied upon as legal advice or opinion.