The Egyptian Journal of Language Engineering
Gbaily, M. (2021). Automatic Database Segmentation using Hybrid Spectrum-Visual Approach. The Egyptian Journal of Language Engineering, 8(2), 28-43. doi: 10.21608/ejle.2021.89867.1024

Automatic Database Segmentation using Hybrid Spectrum-Visual Approach

Article 3, Volume 8, Issue 2, September 2021, Pages 28-43
Document Type: Original Article
DOI: 10.21608/ejle.2021.89867.1024
Author
Manar Othman Gbaily
Electrical Engineering Department, Faculty of Engineering, Fayoum University, Egypt
Abstract
Automated segmentation of speech signals has attracted many researchers worldwide, since many speech processing systems require the speech waveform to be segmented into its principal acoustic units. In this research, the TIMIT database (DB) is used to carry out this process and to validate its results. This paper presents a novel method for segmenting speech phonemes, in which the proposed strategy helps in selecting an appropriate feature extraction technique for speech segmentation. Three main feature extraction techniques are used: the first is the Mel Frequency Cepstral Coefficient (MFCC); the second is Best Tree Encoding (BTE); and the third is the Image Normalized Encoder (INE), a hybrid of the Best Tree Image (BTI) and the ResNet-50 Convolutional Neural Network (CNN). The data are then trained using a hybrid model consisting of a Hidden Markov Model (HMM) and a Gaussian Mixture Model (GMM) to improve the performance of automatic speech recognition. The proposed model is tested and verified against the most widely used feature set, MFCC plus delta and delta-delta coefficients (39 parameters), to evaluate its performance. This approach has the potential to be used in applications such as automatic speech recognition and automatic language identification. The experimental results show that the BTE technique achieved a higher success rate (𝜂), 92.64%, than the INE technique. However, the INE technique gives confusion success rates for Tr and NTr of 97.1% and 99.1%, respectively.
Keywords
ASR; MFCC; BTE; CNN; HMM
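
The 39-parameter baseline named in the abstract is conventionally built by stacking 13 static MFCCs with their delta and delta-delta coefficients. The sketch below illustrates that stacking only; it is not the author's implementation. The 13-coefficient split, the regression window width of 2, the function names `delta` and `stack_39`, and the random placeholder standing in for real MFCC frames are all assumptions made for illustration.

```python
import numpy as np


def delta(features: np.ndarray, width: int = 2) -> np.ndarray:
    """Regression-based delta along the time axis.

    features: array of shape (num_frames, num_coeffs).
    width: number of frames on each side used in the regression (assumed 2).
    """
    num_frames = features.shape[0]
    # Repeat the first and last frames so every frame has full context.
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = np.zeros(features.shape, dtype=float)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + num_frames]
        behind = padded[width - n : width - n + num_frames]
        out += n * (ahead - behind)
    return out / denom


def stack_39(mfcc13: np.ndarray) -> np.ndarray:
    """Concatenate static, delta, and delta-delta coefficients per frame."""
    d1 = delta(mfcc13)
    d2 = delta(d1)
    return np.hstack([mfcc13, d1, d2])


# Placeholder input: 100 frames of 13 random values standing in for MFCCs
# (hypothetical data, not taken from TIMIT).
rng = np.random.default_rng(0)
mfcc13 = rng.standard_normal((100, 13))
features = stack_39(mfcc13)
print(features.shape)
```

In practice the static coefficients would come from an MFCC front end applied to the speech frames; only the stacking step is shown here.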