The Egyptian Journal of Language Engineering
Gody, A., Abul Seoud, R., Ezz El-Din, M. (2015). Using Mel-Mapped Best Tree Encoding for Baseline-Context-Independent-Mono-Phone Automatic Speech Recognition. The Egyptian Journal of Language Engineering, 2(1), 10-24. doi: 10.21608/ejle.2015.60254

Using Mel-Mapped Best Tree Encoding for Baseline-Context-Independent-Mono-Phone Automatic Speech Recognition

Article 2, Volume 2, Issue 1, April 2015, Pages 10-24
Document Type: Original Article
DOI: 10.21608/ejle.2015.60254
Authors
Amr Gody; Rania Abul Seoud; Mai Ezz El-Din
Electronics and Communications Engineering Department, Faculty of Engineering, Fayoum University, Egypt
Abstract
Best-Tree Encoding (BTE) was first introduced by Amr M. Gody [1] as a new feature set for the Automatic Speech Recognition (ASR) problem. BTE essentially acts as a spectrum analyzer: it relies on wavelet packets to project the signal power onto predefined filter banks. The feature components are then encoded into digital form using an entropy method and a digital encoding procedure. In this research, BTE is further developed by adding two key factors to the BTE process: the Mel scale (MS) and baseband bandwidth mapping (BM). This research provides a baseline performance evaluation for context-independent mono-phone recognition (without grammar) of English using the Vid-TIMIT database. Vid-TIMIT consists of 43 speakers (19 female and 24 male) reciting short sentences. The database was recorded in a noisy environment (mostly computer fan noise) and is not hand-verified. A total of 15,643 phone segments are used to test and evaluate the newly proposed features. A Hidden Markov Model (HMM) is used as the recognition engine via the HTK toolkit, chosen for its popularity in ASR. The results are evaluated by comparison with MFCC on the same database. The proposed model gives the same recognition efficiency as MFCC on the same test database while requiring almost 66% less storage than the MFCC feature vector.
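The Mel-scale (MS) mapping mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which Mel formula, frequency range, or number of bands the authors use, so the code below assumes the commonly used conversion mel = 2595 log10(1 + f/700); the function names, the 0 to 8000 Hz range, and the band count are hypothetical choices made only for illustration, not the authors' BTE procedure.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Convert a frequency in Hz to mels (assumed formula, not from the paper)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m: float) -> float:
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_spaced_edges(f_low: float, f_high: float, n_bands: int) -> list[float]:
    """Return n_bands + 1 band edges in Hz, equally spaced on the Mel scale."""
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / n_bands
    return [mel_to_hz(m_low + i * step) for i in range(n_bands + 1)]


# Hypothetical example: 4 bands covering 0 to 8000 Hz.
edges = mel_spaced_edges(0.0, 8000.0, 4)
widths = [hi - lo for lo, hi in zip(edges, edges[1:])]
for lo, hi in zip(edges, edges[1:]):
    print(f"{lo:8.1f} Hz to {hi:8.1f} Hz")
```

Bands that are equally wide in mels become progressively wider in Hz, which is why a Mel mapping gives finer resolution at low frequencies than at high frequencies.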
Keywords
Automatic Speech Recognition (ASR); Arabic Phone Recognition; Wavelet packets; Mel-Scale; WPBTE; MFCC; HTK; BTE