The Egyptian Journal of Language Engineering
Kotait, R. (2024). Richness Lost in Machine Translationese:. The Egyptian Journal of Language Engineering, 11(1), 66-85. doi: 10.21608/ejle.2024.267336.1064

Richness Lost in Machine Translationese:

Article 5, Volume 11, Issue 1, April 2024, Pages 66-85
Document Type: Original Article
DOI: 10.21608/ejle.2024.267336.1064
Author
Radwa Kotait
English Department, Faculty of Al-Alsun, Ain Shams University
Abstract
Neural Machine Translation (NMT) may have been pronounced faster and better than human translation. However, NMT inherently overgeneralizes the patterns that appear more frequently in its training data at the expense of those that appear less frequently, a phenomenon dubbed “machine translationese”. Machine translationese has been observed to reflect some controversial asymmetries. One usually overlooked facet of this machine bias is the loss of “lexical richness”. Only recently have the generated translations been noticed to be disproportionately deformed and impoverished, negatively affected by NMT’s tendency to overgeneralize. Lexical richness, notwithstanding its worth, has not received the same attention as lexical accuracy and error measurement, and, more importantly, it has received no attention at all in under-researched language pairs such as Arabic–English. This study aims to shed light on lexical richness in the output of Arabic-into-English NMT as opposed to human translation (HT), answering the question: Does HT exhibit more lexical richness than NMT does? The study adopts the most agreed-upon definition of lexical richness as a superordinate term that includes “lexical diversity”, “lexical density”, and “lexical sophistication”, each of which is a statistical metric that gauges the lexical richness of a text. The study analyses the outputs of two NMT systems, Google Translate and Microsoft Translator, in terms of lexical richness, using both quantitative and qualitative methods, and then compares the results with those of the HT output. The corpus of the study comprises a news subcorpus and a literary subcorpus.
Keywords
lexical density; lexical diversity; lexical richness; lexical sophistication; neural machine translation
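The three components of lexical richness named in the abstract can be illustrated with a minimal Python sketch. The abstract does not state which formulas the study uses, so the definitions below are common textbook approximations assumed for illustration only: lexical diversity as the type-token ratio, lexical density as the share of content words, and lexical sophistication as the share of words outside a frequent-word list. The two word lists (`FUNCTION_WORDS`, `FREQUENT_WORDS`) and all function names are hypothetical stand-ins, far smaller than any list a real analysis would need.

```python
import re

# Hypothetical, deliberately tiny word lists (assumptions for illustration only).
FUNCTION_WORDS = {"the", "a", "an", "of", "in", "on", "and", "or",
                  "to", "is", "was", "it", "that"}
FREQUENT_WORDS = FUNCTION_WORDS | {"man", "said", "good", "house", "went", "big"}


def tokenize(text):
    """Lowercase the text and split it into simple alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def lexical_diversity(tokens):
    """Type-token ratio: distinct word forms divided by total tokens."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def lexical_density(tokens):
    """Share of tokens that are not in the function-word list."""
    content = [t for t in tokens if t not in FUNCTION_WORDS]
    return len(content) / len(tokens) if tokens else 0.0


def lexical_sophistication(tokens):
    """Share of tokens that fall outside the frequent-word list."""
    rare = [t for t in tokens if t not in FREQUENT_WORDS]
    return len(rare) / len(tokens) if tokens else 0.0


def lexical_richness(text):
    """Bundle the three assumed metrics for one text."""
    tokens = tokenize(text)
    return {
        "diversity": lexical_diversity(tokens),
        "density": lexical_density(tokens),
        "sophistication": lexical_sophistication(tokens),
    }


if __name__ == "__main__":
    # Compare two invented English renderings of the same sentence.
    for label, text in [("plain", "the man went to the house"),
                        ("richer", "the weary traveller went to the citadel")]:
        print(label, lexical_richness(text))
```

Under these assumed definitions, a higher score on any of the three measures for the HT output than for the NMT output of the same source text would point in the direction of the study's research question. The raw type-token ratio is sensitive to text length, so texts of very different sizes are not directly comparable with it.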