Zaazaa, A., Rashwan, M., Emam, O. (2022). Community Question Answering Ranking: Methodology Survey. The Egyptian Journal of Language Engineering, 9(2), 1-22. doi: 10.21608/ejle.2022.138720.1031
Ahmed Zaazaa; Mohsen Rashwan; Ossama Emam. "Community Question Answering Ranking: Methodology Survey". The Egyptian Journal of Language Engineering, 9, 2, 2022, 1-22. doi: 10.21608/ejle.2022.138720.1031
Zaazaa, A., Rashwan, M., Emam, O. (2022). 'Community Question Answering Ranking: Methodology Survey', The Egyptian Journal of Language Engineering, 9(2), pp. 1-22. doi: 10.21608/ejle.2022.138720.1031
Zaazaa, A., Rashwan, M., Emam, O. Community Question Answering Ranking: Methodology Survey. The Egyptian Journal of Language Engineering, 2022; 9(2): 1-22. doi: 10.21608/ejle.2022.138720.1031
Community Question Answering Ranking: Methodology Survey
2Electronics and Communication Department, Faculty of Engineering, Cairo University, Giza, Egypt
3IBM
Abstract
This paper surveys the evolution of word embeddings along with the methodologies used in Community Question Answering (cQA), and how these methodologies use word embeddings to achieve higher performance metrics. The paper first discusses vector modelling and how it affected Natural Language Processing (NLP) as a whole, then it details some of the approaches used like the one-hot-encoding, word2vec and others. The paper then discusses contextualized embeddings and how they improve on the previous techniques. The paper then sheds some light on language modelling along with new attention-based architectures (Transformers), discussing briefly how they work and how they affected not only cQA but NLP in general. Then the paper discusses in brief the shift in the field from model-based AI where most of the focus is on producing a model with high performance metrics to Data Centric AI where the focus is on trying to have a systemic way of labelling the data to ease the generation of a high-performance model.