Corpus Construction for Arabic Question Answering Subjectivity Classification

SOUFFI, Soumia; BOUAMEUR, Mounia

Corpus Construction for Arabic Question Answering Subjectivity Classification

Files

Master_Corpus_construction_for_Arabic_Question_Answering_Subjectivity__Ghardaia_2023_ (5).pdf (3.37 MB)

Date

2023

Authors

SOUFFI, Soumia

BOUAMEUR, Mounia

Publisher

university ghardaia

Abstract

Subjectivity and sentiment analysis, have gained significant attention in the field of Natural Language Processing (NLP) due to their ability to extract and classify subjective information expressed in textual data. Although, extensive research has been conducted on major languages such as English, Arabic with its dialectal variations lacks sufficient resources and research in this domain. This study aims to overcome the scarcity of resources in Arabic subjectivity analysis by constructing an extensive Arabic Question-Answering (QA) corpus specifically designed for subjectivity analysis. The corpus construction involves the following steps: data collection through web scraping, and data cleaning to ensure quality, followed by the annotation process by affecting subjectivity labels using two models that we developed utilizing the fine-tuning technique with two pre-trained models, XLM-RoBERTa and AraBERT. The availability of this corpus stimulates further research, drives advancements in Arabic NLP, and contributes to various applications in sentiment analysis and opinion mining.

Keywords

Subjectivity analysis, sentiment analysis, fine-tuning, AraBERT, XLMRoBERTa.

URI

https://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/6419

Collections

Mémoires de Master

Full item page

Corpus Construction for Arabic Question Answering Subjectivity Classification

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By