Corpus Construction for Arabic Question Answering Subjectivity Classification

dc.contributor.authorSOUFFI, Soumia
dc.contributor.authorBOUAMEUR, Mounia
dc.date.accessioned2023-09-20T09:17:03Z
dc.date.available2023-09-20T09:17:03Z
dc.date.issued2023
dc.description.abstractSubjectivity and sentiment analysis, have gained significant attention in the field of Natural Language Processing (NLP) due to their ability to extract and classify subjective information expressed in textual data. Although, extensive research has been conducted on major languages such as English, Arabic with its dialectal variations lacks sufficient resources and research in this domain. This study aims to overcome the scarcity of resources in Arabic subjectivity analysis by constructing an extensive Arabic Question-Answering (QA) corpus specifically designed for subjectivity analysis. The corpus construction involves the following steps: data collection through web scraping, and data cleaning to ensure quality, followed by the annotation process by affecting subjectivity labels using two models that we developed utilizing the fine-tuning technique with two pre-trained models, XLM-RoBERTa and AraBERT. The availability of this corpus stimulates further research, drives advancements in Arabic NLP, and contributes to various applications in sentiment analysis and opinion mining.EN_en
dc.identifier.urihttps://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/6419
dc.publisheruniversity ghardaiaEN_en
dc.subjectSubjectivity analysis, sentiment analysis, fine-tuning, AraBERT, XLMRoBERTa.EN_en
dc.titleCorpus Construction for Arabic Question Answering Subjectivity ClassificationEN_en
dc.typeThesisEN_en

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Master_Corpus_construction_for_Arabic_Question_Answering_Subjectivity__Ghardaia_2023_ (5).pdf
Size:
3.37 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: