Topic Modeling with Word Embeddings

dc.contributor.authorAhmed, ITBIRENE
dc.contributor.authorCHIHANI, Brahim
dc.date.accessioned2022-05-22T14:00:33Z
dc.date.available2022-05-22T14:00:33Z
dc.date.issued2021
dc.description.abstractWith the great development in the field of digitization, the extraction of topics through information that is in the form of unmarked texts, is not an easy matter. Therefore, we need a topic modeling technique, which is based on unsupervised algorithms. In our thesis, we clarify the concept of topic modeling and the inherent approaches, such as Latent Dirichlet Allocation (LDA), Embedded Topic Model (ETM), Gaussian LDA (G-LDA), and LDA with Word2Vec (LDA2Vec). In the experimental work, we make an empirical comparison between both LDA and ETM methods on the 20 newsgroups, in terms of runtime and topic coherence. The results are in favor of the ETM methodEN_en
dc.identifier.urihttps://dspace.univ-ghardaia.edu.dz/xmlui/handle/123456789/1022
dc.publisheruniversité GhardaiaEN_en
dc.subjecttopic modeling, topic coherence, Latent Dirichlet Allocation (LDA), Embedded Topic Model (ETM), Gaussian LDA (G-LDA), LDA2Vec.EN_en
dc.titleTopic Modeling with Word EmbeddingsEN_en
dc.typeThesisEN_en

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Topic_Modeling_with_Word_Embeddings.pdf
Size:
2.86 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: