dc.contributor.advisor | Hoare, Terri | en |
dc.contributor.author | Pathela, Chirag Kumar | |
dc.date.accessioned | 2021-04-28T18:43:09Z | |
dc.date.available | 2021-04-28T18:43:09Z | |
dc.date.issued | 2020 | |
dc.identifier.citation | Pathela, C.K. (2020). Exploring the space of topic modelling and topic coherence on short and long text corpora. Masters Thesis, Dublin Business School. | en |
dc.identifier.uri | https://esource.dbs.ie/handle/10788/4232 | |
dc.description.abstract | Topic Modelling, a discipline of Natural Language Processing, is widely prevalent and its application on social network communications has become essential in identifying key themes impacting society. In this dissertation titled- “Exploring the space of Topic Modelling and Topic Coherence on short and long text corpora” a comparative study of topic modelling algorithms is presented including LDA (Latent Dirichlet Allocation), LSA(Latent Semantic Analysis), NMF(Non Negative Matrix Factorization) ,BTM(Biterm Topic Modelling). Algorithms are applied on Zomato and Ovarian Cancer Tweets extracted from Twitter and on Amazon Food Reviews. Six robust performance metrics are used for comparative purposes using the online Palmetto tool. The results obtained reveal that all models have strong potential for topic modelling. BTM performed the best in detecting more coherent topics on short texts measured across the six coherence metrics, whereas LDA outperformed on long texts. NMF outperforms other algorithms in terms of execution time. | en |
dc.language.iso | en | en |
dc.publisher | Dublin Business School | en |
dc.rights | Items in eSource are protected by copyright. Previously published items are made available in accordance with the copyright policy of the publisher/copyright holder. | en |
dc.rights.uri | http://esource.dbs.ie/copyright | en |
dc.title | Exploring the space of topic modelling and topic coherence on short and long text corpora | en |
dc.type | Thesis | en |
dc.rights.holder | Copyright: The publisher | en |
dc.type.degreename | MSc in Data Analytics | en |
dc.type.degreelevel | MSc | en |