Understanding Document Thematic Structure: A Systematic Review of Topic Modeling Algorithms

Authors

  • Victor Odumuyiwa University of Lagos
  • Seun Bamidele Osuntoki
  • Oladipupo Sennaike Department of Computer Sciences, University of Lagos, Nigeria

DOI:

https://doi.org/10.31341/jios.46.2.3

Keywords:

Topic models, Information Retrieval, Text Mining, NMF, PLSA, LSA, LDA, Document Structure

Abstract

The increasing usage of the Internet and other digital platforms has brought in the era of big data with the attending increase in the quantity of unstructured data that is available for processing and storage. However, the full benefits of analyzing this large quantity of unstructured data will not be realized without proper techniques and algorithms. Topic modeling algorithms have seen a major success in this area. Different topic modeling algorithms exist and each one either employs probabilistic or linear algebra approaches. Recent reviews on topic modeling algorithms dwell majorly on probabilistic methods without giving proper treatment to the linear-algebra-based algorithms. This review explores linear-algebra-based topic models as well as probability-based topic models. An overview of how models generated by each of these algorithms represent document thematic structure is also presented.

Downloads

Published

2022-12-22

How to Cite

[1]
V. Odumuyiwa, S. B. Osuntoki, and O. Sennaike, “Understanding Document Thematic Structure: A Systematic Review of Topic Modeling Algorithms”, J. inf. organ. sci. (Online), vol. 46, no. 2, Dec. 2022.

Issue

Section

Articles