Social-Child-Case Document Clustering based on Topic Modeling using Latent Dirichlet Allocation

Children are the future of the nation. All treatment and learning they get would affect their future. Nowadays, there are various kinds of social problems related to children.  To ensure the right solution to their problem, social workers usually refer to the social-child-case (SCC) documents to fin...

Full description

Bibliographic Details
Main Authors: Nur Annisa Tresnasari, Teguh Bharata Adji, Adhistya Erna Permanasari
Format: Article
Language:English
Published: Universitas Gadjah Mada 2020-04-01
Series:IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
Subjects:
Online Access:https://jurnal.ugm.ac.id/ijccs/article/view/54507
Description
Summary:Children are the future of the nation. All treatment and learning they get would affect their future. Nowadays, there are various kinds of social problems related to children.  To ensure the right solution to their problem, social workers usually refer to the social-child-case (SCC) documents to find similar cases in the past and adapting the solution of the cases. Nevertheless, to read a bunch of documents to find similar cases is a tedious task and needs much time. Hence, this work aims to categorize those documents into several groups according to the case type. We use topic modeling with Latent Dirichlet Allocation (LDA) approach to extract topics from the documents and classify them based on their similarities. The Coherence Score and Perplexity graph are used in determining the best model. The result obtains a model with 5 topics that match the targeted case types. The result supports the process of reusing knowledge about SCC handling that ease the finding of documents with similar cases
ISSN:1978-1520
2460-7258