Classification of New Titles by Two Stage Latent Dirichlet Allocation

Guven Z. A., DİRİ B., Cakaloglu T.

Innovations in Intelligent Systems and Applications Conference (ASYU), Adana, Türkiye, 4 - 06 Ekim 2018, ss.99-103, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası:
Doi Numarası: 10.1109/asyu.2018.8554027
Basıldığı Şehir: Adana
Basıldığı Ülke: Türkiye
Sayfa Sayıları: ss.99-103
Recep Tayyip Erdoğan Üniversitesi Adresli: Hayır

Özet

With the rapid development of the Internet, thousands of different news reports from different channels are presented to us. So much news, particularly in the media sector, is an important question to be categorized and archived without human effort. In this study, it is aimed to be able to determine which news item belongs to large news headlines collected from news sites. For this, a two stage method is proposed, which is based on the classical Latent Dirichlet Allocation (LDA) algorithm used in the model. With the developed two stage LDA method, comparison of the conventional LDA was made. Then, by creating a file with an arff extension from the word weights of the topics, the success of the machine learning methods in Weka was measured.