Classification of New Titles by Two Stage Latent Dirichlet Allocation


Guven Z. A., DİRİ B., Cakaloglu T.

Innovations in Intelligent Systems and Applications Conference (ASYU), Adana, Türkiye, 4-6 October 2018, pp. 99-103

  • Publication Type: Conference Paper / Full-Text Paper
  • Volume Number:
  • DOI: 10.1109/asyu.2018.8554027
  • City of Publication: Adana
  • Country of Publication: Türkiye
  • Pages: pp. 99-103
  • Affiliated with Recep Tayyip Erdoğan Üniversitesi: No

Abstract

With the rapid development of the Internet, thousands of news reports from different channels are presented to us. Categorizing and archiving this volume of news without human effort is an important problem, particularly in the media sector. This study aims to determine the category to which each news item belongs, using a large set of news headlines collected from news sites. To this end, a two-stage method based on the classical Latent Dirichlet Allocation (LDA) algorithm is proposed. The developed two-stage LDA method was compared with conventional LDA. Then, a file with an .arff extension was created from the word weights of the topics, and the success of the machine learning methods in Weka was measured.
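
The final step described in the abstract, exporting numeric weights to an ARFF file for Weka, can be illustrated with a minimal sketch. This is not the authors' code: the function name `to_arff`, the relation name, the feature names, the class labels, and the toy weights are all hypothetical, since the record does not give the paper's features or data. It only shows the general shape of an ARFF file with numeric attributes and a nominal class.

```python
def to_arff(relation, feature_names, rows, class_labels):
    """Build ARFF text from numeric feature rows with a nominal class.

    rows is a list of (weights, label) pairs, where weights is a list of
    floats aligned with feature_names. All names here are illustrative.
    """
    lines = [f"@RELATION {relation}", ""]
    # One numeric attribute per feature (e.g. a hypothetical topic weight).
    for name in feature_names:
        lines.append(f"@ATTRIBUTE {name} NUMERIC")
    # The class is a nominal attribute listing every allowed label.
    lines.append("@ATTRIBUTE class {" + ",".join(class_labels) + "}")
    lines += ["", "@DATA"]
    for weights, label in rows:
        if len(weights) != len(feature_names):
            raise ValueError("row length does not match feature count")
        if label not in class_labels:
            raise ValueError(f"unknown class label: {label}")
        lines.append(",".join(f"{w:.4f}" for w in weights) + "," + label)
    return "\n".join(lines) + "\n"


# Toy data (assumed): two hypothetical weight features per headline.
example = to_arff(
    "news_headlines",
    ["topic_0", "topic_1"],
    [([0.9, 0.1], "sports"), ([0.2, 0.8], "economy")],
    ["sports", "economy"],
)
```

Writing `example` to a file with an `.arff` extension would give Weka a dataset in which each row is one headline and the last column is its category.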