Welcome To UTPedia

We would like to introduce you, the new knowledge repository product called UTPedia. The UTP Electronic and Digital Intellectual Asset. It stores digitized version of thesis, final year project reports and past year examination questions.

Browse content of UTPedia using Year, Subject, Department and Author and Search for required document using Searching facilities included in UTPedia. UTPedia with full text are accessible for all registered users, whereas only the physical information and metadata can be retrieved by public users. UTPedia collaborating and connecting peoples with university’s intellectual works from anywhere.

Disclaimer - Universiti Teknologi PETRONAS shall not be liable for any loss or damage caused by the usage of any information obtained from this web site.Best viewed using Mozilla Firefox 3 or IE 7 with resolution 1024 x 768.

AN INTEGRATED GENERIC TEXT CLASSIFICATION ALGORITHM FOR INDONESIAN AND MALAY NEWS DOCUMENTS

,, ZUL INDRA (2015) AN INTEGRATED GENERIC TEXT CLASSIFICATION ALGORITHM FOR INDONESIAN AND MALAY NEWS DOCUMENTS. Masters thesis, Universiti Teknologi PETRONAS.

[img] PDF
Restricted to Registered users only

Download (6Mb)

Abstract

Text classification (TC)provides a better wayto organize information since it allows better understanding and interpretation of the content. It deals with the assignment of labels into a group of similar textual document. However, TC research for Asian language documents is relatively limited compared to English documents and even lesser particularly for news articles. Apart from that, TC research to classify textual documents in similar morphology such Indonesian and Malay is still scarce. Hence, the aimof this study is to develop an integrated generic TCalgorithm which is able to identify the language and then classify the category for identified news documents. Furthermore, top-ra feature selection method is utilised to improve TCperformance andto overcome theonline news corpora classification challenges: rapid datagrowth of online news documents, and the high computational time.

Item Type: Thesis (Masters)
Academic Subject : Academic Department - Information Communication Technology
Subject: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Sciences and Information Technology > Computer and Information Sciences
Depositing User: Ahmad Suhairi Mohamed Lazim
Date Deposited: 18 Sep 2021 21:14
Last Modified: 18 Sep 2021 21:14
URI: http://utpedia.utp.edu.my/id/eprint/21420

Actions (login required)

View Item View Item

Document Downloads

More statistics for this item...