




News consumption has changed drastically: 87% of the respondents in one survey cited online sources as their primary source of news. This makes news articles more accessible than ever, but the vast amount of news published online every day can overwhelm readers. News classification is therefore important, and given the volume of news published, machine learning is needed to classify the articles, which is what this project set out to do. The objectives of this project are to find the best machine learning model for news classification, to develop a process for online news extraction, and to develop an automated system that extracts and classifies news articles. A Support Vector Machine (SVM) with TF-IDF vectorization was found to be the best machine learning model for news article classification, and online news articles were extracted using a Python script. The extracted articles were then used to develop the machine learning model, which achieved an accuracy of 89.92%. The developed model was then used for classification in the automated news classification program, which was also built in Python. In the end, this project produced an automated news classification system that will help readers view the news articles that interest them.
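The abstract names TF-IDF vectorization as the feature representation but does not give the formula, library, or parameters used. The following is a minimal pure-Python sketch of one common TF-IDF variant (smoothed inverse document frequency with L2 normalization). The function names, the whitespace tokenizer, and the sample headlines are all hypothetical and chosen only for illustration. They are not taken from the project.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase and split on whitespace. A real pipeline would
    # likely also strip punctuation and stop words.
    return text.lower().split()


def fit_idf(documents):
    # Count, for each term, how many documents contain it at least once.
    n_docs = len(documents)
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(tokenize(doc)))
    # Smoothed IDF: ln((1 + n) / (1 + df)) + 1, so that a term appearing in
    # every document still gets a non-zero weight of exactly 1.
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def tfidf_vector(document, idf):
    # Term frequency is the raw count; terms unseen during fitting are dropped.
    counts = Counter(t for t in tokenize(document) if t in idf)
    weights = {t: c * idf[t] for t, c in counts.items()}
    # L2-normalize so that long and short articles are comparable.
    norm = math.sqrt(sum(w * w for w in weights.values()))
    if norm == 0:
        return {}
    return {t: w / norm for t, w in weights.items()}


# Hypothetical sample headlines, not data from the project.
headlines = [
    "the team won the football match",
    "the bank raised interest rates",
    "the striker scored in the match",
]
idf = fit_idf(headlines)
vectors = [tfidf_vector(h, idf) for h in headlines]
```

In this sketch, a word that occurs in every headline ("the") receives a lower IDF than a word that occurs in only one ("bank"). Sparse vectors of this kind are the sort of input a linear classifier such as an SVM would be trained on.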

Item Type: Final Year Project
Academic Subject : Academic Department - Information Communication Technology
Subject: Q Science > Q Science (General)
Divisions: Sciences and Information Technology > Computer and Information Sciences
Depositing User: Ahmad Suhairi Mohamed Lazim
Date Deposited: 23 Sep 2021 23:46
Last Modified: 23 Sep 2021 23:46
URI: http://utpedia.utp.edu.my/id/eprint/21687
