Information Retrieval - using Porter Stemming Algorithm

Zulkifly, Zurida Azita (2006) Information Retrieval - using Porter Stemming Algorithm. Universiti Teknologi Petronas. (Unpublished)

Abstract

Stemming is the process of removing or transforming endings (suffixes) found on a word: inflectional endings (-s, -ing, -ed, etc.) and derivational endings (-ion, -ative, -ity, -ment, -less, etc.), as well as prefixes (un-, in-, etc.). The rationale for stemming is that similar words usually have similar meanings, so including words that are similar in meaning to those originally contained in the query should increase retrieval effectiveness. Many stemming methods have been developed. The main focus of this project, however, is the Porter Stemming Algorithm, developed by M.F. Porter in 1980. The objective of the project is to develop a system that demonstrates information retrieval using the Porter Stemming Algorithm. The central problem in information retrieval is to return documents that are relevant to the user's query. Performance is measured with two metrics: precision and recall. The scope of the project is to implement the original Porter Stemming Algorithm in the application in order to improve precision and recall when retrieving documents. Although many improvements have been made to the Porter algorithm, this project focuses on the original version. The Porter Stemming Algorithm has five phases, each with its own rules for stripping suffixes. By implementing the algorithm, the application is expected to retrieve only documents that are relevant to the user's query.
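To make the ideas in the abstract concrete, the following minimal Python sketch shows only the first rule group (step 1a, plural endings) of the original Porter algorithm together with the standard definitions of precision and recall. It is not the project's implementation: the toy collection `docs`, the relevance judgments in `relevant`, and the helper names `step_1a`, `precision_recall` and `search` are all assumptions made for illustration.

```python
def step_1a(word: str) -> str:
    """Step 1a of the original Porter algorithm (plural endings only)."""
    if word.endswith("sses"):
        return word[:-2]   # caresses -> caress
    if word.endswith("ies"):
        return word[:-2]   # ponies -> poni
    if word.endswith("ss"):
        return word        # caress -> caress
    if word.endswith("s"):
        return word[:-1]   # cats -> cat
    return word


def precision_recall(retrieved: set, relevant: set) -> tuple:
    """Precision = hits / retrieved, recall = hits / relevant."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical toy collection: document id -> text.
docs = {
    "d1": "cats chase ponies",
    "d2": "the cat sleeps",
    "d3": "dogs bark",
}
# Assumed relevance judgments for the query "cat".
relevant = {"d1", "d2"}


def search(query: str, stem) -> set:
    """Return ids of documents containing the (stemmed) query term."""
    q = stem(query)
    return {
        doc_id
        for doc_id, text in docs.items()
        if q in {stem(token) for token in text.split()}
    }


exact = search("cat", lambda w: w)   # no stemming
stemmed = search("cat", step_1a)     # with step 1a stemming

print("exact:  ", sorted(exact), precision_recall(exact, relevant))
print("stemmed:", sorted(stemmed), precision_recall(stemmed, relevant))
```

In this toy setting, exact matching misses the document that contains only the plural "cats", while stemming both the query and the documents lets it match, which raises recall without lowering precision. On real collections stemming can also conflate unrelated words, so precision may drop.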

Item Type: Final Year Project
Academic Subject : Academic Department - Information Communication Technology
Subject: Z Bibliography. Library Science. Information Resources > ZA Information resources
Divisions: Sciences and Information Technology > Computer and Information Sciences
Date Deposited: 27 Sep 2013 12:18
Last Modified: 27 Jul 2021 15:46
URI: http://utpedia.utp.edu.my/id/eprint/7082
