YAACOB, NORHANA (2006) JTAGGER. Universiti Teknologi Petronas. (Unpublished)


Part-of-speech tagging, also called grammatical tagging, is the process of assigning each word in a sentence its corresponding part of speech, such as noun, verb, pronoun, or another lexical class marker. It is an important step in natural language processing. Tagging is difficult because words are ambiguous: the same word can represent more than one part of speech in different contexts. A word, phrase, or sentence is ambiguous if it has more than one meaning. The word 'light', for example, can mean not very heavy or not very dark. There are two types of ambiguity: lexical and structural. When a word has more than one meaning, it is said to be lexically ambiguous. When a phrase or sentence can have more than one structure, it is said to be structurally ambiguous. Part-of-speech tagging algorithms fall into three classes: rule-based taggers, stochastic taggers, and transformation-based taggers. In this project, a rule-based tagging algorithm is used to develop the system, which is named JTagger. The tagger initially assigns each word its most likely tag, estimated from a corpus annotated with the Penn Treebank tagset. JTagger performs the tagging process automatically with reasonable accuracy, sparing the reader the difficulty of tagging a sentence by hand. Part-of-speech tagging is important because it could help people understand English better. The system is written in Java because Java is platform independent and can run on any platform, including Microsoft Windows and UNIX.
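
The initial tagging step described in the abstract can be illustrated with a minimal Java sketch. It is not the JTagger source code: the class name BaselineTagger, its methods, the tiny hand-made training data, the NN default for unknown words, and the single contextual rule (NN becomes VB after TO) are all assumptions made for illustration. Only the tag names come from the Penn Treebank tagset.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of "most likely tag, then rules"; not JTagger itself.
class BaselineTagger {
    // word -> (tag -> number of times the word carried that tag in the corpus)
    private final Map<String, Map<String, Integer>> counts = new HashMap<>();

    // Record one (word, tag) pair observed in a tagged training corpus.
    void observe(String word, String tag) {
        counts.computeIfAbsent(word.toLowerCase(), k -> new HashMap<>())
              .merge(tag, 1, Integer::sum);
    }

    // Return the tag seen most often with this word.
    // Assumption: unknown words default to NN; ties are broken arbitrarily.
    String mostLikelyTag(String word) {
        Map<String, Integer> tagCounts = counts.get(word.toLowerCase());
        if (tagCounts == null) {
            return "NN";
        }
        String best = null;
        int bestCount = -1;
        for (Map.Entry<String, Integer> e : tagCounts.entrySet()) {
            if (e.getValue() > bestCount) {
                best = e.getKey();
                bestCount = e.getValue();
            }
        }
        return best;
    }

    // Tag a sentence: most likely tag per word, then one contextual rule.
    List<String> tag(List<String> words) {
        List<String> tags = new ArrayList<>();
        for (String w : words) {
            tags.add(mostLikelyTag(w));
        }
        // Illustrative hand-written rule: a noun reading (NN) directly
        // after "to" (TO) is re-tagged as a base-form verb (VB).
        for (int i = 1; i < tags.size(); i++) {
            if (tags.get(i - 1).equals("TO") && tags.get(i).equals("NN")) {
                tags.set(i, "VB");
            }
        }
        return tags;
    }

    public static void main(String[] args) {
        BaselineTagger tagger = new BaselineTagger();
        // Made-up counts standing in for a real tagged corpus.
        tagger.observe("she", "PRP");
        tagger.observe("wants", "VBZ");
        tagger.observe("to", "TO");
        tagger.observe("the", "DT");
        tagger.observe("light", "NN");
        tagger.observe("light", "NN");
        tagger.observe("light", "JJ");
        System.out.println(
            tagger.tag(Arrays.asList("She", "wants", "to", "light", "the", "lamp")));
        // prints [PRP, VBZ, TO, VB, DT, NN]
    }
}
```

In this example 'light' is most often a noun in the made-up counts, so it first receives NN; the contextual rule then corrects it to VB because it follows 'to'. This shows how rules can resolve the lexical ambiguity that a frequency lookup alone cannot.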

Item Type: Final Year Project
Academic Subject : Academic Department - Information Communication Technology
Subject: Z Bibliography. Library Science. Information Resources > ZA Information resources
Divisions: Sciences and Information Technology > Computer and Information Sciences
Date Deposited: 27 Sep 2013 11:10
Last Modified: 27 Jul 2021 15:46
URI: http://utpedia.utp.edu.my/id/eprint/7056
