Info Extract
NLP
POS Tagger
Crawlers

Privacy Policy

POS Tagger

TAIParse Part-of-Speech (POS) Tagger (DOWNLOAD)

We are proud to announce the release of a standalone freeware executable of TAIParse featuring part-of-speech tagging. A tagger is a necessary component of most text analysis systems, as it assigns a syntax class (e.g., noun, verb, adjective, adverb) to every word in a sentence.

The tagger produces an output format almost identical to that of the Penn Treebank Project, including bracketing of noun phrases. The current version achieves 94% accuracy in a blind test that we use to assess progress.

The tagger has been built manually with general rules and methods. The entire analyzer definition, in our NLP++ language, is supplied with the download   In contrast to other taggers, which are overtrained for particular document sets and use overly specific rules, this tagger can readily be applied to unseen text types.

Editing, enhancing, and compiling the tagger requires Professional VisualText, available automatically by DOWNLOAD.

We welcome your feedback, questions, and suggestions.

 DOWNLOAD TAIParse 0.8 beta, focusing on POS tagging and shallow parsing.

Reference: Tagset used in Penn Treebank.
 

CC

Coordinating conjunction

CD

Cardinal number

DT

Determiner

EX

Existential there

FW

Foreign word

IN

Preposition or subordinating conjunction

JJ

Adjective

JJR

Adjective, comparative

JJS

Adjective, superlative

LS

List item marker

MD

Modal

NN

Noun, singular or mass

NP

Proper noun singular

NPS

Proper noun plural

PDT

Predeterminer

POS

Possessive ending

PP

Personal pronoun

PP$

Possessive pronoun

RB

Adverb

RBR

Adverb, comparative

RBS

Adverb, superlative

RP

Particle

SYM

Symbol

TO

To

UH

Interjection

VB

Verb, base form

VBD

Verb, past tense

VBG

Verb, gerund or present participle

VBN

Verb, past participle

VBP

Verb, noun-3rd person singular present

VBZ

Verb, 3rd person singular present

WDT

Wh-determiner

WP

Wh-pronoun

WP$

Possessive wh-pronoun

WRB

Wh-adverb

Keywords: part of speech tagger, pos tagger, postagger,  syntax class, syntactic class.