Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
linguisticsweb:resources:corpora [2023/04/06 12:25]
sabinebartsch
linguisticsweb:resources:corpora [2023/04/06 12:36]
sabinebartsch [Tag sets]
Line 2: Line 2:
  
 ===== Tag sets ===== ===== Tag sets =====
 +
 +==== Penn TreeBank tag set ====
 +
 +Reference:
 +Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: the penn treebank. Comput. Linguist. 19, 2 (June 1993), 313–330. [[https://dl.acm.org/doi/10.5555/972470.972475|Ref]]
  
 ^pos tag^description^example^ ^pos tag^description^example^
-|CC|coordinating conjunction|and| +|CC|coordinating conjunction|and, or
-|CD|cardinal number|1, third| +|CD|cardinal number|3, third| 
-|DT|determiner|the|+|DT|determiner|the, this|
 |EX|existential there|there is| |EX|existential there|there is|
-|FW|foreign word|les|+|FW|foreign word|tabula|
 |IN|preposition, subordinating conjunction|in, of, like| |IN|preposition, subordinating conjunction|in, of, like|
 |IN/that|that as subordinator|that| |IN/that|that as subordinator|that|
-|JJ|adjective|green+|JJ|adjective|blue, happy
-|JJR|adjective, comparative|greener+|JJR|adjective, comparative|bluer, happier
-|JJS|adjective, superlative|greenest|+|JJS|adjective, superlative|bluest, happiest|
 |LS|list marker|1)| |LS|list marker|1)|
 |MD|modal|could, will| |MD|modal|could, will|
-|NN|noun, singular or mass|table+|NN|noun, singular or mass|house
-|NNS|noun plural|tables+|NNS|noun plural|houses
-|NP|proper noun, singular|John+|NP|proper noun, singular|Carrie
-|NPS|proper noun, plural|Vikings+|NPS|proper noun, plural|Americans
-|PDT|predeterminer|both the boys+|PDT|predeterminer|both as in "both the girls"
-|POS|possessive ending|friend’s| +|POS|possessive ending|person’s| 
-|PP|personal pronoun|I, he, it| +|PP|personal pronoun|I, she, it| 
-|PPZ|possessive pronoun|my, his|+|PPZ|possessive pronoun|my, his, your|
 |RB|adverb|however, usually, naturally, here, good| |RB|adverb|however, usually, naturally, here, good|
 |RBR|adverb, comparative|better| |RBR|adverb, comparative|better|
 |RBS|adverb, superlative|best| |RBS|adverb, superlative|best|
-|RP|particle|give up|+|RP|particle|up as in "give up"|
 |SENT|Sentence-break punctuation|. ! ?| |SENT|Sentence-break punctuation|. ! ?|
 |SYM|Symbol|/ [ = *| |SYM|Symbol|/ [ = *|
-|TO|infinitive ‘to’|togo+|TO|infinitive ‘to’|to play
-|UH|interjection|uhhuhhuhh|+|UH|interjection|aha|
 |VB|verb be, base form|be| |VB|verb be, base form|be|
 |VBD|verb be, past tense|was, were| |VBD|verb be, past tense|was, were|
Line 50: Line 55:
 |VVP|verb, sing. present, non-3d|take| |VVP|verb, sing. present, non-3d|take|
 |VVZ|verb, 3rd person sing. present|takes| |VVZ|verb, 3rd person sing. present|takes|
-|WDT|wh-determiner|which|+|WDT|wh-determiner|which, who|
 |WP|wh-pronoun|who, what| |WP|wh-pronoun|who, what|
 |WP$|possessive wh-pronoun|whose| |WP$|possessive wh-pronoun|whose|
Line 62: Line 67:
 |,|Comma|,| |,|Comma|,|
 |:|Punctuation|– ; : — …| |:|Punctuation|– ; : — …|
- 
- 
  
 ===== Corpora ===== ===== Corpora =====