Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Last revision Both sides next revision
linguisticsweb:resources:corpora [2023/04/06 12:36]
sabinebartsch [Tag sets]
linguisticsweb:resources:corpora [2023/04/06 12:48]
sabinebartsch [Tag sets]
Line 3: Line 3:
 ===== Tag sets ===== ===== Tag sets =====
  
-==== Penn TreeBank tag set ==== +  * [[linguisticsweb:resources:corpora:tagsets|Penn TreeBank tag set]] 
- +  * CLAWS 5 tag set 
-Reference: +  CLAWS 7 tag set
-Mitchell P. Marcus, Mary Ann Marcinkiewicz, and Beatrice Santorini. 1993. Building a large annotated corpus of English: the penn treebank. Comput. Linguist. 19, 2 (June 1993), 313–330. [[https://dl.acm.org/doi/10.5555/972470.972475|Ref]] +
- +
-^pos tag^description^example^ +
-|CC|coordinating conjunction|and, or| +
-|CD|cardinal number|3, third| +
-|DT|determiner|the, this| +
-|EX|existential there|there is| +
-|FW|foreign word|tabula| +
-|IN|preposition, subordinating conjunction|in, of, like| +
-|IN/that|that as subordinator|that| +
-|JJ|adjective|blue, happy| +
-|JJR|adjective, comparative|bluer, happier| +
-|JJS|adjective, superlative|bluest, happiest| +
-|LS|list marker|1)| +
-|MD|modal|could, will| +
-|NN|noun, singular or mass|house| +
-|NNS|noun plural|houses| +
-|NP|proper noun, singular|Carrie| +
-|NPS|proper noun, plural|Americans| +
-|PDT|predeterminer|both as in "both the girls"+
-|POS|possessive ending|person’s| +
-|PP|personal pronoun|I, she, it| +
-|PPZ|possessive pronoun|my, his, your| +
-|RB|adverb|however, usually, naturally, here, good| +
-|RBR|adverb, comparative|better| +
-|RBS|adverb, superlative|best| +
-|RP|particle|up as in "give up"| +
-|SENT|Sentence-break punctuation|. ! ?| +
-|SYM|Symbol|/ [ = *+
-|TO|infinitive ‘to’|to play| +
-|UH|interjection|aha| +
-|VB|verb be, base form|be| +
-|VBD|verb be, past tense|was, were| +
-|VBG|verb be, gerund/present participle|being| +
-|VBN|verb be, past participle|been| +
-|VBP|verb be, sing. present, non-3d|am, are| +
-|VBZ|verb be, 3rd person sing. present|is| +
-|VH|verb have, base form|have| +
-|VHD|verb have, past tense|had| +
-|VHG|verb have, gerund/present participle|having| +
-|VHN|verb have, past participle|had| +
-|VHP|verb have, sing. present, non-3d|have| +
-|VHZ|verb have, 3rd person sing. present|has| +
-|VV|verb, base form|take| +
-|VVD|verb, past tense|took| +
-|VVG|verb, gerund/present participle|taking| +
-|VVN|verb, past participle|taken| +
-|VVP|verb, sing. present, non-3d|take| +
-|VVZ|verb, 3rd person sing. present|takes| +
-|WDT|wh-determiner|which, who| +
-|WP|wh-pronoun|who, what| +
-|WP$|possessive wh-pronoun|whose| +
-|WRB|wh-abverb|where, when| +
-|#|#|#| +
-|$|$|$| +
-|“|Quotation marks|‘ “| +
-|``|Opening quotation marks|‘ “| +
-|(|Opening brackets|( {| +
-|)|Closing brackets|) }| +
-|,|Comma|,+
-|:|Punctuation|– ; : — …| +
 ===== Corpora ===== ===== Corpora =====