This shows you the differences between two versions of the page.
linguisticsweb:resources:corpora [2021/05/20 21:21] sabinebartsch |
linguisticsweb:resources:corpora [2023/04/06 12:22] sabinebartsch [Corpora and other language resources] |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== Corpora and other language resources ====== | ====== Corpora and other language resources ====== | ||
+ | |||
+ | ===== Tag sets ===== | ||
+ | |||
+ | |||
===== Corpora ===== | ===== Corpora ===== | ||
Line 12: | Line 16: | ||
|DWDS Kernkorpus| | |DWDS Kernkorpus| | ||
|DWDS Kernkorpus 21| |2000-2010 | |DWDS Kernkorpus 21| |2000-2010 | ||
- | |Deutscher Wortschatz Project|35 mio. sentences, 500 mio. words| | ||
|Hamburg Dependency Treebank| | |Hamburg Dependency Treebank| | ||
|IDS-Corpora| | |IDS-Corpora| | ||
|LIMAS-Korpus|1 mio words, 500 texts / fragments|1970s|http:// | |LIMAS-Korpus|1 mio words, 500 texts / fragments|1970s|http:// | ||
+ | |Arabic News Texts Corpus (AntCorpus)| | | https:// | ||
+ | |Wortschatz Leipzig|various sample sizes|Arabic, | ||
+ | |SpråkbankenText| | |https:// | ||