This directory contains freely available treebanks for Dutch. The treebanks are distributed as gzipped tar archives of XML files. The annotations follow the CGN/D-COI/Lassy annotation guidelines.
The treebanks are also available combined into a single file in DBXML format, lassy.dact.
More information is available from the Lassy website.
Name | Last modified | Size | Description | |
---|---|---|---|---|
lot_test_suite1-1271.tar.gz | 2024-02-26 11:58 | 819K | ||
lot_test_suite1.tar.gz | 2024-02-26 11:58 | 819K | 18 sentences from the LOT parsing task, many years ago | |
novelsample_corrected-1271.tar.gz | 2024-02-26 11:58 | 3.2M | ||
novelsample_corrected.tar.gz | 2024-02-26 11:58 | 3.2M | 100 sentences from novels annotated by Andreas | |
ntv-suite-1271.tar.gz | 2024-02-26 11:58 | 5.1M | ||
ntv-suite.tar.gz | 2024-02-26 11:58 | 5.1M | sentences annotated by students of the NTV course | |
lassy-doc-1271.tar.gz | 2024-02-26 11:59 | 9.0M | ||
lassy-doc.tar.gz | 2024-02-26 11:59 | 9.0M | ||
leuven_yellow_pages-1271.tar.gz | 2024-02-26 11:59 | 8.8M | ||
leuven_yellow_pages.tar.gz | 2024-02-26 11:59 | 8.8M | example sentences from Leuven Yellow Pages doc | |
cgn_exs-1271.tar.gz | 2024-02-26 11:59 | 12M | ||
cgn_exs.tar.gz | 2024-02-26 11:59 | 12M | example sentences from CGN docs | |
Titaantjes-1271.tar.gz | 2024-02-26 11:59 | 26M | ||
Titaantjes.tar.gz | 2024-02-26 11:59 | 26M | all sentences from Titaantjes by Nescio | |
i_suite-1271.tar.gz | 2024-02-26 11:59 | 29M | ||
i_suite.tar.gz | 2024-02-26 11:59 | 29M | more simple hand-crafted sentences for grammar testing | |
h_suite-1271.tar.gz | 2024-02-26 11:59 | 30M | ||
h_suite.tar.gz | 2024-02-26 11:59 | 30M | 1000 simple hand-crafted sentences for grammar testing | |
j_suite-1271.tar.gz | 2024-02-26 11:59 | 28M | ||
j_suite.tar.gz | 2024-02-26 11:59 | 28M | ||
g_suite-1271.tar.gz | 2024-02-26 11:59 | 30M | ||
g_suite.tar.gz | 2024-02-26 11:59 | 30M | 1000 simple hand-crafted sentences for grammar testing | |
wpspel-1271.tar.gz | 2024-02-26 11:59 | 38M | ||
wpspel.tar.gz | 2024-02-26 11:59 | 38M | 1000 questions from WPSPEL | |
extra-1271.tar.gz | 2024-02-26 12:00 | 63M | ||
extra.tar.gz | 2024-02-26 12:00 | 63M | weird mix of additional sentences for training Alpino | |
eans-1271.tar.gz | 2024-02-26 12:01 | 77M | ||
eans.tar.gz | 2024-02-26 12:01 | 77M | some examples from Electronic ANS | |
qa-1271.tar.gz | 2024-02-26 12:01 | 106M | ||
qa.tar.gz | 2024-02-26 12:01 | 106M | questions from the various CLEF shared tasks | |
cdb-1271.tar.gz | 2024-02-26 12:06 | 266M | ||
cdb.tar.gz | 2024-02-26 12:06 | 266M | latest version of Alpino Treebank | |
alpino.dact | 2024-02-26 12:35 | 1.2G | ||
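
Since each treebank above is a gzipped tar archive of XML files, the sentences can be read without unpacking the archive to disk. The following is a minimal sketch using only the Python standard library. It assumes that each archive member ending in `.xml` holds one parsed sentence and that the sentence text sits in a `<sentence>` element directly under the root, as in the Alpino XML format; the function name `iter_sentences` is hypothetical and not part of any distributed tool.

```python
import tarfile
import xml.etree.ElementTree as ET


def iter_sentences(archive_path):
    """Yield (member_name, sentence_text) for every XML file in a .tar.gz archive."""
    with tarfile.open(archive_path, mode="r:gz") as tar:
        for member in tar:
            # Skip directories and anything that is not an XML file.
            if not (member.isfile() and member.name.endswith(".xml")):
                continue
            root = ET.parse(tar.extractfile(member)).getroot()
            # Assumption: the sentence text is stored in a <sentence> child
            # of the root element; findtext returns None if it is absent.
            yield member.name, root.findtext("sentence")
```

After downloading one of the archives, for example `lot_test_suite1.tar.gz`, a loop such as `for name, text in iter_sentences("lot_test_suite1.tar.gz"): print(name, text)` would print each file name with its sentence. The `.dact` file is a DBXML container and cannot be read this way.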