This page contains the datasets used in [1], i.e. training and test datasets for Dutch in retagged CoNLL format. The data was converted from Alpino XML into CoNLL format based on an adapted version of Erwin Marsi's conversion software [2], but PoS tags were replaced by automatically assigned Alpino tags.
[1] Barbara Plank. Improved statistical measures to assess natural language parser performance across domains. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC2010), Valletta, Malta, May 2010.
[2] The conversion of Alpino Treebank XML to CoNLL format is based on Erwin Marsi's tool developed for CoNLL-X 2007, available at: http://nextens.uvt.nl/depparse-wiki/SharedTaskWebsite However, instead of using MBT tags, we adapted the conversion scripts such that they use Alpino Pos tags. (adapted scripts will be made available here)