Software
Here is an overview of the natural language processing software currently used for the automatic annotation of the Groningen Meaning Bank.
-
elephant is the statistical tool for word and sentence
segmentation used in the GMB.
-
The C&C tools
provide the automatic annotation of the
GMB: POS tagging, NE tagging, and syntactic parsing (CCG).
-
We use morpha for morphological analysis.
- Further included in the suite is
Boxer
which prodices semantic representations (DRS) on top of the CCG parse trees.
- An online demo of the C&C and Boxer pipeline is available here. The demo is based on a REST API.