Software

Here is an overview of the natural language processing software currently used for the automatic annotation of the Groningen Meaning Bank.

  • elephant is the statistical tool for word and sentence segmentation used in the GMB.

  • The C&C tools provide the automatic annotation of the GMB: POS tagging, NE tagging, and syntactic parsing (CCG).

  • We use morpha for morphological analysis.

  • Further included in the suite is Boxer which prodices semantic representations (DRS) on top of the CCG parse trees.

  • An online demo of the C&C and Boxer pipeline is available here. The demo is based on a REST API.