Publications by Gertjan van Noord

    2010

  1. Gertjan van Noord. Self-trained Bilexical Preferences to Improve Disambiguation Accuracy. In: Harry Bunt, Paola Merlo and Joakim Nivre (editors), Trends in Parsing Technology. Springer Verlag. To appear. [draft pdf]

    2009

  2. Daniël de Kok, Jianqiang Ma and Gertjan van Noord, A generalized method for iterative error mining in parsing results. In: ACL2009 Workshop Grammar Engineering Across Frameworks (GEAF), Singapore, 2009. [pdf]
  3. Kostadin Cholakov and Gertjan van Noord. Combining Finite State and Corpus-based Techniques for Unknown Word Prediction. To appear in RANLP 2009. [pdf]
  4. Gertjan van Noord and Gosse Bouma. Parsed Corpora for Linguistics. In: Proceedings of EACL Workshop The Interaction between Linguistics and Computational Linguistics: Virtuous, Vicious or Vacuous? Athens, 2009. pp 33-39. [pdf]
  5. Gertjan van Noord, Learning Efficient Parsing. In: EACL 2009. The 12th Conference of the European Chapter of the Association for Computational Linguistics. 30 March - 3 April 2009, Athens, Greece. pp 817-825. [pdf]
  6. Gertjan van Noord et al., Lassy Syntactische Annotatie. In preparation. pdf of draft
  7. Gertjan van Noord. Huge Parsed Corpora in LASSY. In: Frank van Eynde, Anette Frank, Koenraad de Smedt, Gertjan van Noord (editors), Proceedings of the Seventh International Workshop on Treebanks and Linguistic Theories (TLT 7). January 23-24, 2009, Groningen, The Netherlands. LOT Occasional Series. [LOT site]
  8. Frank van Eynde, Anette Frank, Koenraad de Smedt, Gertjan van Noord (editors), Proceedings of the Seventh International Workshop on Treebanks and Linguistic Theories (TLT 7). January 23-24, 2009, Groningen, The Netherlands. LOT Occasional Series. [LOT site]

    2008

  9. Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann. Question Answering with Joost at CLEF 2008. CLEF 2008 Working Notes. Aarhus Denmark. [pdf]
  10. Barbara Plank and Gertjan van Noord. Exploring An Auxiliary Distribution based approach to Domain Adaptation of a Syntactic Disambiguation Model. In: Coling Workhop 'Cross Framework and Cross Domain Parser Evaluation'. [pdf]
  11. N. Oostdijk, M. Reynaert, P. Monachesi, G. van Noord, R. Ordelman, I. Schuurman, V. Vandeghinste. From D-Coi to SoNaR: A reference corpus for Dutch. In: LREC 2008. [pdf]
  12. Gosse Bouma, Geert Kloosterman, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann. Question Answering with Joost at CLEF 2007. In: Carol Peters, Valentin Jijkoun, Thomas Mandl, Henning Mueller, Douglas W. Oard, Anselmo Penas, Vivien Petras, Diana Santos (editors), Advances in Multilingual and Multimodal Information Retrieval, 8th workshop of the Cross-Language Evaluation Form, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers. Lecture Notes in Computer Science 5152, Springer 2008. pp 257-260.

    2007

  13. Gosse Bouma, Geert Kloosterman, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann. Question Answering with Joost at CLEF 2007, CLEF 2007 Working Notes. [pdf]
  14. Gertjan van Noord. Using Self-Trained Bilexical Preferences to Improve Disambiguation Accuracy. In: Proceedings of the Tenth International Conference on Parsing Technologies. IWPT 2007, Prague. Pages 1--10. [pdf]
  15. Timothy Baldwin, Mark Dras, Julia Hockenmaier, Tracy Holloway King, Gertjan van Noord. The Impact of Deep Linguistic Processing on Parsing Technology. In: Proceedings of the Tenth International Conference on Parsing Technologies. IWPT 2007, Prague. Pages 36--38. [pdf from ACL Anthology]
  16. Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann. Using Syntactic Knowledge for QA. In: C. Peters, P. Clough, F. Gey, J. Karlgren, B. Magnini, D.W. Oard, M. de Rijke, M. Stempfhuber (editors) Evaluation of Multilingual and Multi-modal Information Retrieval. Lecture Notes in Computer Science 4730/2007. Springer. pp 318--327. [concept pdf]
  17. Martijn Wieling, Mark-Jan Nederhof, Gertjan van Noord. Parsing Partially Bracketed Input. In: CLIN 2005. Proceedings of the 16th Meeting of Computational Linguistics in the Netherlands. Pages 1--16. [pdf]
  18. Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann. Linguistic Knowledge and Question Answering. In: Traitement Automatique des Langues 46 (3) 2005. Pages 15--39. Appeared in 2007. [concept pdf]

    2006

  19. Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas, Jörg Tiedemann. The University of Groningen at QA@CLEF2006. Using Syntactic Knowledge for QA. [pdf, website]
  20. Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann. Question Answering for Dutch using Dependency Relations. In: C. Peters, F. Gey, J. Gonzalo, H. Mueller, G. Jones, M. Kluck, B. Magnini, M. de Rijke (editors), Accessing Multilingual Information Repositories. Lecture Notes in Computer Science Vol. 4022/2006. Springer. Pages 370-379. [pdf]
  21. Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste, Syntactic Annotation of Large Corpora in STEVIN. In: LREC 2006. [pdf]
  22. Gertjan van Noord. At Last Parsing Is Now Operational. In: Piet Mertens, Cedrick Fairon, Anne Dister, Patrick Watrin (editors): TALN06. Verbum Ex Machina. Actes de la 13e conference sur le traitement automatique des langues naturelles. Page 20--42. [pdf]

    2005

  23. Gertjan van Noord and Valia Kordoni. A Raising Analysis of the Dutch Passive. In: Proceedings of HPSG2005. [website,pdf]
  24. Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann. Question Answering for Dutch using Dependency Relations. In: CLEF2005 workshop. [pdf]
  25. Gosse Bouma, Jori Mur, Gertjan van Noord. Reasoning over Dependency Relations for QA. In: KRAQ05. [pdf]
  26. Gertjan van Noord, Robert Malouf. Wide Coverage Parsing with Stochastic Attribute Value Grammars. Draft. [pdf]
  27. Frederik Fouvry, Valia Kordoni, Gertjan van Noord. Object-to-Subject Raising: An Analysis of the Dutch Passive. In: HPSG05. See above for a more recent version, in the HPSG2005 proceedings.

    2004

  28. Gertjan van Noord. Error Mining for Wide-Coverage Grammar Engineering. In: ACL 2004, Barcelona. [pdf, ps] ACL Anthology
  29. Robert Malouf, Gertjan van Noord. Wide Coverage Parsing with Stochastic Attribute Value Grammars. In: IJCNLP-04 Workshop Beyond Shallow Analyses - Formalisms and statistical modeling for deep analyses. [pdf,web page]
  30. Jan Daciuk, Gertjan van Noord. Finite Automata for Compact Representation of Tuple Dictionaries. Theoretical Computer Science. Volume 313, Issue 1, Pages 45--56. [postscript, pdf]
  31. Robbert Prins and Gertjan van Noord. Reinforcing Parser Preferences through Tagging. In special issue on Evolutions in Parsing of the journal Traitement Automatique des Langues volume 44(3) 2003, pages 121-139. [postscript, pdf]

    2003

  32. Gertjan van Noord, Finite State Processing. In: Lynn Nadel (editor-in-chief), Encyclopedia of Cognitive Science. pp 130--134. (Originally: Nature Publishing Group; Now: Wiley). [homepage]
  33. Lauri Karttunen, Kimmo Koskenniemi, Gertjan van Noord. Special issue: Finite State Methods in Natural Language Processing. Natural Language Engineering. Volume 9, Part 1, March 2003. [postscript, pdf]

    2002

  34. Leonoor van der Beek, Gosse Bouma, Jan Daciuk, Tanja Gaustad, Robert Malouf, Gertjan van Noord, Robbert Prins, Begona Villada. Algorithms for Linguistic Processing. NWO PIONIER Progress Report. [postscript, pdf]
  35. Leonoor van der Beek, Gosse Bouma, Robert Malouf, Gertjan van Noord. The Alpino Dependency Treebank. In: Mariët Theune, Anton Nijholt, Hendri Hondorp (editors). Computational Linguistics in the Netherlands CLIN 2001. Selected papers from the Twelfth CLIN Meeting. Rodopi 2002. [postscript, pdf, html, ordering info]
  36. Leonoor van der Beek, Gosse Bouma, Gertjan van Noord. Een brede computationele grammatica voor het Nederlands. Nederlandse Taalkunde, jaargang 7, 2002-4. [in Dutch]. 353--374. [postscript, pdf]
  37. Jan Daciuk and Gertjan van Noord, Finite Automata for Compact Representation of Language Models in NLP. In: Bruce Watson, Derick Wood (eds), Implementation and Application of Automata. Lecture Notes in Computer Science. 65--73.

    2001

  38. Lauri Karttunen, Kimmo Koskenniemi, Gertjan van Noord (editors), Finite State Methods in Natural Language Processing. FSMNLP 2001. Extended Abstracts. ESSLLI Workshop, Helsinki 2001. [homepage with links to on-line versions]
  39. Tony Mullen, Robert Malouf and Gertjan van Noord, Statistical Parsing of Dutch using Maximum Entropy Models with Feature Merging. NLPRS01, Tokyo. [postscript, pdf, html ]
  40. Robbert Prins, Gertjan van Noord, Unsupervised POS-tagging Improves Parsing Accuracy and Parsing Efficiency. IWPT, Beijing. See above for the article entitled Reinforcing Parser Preferences through Tagging
  41. Jan Daciuk, Gertjan van Noord, Finite Automata for Compact Representation of Language Models in NLP. CIAA 2001, Pretoria. See above for the article entitled Finite Automata for Compact Representation of Tuple Dictionaries.
  42. Gosse Bouma, Gertjan van Noord, Robert Malouf. Alpino: Wide Coverage Computational Analysis of Dutch. In: Computational Linguistics in the Netherlands CLIN 2000. [postscript, pdf, html, ordering info ]
  43. Gertjan van Noord and Dale Gerdemann. Finite State Transducers with Predicates and Identity. Grammars 4 (3). [postscript, pdf, html ]
  44. Jean-Claude Junqua and Gertjan van Noord (editors), Robustness in Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1 [home page]
  45. Jean-Claude Junqua and Gertjan van Noord, Introduction. In: Jean-Claude Junqua and Gertjan van Noord (editors), Robustness in Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1 [home page]
  46. Gertjan van Noord, Robust Parsing of Word Graphs. In: Jean-Claude Junqua and Gertjan van Noord (editors), Robustness in Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1 [home page]
  47. Gertjan van Noord, Dale Gerdemann, An Extendible Regular Expression Compiler for Finite-state Approaches in Natural Language Processing. In: O.Boldt, H.Juergensen (eds), Automata Implementation. 4th International Workshop on Implementing Automata, WIA '99, Potsdam Germany, July 1999, Revised Papers. Springer Lecture Notes in Computer Science 2214, 2000. [postscript, pdf, html, order ]

    2000

  48. Dale Gerdemann, Gertjan van Noord. Approximation and Exactness in Finite State Optimality Theory. In: Jason Eisner, Lauri Karttunen, Alain Thériault (editors), SIGPHON 2000, Finite State Phonology. Proceedings of the Fifth Workshop of the ACL Special Interest Group in Computational Phonology. August 2000, Luxembourg. [postscript, pdf, html, cmp-lg full proceedings, cmp-lg ]
  49. Gertjan van Noord, Treatment of Epsilon Moves in Subset Construction. Computational Linguistics, 26 (1). [postscript, pdf, html ] ACL Anthology
  50. Gertjan van Noord, Grammar-based Natural Language Understanding. Priority Programme Language and Speech Technology, Technical Report 121. [postscript, pdf]
  51. Gertjan van Noord. FSA6 Reference Manual [html, home page]

    1999

  52. Gertjan van Noord, Algorithms for Linguistic Processing. PIONIER project proposal (accepted by NWO in spring 1999). [html, postscript, pdf ]
  53. Gert Veldhuijzen van Zanten, Gosse Bouma, Khalil Sima'an, Gertjan van Noord, Remko Bonnema. Evaluation of the NLP Components of the OVIS2 Spoken Dialogue System. In: van Eynde, Schuurman and Schelkens (eds), Computational Linguistics in the Netherlands 1998, Rodopi Amsterdam, 1999, pages 213--229. [postscript, pdf, html, cmp-lg]
  54. Gertjan van Noord, Dale Gerdemann, An Extendible Regular Expression Compiler for Finite-state Approaches in Natural Language Processing. WIA 99, Potsdam Germany. See above for later version.
  55. Dale Gerdemann, Gertjan van Noord, Transducers from Rewrite Rules with Backreferences. EACL 99, Bergen Norway. [html, postscript, pdf, cmp-lg] ACL Anthology
  56. Gertjan van Noord, Efficient Approximation to a Word Graph Search Algorithm. Priority Programme Language and Speech Technology, Technical Report 78.
  57. Gertjan van Noord, Techniques for Fast and Accurate Reduction of Word-graph Size. Priority Programme Language and Speech Technology, Technical Report 77.
  58. Gertjan van Noord, Hdrug Reference Manual. [html, home page ]
  59. Gertjan van Noord, Gosse Bouma, Rob Koeling, Mark-Jan Nederhof. Robust Grammatical Analysis for Spoken Dialogue Systems. Natural Language Engineering, 5(1), 1999, pages 45--93 (written 1997). [ postscript, html, pdf cmp-lg ]

    1998

  60. Gertjan van Noord. Fsa Utilities User Manual Version 5. See above for more recent version.
  61. Remko Bonnema, Gertjan van Noord, Gert Veldhuizen van Zanten, Evaluation Results NLP Components OVIS2. Priority Programme Language and Speech Technology, Technical Report 57. See above for newer version.
  62. Gertjan van Noord, Treatment of Epsilon-moves in Subset Construction. Appears in FSMNLP98. An improved version has been published in Computational Linguistics. See above.
  63. Gosse Bouma, Gertjan van Noord, Word Order Constraints on Verb Clusters in German and Dutch. In: Erhard Hinrichs, Tsuneko Nakazawa, Andreas Kathol (editor) Complex predicates in Nonderivational Syntax. Academic Press. [postscript, html, pdf ] (completed in 1996).
  64. Gertjan van Noord, Guenter Neumann, Syntactic Generation. [html, postscript pdf]. This is (a local copy of) chapter 4 of the Survey of the State of the Art in Human Language Technology edited by Ronald A. Cole, Joseph Mariani, Hans Uszkoreit, Annie Zaenen and Victor Zue. This report is sponsored by the National Science Foundation, Directorate XIII-E of the Commission of the European Communities, and the Center for Spoken Language Understanding, Oregon Graduate Institute. (completed in 1995).

    1997

  65. Gertjan van Noord, Prolog(Elex): a New Tool to Generate Prolog Tokenizers. Draft. [ postscript, html ]
  66. Gosse Bouma and Gertjan van Noord, Natuurlijke-taal Interfaces. In: Petra Hendriks, Niels Taatgen, Tjeerd Andringa, Breinmakers en Breinbrekers; Inleiding Cognitiewetenschap. Longman 1997. [ html ]
  67. Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord, Priority Programme Language and Speech Technology, Research Plan 1997-2000. [ html ]
  68. Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord, Priority Programme Language and Speech Technology, Progress Report 1995-1996. [ html ]
  69. Mark-Jan Nederhof, Gosse Bouma, Rob Koeling, Gertjan van Noord, Grammatical analysis in the OVIS spoken-dialogue system. Proceedings of the ACL/EACL Workshop on Spoken Dialog Systems, Madrid, Spain - July 11-12, 1997. [ postscript, dvi, html, comp-lg ] ACL Anthology
  70. Gertjan van Noord, Gosse Bouma. HDRUG. A Flexible and Extendible Development Environment for Natural Language Processing. Proceedings of the EACL/ACL Workshop ENVGRAM, Computational Environments for Grammar Development and Linguistic Engineering. [ postscript, html, the software ] ACL Anthology
  71. Gertjan van Noord. An Efficient Implementation of the Head-Corner Parser. Computational Linguistics, volume 23, number 3, 1997. [html, postscript, dvi, cmp-lg, the software ].
  72. Gertjan van Noord. FSA Utilities: A Toolbox to Manipulate Finite-state Automata. In: Darrell Raymond, Derick Wood and Sheng Yu (eds), Automata Implementation. Lecture Notes in Computer Science 1260, Springer Verlag. [official version from Springer, postscript, draft dvi, html, the software ]. (completed in 1996).

    1996

  73. Gertjan van Noord and Gosse Bouma, Dutch Verb Clustering without Verb Clusters. In: Patrick Blackburn, Maarten de Rijke (editors), Specifying Syntactic Structures. [postscript, dvi, html, ordering information ] (completed in 1995).
  74. Gertjan van Noord, Gosse Bouma, Rob Koeling, Mark-Jan Nederhof, Conventional Natural Language Processing in OVIS2: October 1996 Deliverables. NWO Priority Programme Language and Speech Technology - technical report: 28. [postscript]
  75. Gosse Bouma and Gertjan van Noord. Word Order Constraints on German Verb Clusters. Conference on Formal Grammar. Prague 1996. [postscript, dvi, html ]
  76. Gertjan van Noord. Robust Parsing with the Head-corner Parser. Robust Parsing Workshop (during ESSLLI 1996), 1996 Prague. [postscript, dvi, html ]
  77. Gertjan van Noord, Mark-Jan Nederhof, Rob Koeling, Gosse Bouma, Conventional Natural Language Processing in the Priority Programme on Language and Speech Technology: January 1996 Deliverables. NWO Priority Programme Language and Speech Technology - technical report: 22. [postscript].
  78. Gosse Bouma, Rob Koeling, Mark-Jan Nederhof, Gertjan van Noord, Grammatical Analysis in a Spoken Dialog System. Appears in CLCG Yearbook 1996. Also known as technical report 21 of the NWO Priority Programme Language and Speech Technology. [postscript, html ]

    1995

  79. Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord, Language and Speech Technology. Project plan for the NWO Priority Programme `Language and Speech Technology'.
  80. Gertjan van Noord and Gosse Bouma, Delayed Evaluation of Lexical Rules. 3 page abstract for the Aquilex Workshop on Lexical Rules. [ html, postscript, dvi ]
  81. Gertjan van Noord, The Intersection of Finite State Automata and Definite Clause Grammars. ACL 1995 Boston. [postscript, dvi, html, cmp-lg ] ACL Anthology

    1994

  82. On the Intersection of Finite State Automata and Definite Clause Grammars. Paper for TWLT 8, Twente. [html, postscript, dvi ]
  83. Head-corner parsing for TAG. In: Computational Intelligence, volume 10 number 4, page 525 - 534. [html, postscript, dvi ]
  84. Head-corner Parsing. In: C.J. Rupp, M.A. Rosner, R.L. Johnson (editors), Constraints, Language and Computation. Academic Press 1994. [postscript, dvi, html ] (completed in 1991). Some people think this is really weird.
  85. Gosse Bouma and Gertjan van Noord, A lexicalist account of the Dutch verb cluster. In: Gosse Bouma and Gertjan van Noord (editor), Papers from the Fourth Clin Meeting. [postscript, dvi, html]
  86. Gosse Bouma and Gertjan van Noord, Constraint-based Categorial Grammar. In: Proceedings of the ACL, New Mexico 1994. [html, postscript, dvi, cmp-lg ] ACL Anthology
  87. Gertjan van Noord and Gosse Bouma, The Scope of Adjuncts and the Processing of Lexical Rules In: Proceedings of Coling, 1994, Kyoto Japan. [html, postscript, dvi, cmp-lg ] ACL Anthology
  88. Gosse Bouma and Gertjan van Noord (editors), CLIN IV Papers from the fourth CLIN meeting.
  89. HDRUG: A Graphical User Environment for Natural Language Processing in Prolog. See above for more recent version.

    1993

  90. Reversibility in Natural Language Processing. Dissertation University of Utrecht 1993. [html, Printed version, postscript (700K), dvi ]
  91. Gosse Bouma and Gertjan van Noord, Head-driven Parsing for Lexicalist Grammars: Experimental Results. In: Proceedings of the EACL, 1993, Utrecht. [html, postscript, dvi ] ACL Anthology
  92. Gunter Neumann and Gertjan van Noord, Reversibility and Self-Monitoring in Natural Language Generation. In: Strzalkovski (editor) Reversible Grammars in NLP, 1993. [html, postscript, dvi ]

    1992

  93. Guenter Neumann and Gertjan van Noord, Self-monitoring with Reversible Grammars. In: Proceedings of Coling, 1992, Nantes. [html, postscript, dvi ] ACL Anthology

    1991

  94. Head-corner parsing for discontinuous constituency. (in: ACL 1991 Berkeley). ACL Anthology
  95. Towards Uniform Processing of Constraint-based Categorial Grammars. In: ACL workshop Reversible Grammar in Natural Language Processing. Berkely 1991. ACL Anthology
  96. Gertjan van Noord, Joke Dorrepaal, Pim van der Eijk, Maria Florenza, Herbert Ruessink, and Louis des Tombe An overview of MiMo2. In: Machine Translation 1991. The postscript version is here. The DVI version is here.
  97. Gertjan van Noord, Morphology in MiMo2. [dvi, postscript].

    1990

  98. An Overview of Head driven Bottom-up Generation. In: Robert Dale, Chris Mellish, Michael Zock (eds) Current Research in Natural Language Generation. 1990. The postscript version is here. The DVI version is here.
  99. Herbert Ruessink, Gertjan van Noord, Remarks on the Bottom-up Generation Algorithm. [dvi, postscript].
  100. Gertjan van Noord, Joke Dorrepaal, Pim van der Eijk, Maria Florenza, and Louis des Tombe The MiMo2 Research System. (In: MT conference 1990 Austin Texas). The postscript version is here. The DVI version is here.
  101. Stuart Shieber, Gertjan van Noord, Fernando Pereira and Robert Moore. Semantic-head-driven Generation. In: Computational Linguistics 1990. [postscript]. ACL Anthology
  102. Reversible Unification Based Machine Translation. (in: Coling 1990, Helsinki). The postscript version is here. The DVI version is here. ACL Anthology
  103. Joke Dorrepaal and Gertjan van Noord, Anaforische Relaties in het automatisch vertaalsysteem MiMo. In Anneke Neijt and Dik Bakker (editors), Computerlinguistiek, een overzicht in artikelen. 1990. Not available in electronic form.

    1989

  104. Stuart Shieber, Gertjan van Noord, Fernando Pereira and Robert Moore A Semantic-head-driven Generation Algorithm for Unification-based Formalisms. (In: ACL 1989, Vancouver). The postscript version is here. The DVI version is here. ACL Anthology
  105. Gertjan van Noord, Joke Dorrepaal, Doug Arnold, Steven Krauwer, Louisa Sadler and Louis des Tombe, An approach to sentence-level anaphora in Machine Translation. (in: EACL 1989 Manchester). The postscript version is here. The DVI version is here. ACL Anthology
  106. BUG: A Directed Bottom Up Generator for Unification Based Formalisms. Utrecht/Leuven working papers in Natural Language Processing 1989. The postscript version is here. The DVI version is here.