Publications by Gertjan van Noord
2024
- Yuqing Zhang, Tessa Verhoef, Gertjan van Noord, Arianna Bisazza.
Endowing Neural Language Learners with Human-like Biases: A Case Study on Dependency Length Minimization.
Accepted for COLING 2024.
- Yuqing Zhang, Tessa Verhoef, Yuchen Lian, Gertjan van Noord, Arianna Bisazza.
Simulating Dependency Length Minimization using neural-network based learning and communication.
Accepted for Fifteenth International Conference on the Evolution of Language (EVOLANG), 2024.
- Gosse Bouma, Gertjan van Noord.
Van wie heeft u een foto op uw boksbal hangen? Extraction from NPs in Dutch.
In: Festschrift for Jack Hoeksema. Special Issue of TABU. University of Groningen Press.
https://ugp.rug.nl/TABU
- Lukas Edman, Gabriele Sarti, Antonio Toral, Gertjan van Noord, Arianna Bisazza.
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation.
arXiv:2302.14220. This version has been accepted for Transactions of the ACL (TACL), and
has been presented at EACL 2024 (Malta).
2023
- Andreas van Cranenburgh, Gertjan van Noord. OpenBoek:
A Corpus of Literary Coreference and Entities with an Exploration of Historical Spelling Normalization.
In: CLIN Journal Volume 12.
2022
- Lukas Edman, Antonio Toral, Gertjan van Noord. Subword-Delimited
Downsampling for Better Character-Level Translation. In: EMNLP Findings 2022.
- Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord,
Sebastian Ruder. Hyper-X: A Unified Hypernetwork for Multi-Task
Multilingual Transfer. arXiv:2205.12148 [cs.CL]. In: EMNLP
2022.
- Lukas Edman, Antonio Toral, Gertjan van Noord. Patching Leaks in
the Charformer for Efficient Character-Level Generation.
arXiv:2205.014086 [cs.CL].
- Prajit Dhar, Arianna Bisazza and Gertjan van Noord. Evaluating
Pre-training Objectives for Low-Resource Translation into
Morphologically Rich Languages. In: Proceedings of LREC 2022.
- Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord.
UDapter: Typology-based Language Adapters for Multilingual Dependency
Parsing and Sequence Labeling. In: Computational Linguistics, volume
48, issue 3. September 2022. pp 555-592.
- Jack Hoeksema, Kees de Glopper, Gertjan van Noord. Syntactic
Profiles in Secondary School Writing Using PaQu and SPOD. In:
CLARIN: The Infrastructure for Language Resources. Fišer, D. & Witt, A. (eds.).
De Gruyter, p. 691-707 17 p. (Digital Linguistics; vol. 1).
2021
- Lukas Edman, Antonio Toral, Gertjan van Noord.
The Importance of Context in Very Low Resource Language Modeling.
In: ICON 2021. Also available via arXiv:2205.04810 [cs.CL].
- Lukas Edman, Ahmet Üstün, Antonio Toral, Gertjan van
Noord. Unsupervised Translation of German--Lower Sorbian: Exploring
Training and Novel Transfer Methods on a Low-Resource Language. In:
WMT 2021.
- Prajit Dhar, Arianna Bisazza, Gertjan van Noord. Optimal Word
Segmentation for Neural Machine Translation into Dravidian
Languages. Proceedings of the 8th Workshop on Asian Translation,
pages 181--190 Bangkok, Thailand (online), August 5-6, 2021.
ACL Anthology.
2020
- Lukas Edman, Antonio Toral, Gertjan van Noord. Data Selection for
Unsupervised Translation of German-Upper Sorbian. In: WMT 2020.
ACL Anthology.
- Prajit Dhar, Arianna Bisazza, Gertjan van Noord. Linguistically Motivated Subwords Improve English-Tamil Transliation: University of Groningen's Submission to WMT-2020. In: WMT 2020.
ACL Anthology.
- Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord. Is
Typology-Based Adaptation Effective for Multilingual Sequence
Labelling? In: EMNLP Workshop SIGTYPE 2020.
pdf.
- Gertjan van Noord, Jack Hoeksema, Peter Kleiweg, Gosse Bouma. SPOD: Syntactic Profiler of Dutch. In: CLIN Journal, volume 10.
https://www.clinjournal.org/clinj.
- Peter Kleiweg, Gertjan van Noord. AlpinoGraph: A Graph-based
Search Engine for Flexible and Efficient Treebank Search. In:
TLT 2020.
ACL Anthology.
- Ahmet Üstün, Arianna Bisazza, Gosse Bouma, Gertjan van Noord.
UDapter: Language Adaptation for Truly Universal Dependency Parsing.
arXiv:2004.14327 [cs.CL]. [pdf]
In: EMNLP 2020.
ACL Anthology.
- Lukas Edman, Antonio Toral, Gertjan van Noord. Low-Resource
Unsupervised NMT: Diagnosing the Problem and Providing a
Linguistically Motivated Solution. In: Proceedings EAMT 2020.
[proceedings]
ACL Anthology.
- Antonio Branco, Nicoletta Calzolari, Piek Vossen, Gertjan van
Noord, Dieter Van Uytvank, Jõao Silva, Lúıs Gomes, Andŕe Moreira,
Willem Elbers. A Shared Task of a New, Collaborative Type to
foster Reproducibility: A first exercise in the area of language
science and technology with REPROLANG2020. In: LREC 2020.
[website with pdf].
ACL Anthology.
2019
- Wietse de Vries, Andreas van Cranenburgh, Arianna Bisazza, Tommaso Caselli, Gertjan van Noord, Malvina Nissim. BERTje: A Dutch BERT Model. arXiv:1912.09582
- Ahmet Üstün, Gosse Bouma and Gertjan van Noord. Cross-lingual Word Embeddings for Morphologically Rich Languages. 1222-1228. Paper presented at Recent Advances in Natural Language Processing 2019, Varna, Bulgaria.
ACL Anthology.
- Gertjan van Noord, Alwin van Lubeck & Willem-Jan Fontijn, Amerikaanse Tafeleend bij Zuidhorn in januari-maart 2016. In: Dutch Birding. 41 (5). pp 331--336.
- Ahmet Üstün, Rob van der Goot, Gosse Bouma and Gertjan van Noord.
Multi-Team: A Multi-attention, Multi-decoder Approach to Morphological Analysis.
In: SIGMORPHON 16, Florence, August 2, 2019.
ACL Anthology.
2018
- Martijn Wieling, Josine Rawee and Gertjan van Noord. Reproducibility in computational linguistics: are we willing to share?
In: Computational Linguistics. [pdf]
ACL Anthology.
- Rob van der Goot, Rik van Noord and Gertjan van Noord. A
Taxonomy for In-depth Evaluation of Normalization for User Generated
Content. In: LREC 2018.
ACL Anthology.
- Rob van der Goot and Gertjan van Noord. Modeling Input Uncertainty
in Neural Network Dependency Parsing. In: EMNLP, Brussel.
ACL Anthology.
- Gertjan van Noord. Nieuw voor Groningen en Nederland: Amerikaanse Tafeleend (Aythya americana).
In: De Grauwe Gors. Jaargang 45, 2018. pp 72-73.
- Dieke Oele, Gertjan van Noord. Simple Embedding-Based Word Sense Disambiguation. In: Proceedings of the 9th Global Wordnet Conference.
ACL Anthology.
2017
- Rob van der Goot and Gertjan van Noord. Parser Adaptation for Social
Media by Integrating Normalization. In: ACL 2017. Vancouver.
ACL Anthology.
- Gosse Bouma and Gertjan van Noord. Increasing return on annotation investment: the
automatic construction of a Universal Dependency treebank for Dutch. In: Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017). May 2017, Gothenburg Sweden. [pdf]
- Dieke Oele and Gertjan Van Noord. Distributional Lesk: Effective Knowledge-Based Word Sense Disambiguation.
In: IWCS 2017. Montpellier.
ACL Anthology.
- Martijn Wieling, Martin Kroon, Gertjan van Noord, Gosse Bouma
(editors), From Semantics to Dialectometry. Festschrift in honor of
John Nerbonne. College Publications. http://www.let.rug.nl/vannoord/30years/festschrift/
- Gertjan van Noord, How to compare speed and accuracy of syntactic
parsers. In: Hilke Reckman, Lisa L.S. Cheng, Maarten Hijzelendoorn,
Rint Sybesma (editors), Crossroads Semantics. Computation,
experiment and grammar. John Bejamins Publishing Company.
- Daniël de Kok and Gertjan van Noord. Mining for Parsing
Failures. In: Martijn Wieling, Martin Kroon, Gertjan van Noord,
Gosse Bouma (editors), From Semantics to Dialectometry. Festschrift
in honor of John Nerbonne. College Publications. http://www.let.rug.nl/vannoord/30years/festschrift/
- Rob van der Goot and Gertjan van Noord. MoNoise: Modeling Noise Using a Modular Normalization System.
In: CLIN Journal, Volume 7, pp 129-144. [pdf]
- Jan Odijk, Gertjan van Noord, Peter Kleiweg and Erik Tjong Kim
Sang. The Parse and Query (PaQu) Application. In: Jan Odijk,
Arjan van Hessen (editors), Clarin in the low countries.
Ubiquity Press, London, 2017.
[site]
- Artur Kulmizev; Bo Blankers; Johannes Bjerva; Malvina Nissim; Gertjan van Noord; Barbara Plank; Martijn Wieling.
The Power of Character N-grams in Native Language Identification. In: Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications. EMNLP workshop. September 2017, Copenhagen. [pdf]
- Marc Kemps-Snijders, Ineke Schuurman, Walter Daelemans, Kris
Demuynck, Brecht Desplanques, Véronique Hoste, Marijn Huijbregts,
Jean-Pierre Martens, Hans Paulussen, Joris Pelemans, Martin
Reynaert, Vincent Vandeghinste, Antal Van den Bosch, Henk Van den
Heuvel, Maarten Van Gompel, Gertjan Van Noord and Patrick Wambacq.
TTNWW to the rescue: no need to know how to handle tools and
resources. In: Jan Odijk, Arjan van Hessen (editors), Clarin in the
low countries. Ubiquity Press, London, 2017.
[site]
2016
- Pierrette Bouillon, Paola Merlo, Gertjan van Noord, Mike Rosner.
Obituary: In Memoriam: Susan Armstrong. Computational Linguistics, Volume 42, Issue 2 - June 2016.
ACL Anthology.
- Simon Šuster, Ivan Titov, Gertjan van Noord. Bilingual Learning
of Multi-sense Embeddings with Discrete Autoencoders. In: NAACL
2016.
ACL Anthology.
- Dieke Oele, Gertjan van Noord. Choosing lemmas from Wordnet
synsets in Abstract Dependency Trees. In: Second Workshop on Deep
Language Processing for Quality Machine Translation. Varna,
Bulgaria. [pdf]
- António Branco, Hans Uszkoreit, Aljoscha Burchardt, Jan Hajic,
Martin Popel, Kiril Simov, Petya Osenova, Markus Egg, Eneko Agirre,
Gertjan van Noord, Filipe Barrancos and Rosa Del Gaudio. QTLeap: A
European scientific research project on machine translation by deep
language engineering approaches. [pdf]
In: META-FORUM 2016.
- Luis Gomes, Gertjan van Noord, Antonio Branco, Steven
Neale. Seeking to Reproduce "Easy Domain Adaptation". In: 4REAL
workshop at LREC 2016.
- Laura Toloşi, Valentin Zhikov, Andrej Tagarev, Kiril Simov, Petya
Osenova, Gertjan van Noord and Dieke Oele. Machine Translation for
Crosslingual Annotation Transfer. In: Second Workshop on Deep
Language Processing for Quality Machine Translation. Varna,
Bulgaria. [pdf]
- Rosa Gaudio, Gorka Labaka, Eneko Agirre, Petya Osenova, Kiril
Simov, Martin Popel, Dieke Oele, Gertjan van Noord, João Silva, João
António Rodrigues, Steven Neale, Luís Gomes, Nuno Rendeiro, Andreia
Querido and António Branco. SMT and Hybrid systems of the QTLeap
project in the WMT16 IT-task. In: WMT16. [pdf]
ACL Anthology.
2015
- Michal Novák, Dieke Oele, Gertjan van Noord. Comparison of Coreference Resolvers for Deep Syntax Translation. In:
EMNLP 2015 Workshop on Discourse in Machine Translation, September 2015, Lisbon.
- Dieke Oele and Gertjan van Noord. Lexical Choice in Abstract Dependency Trees. In: Deep Machine Translation Workshop 2015.
September 2015, Prague.
- Rob van der Goot and Gertjan van Noord. ROB: Using Semantic Meaning to Recognize Paraphrases.
In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). [pdf]
- Simon Šuster, Gertjan van Noord, Ivan Titov. Word Representations, Tree Models and Syntactic Functions. arXiv.org. arxiv.org/abs/1508.07709.
2014
- Simon Suster and Gertjan van Noord. From neighborhood to
parenthood: the advantages of dependency representation over bigrams
in Brown clustering. In: COLING 2014.
- Angelina Ivanova and Gertjan Van Noord. Treelet Probabilities for
HPSG Parsing and Error Correction. In: LREC 2014. [website
with link to pdf]
2013
- Simon Suster and Gertjan van Noord. Semantic Mapping for Lexical Sparseness Reduction in Parsing.
In: ESSLLI'13 Workshop on Extrinsic Parse Improvement. [pdf]
- Gertjan van Noord, Gosse Bouma, Frank van Eynde, Daniël de Kok,
Jelmer van der Linde, Ineke Schuurman, Erik Tjong Kim Sang, Vincent
Vandeghinste. Large Scale Syntactic Annotation of Written Dutch:
Lassy. In: Essential Speech and Language Technology for Dutch. Springer.
[website with link to pdf]
- Vincent Vandeghinste, Scott Martens, Gideon Kotze, Jorg Tiedemann,
Joachim Van den Bogaert, Koen De Smet, Frank Van Eynde, and Gertjan
van Noord. Parse and Corpus-based Machine Translation. In: Essential
Speech and Language Technology for Dutch.
Springer.
[website with link to pdf]
- Jan De Belder, Daniël de Kok, Gertjan van Noord, Fabrice Nauze,
Leonoor van der Beek, and Marie-Francine Moens. Question Answering of
Informative Web Pages: How Summarisation Technology Helps. In:
Essential Speech and Language Technology for Dutch. Springer.
[website with link to pdf]
2012
- Gertjan van Noord. Het ontleedkundig Laboratorium. Rede uitgesproken bij de aanvaarding
van het ambt van hoogleraar taaltechnologie aan de Rijksuniversiteit Groningen. [pdf]
2011
- Kostadin Cholakov, Gertjan van Noord, Valia Kordoni, Yi Zhang. An empirical comparison of Unknown Word Prediction Methods. In: IJCNLP 2011.
[pdf]
- Kostadin Cholakov, Gertjan van Noord, Valia Kordoni, Yi Zhang. Adaptability of Lexical Acquisition for Large-scale Grammars. In: RANLP 2011. [pdf]
- Barbara Plank and Gertjan van Noord. Effective Measures of Domain Similarity for Parsing. In: ACL 2011. [pdf]
- Daniël de Kok and Barbara Plank and Gertjan van Noord. Reversible Stochastic Attribute-value Grammars. In: ACL 2011.
[pdf]
2010
- Barbara Plank and Gertjan van Noord. Dutch Dependency Parser
Performance Across Domains. In: Proceedings of the 20th Meeting of
Computational Linguistics in the Netherlands.
[pdf]
- Daniël de Kok and Gertjan van Noord. A Sentence Generator for Dutch.
In: Proceedings of the 20th Meeting of Computational Linguistics
in the Netherlands.
[pdf]
- Kostadin Cholakov and Gertjan van Noord. Using Unknown Word Techniques To
Learn Known Words. In: EMNLP 2010.
[pdf]
- Barbara Plank and Gertjan van Noord. Grammar-driven versus data-driven:
Which Parsing System is More Affected by Domain Shifts? In: ACL workshop
NLP and Linguistics: Finding the Common Ground. July 16, 2010, Uppsala, Sweden.
[pdf]
- Kostadin Cholakov and Gertjan van Noord. Acquisition of Unknown
Word Paradigms for Large Scale Grammars. In: COLING 2010: Poster Volume,
pages 153-161. August 23-27, Beijing, China.
[pdf]
- Yan Zhao and Gertjan van Noord. POS Multi-tagging Based on Combined Models.
In: LREC 2010. May 2010, Malta. pages 1249-1252.
[from ELRA,
pdf]
- Valia Kordoni and Gertjan van Noord. Passives in Germanic
Languages: the case of Dutch and German. In: Groninger Arbeiten zur
Germanistischen Linguistik (GAGL). Volume 49. pp 77-96. December 2009
(appeared in 2010).
[pdf;
GAGL website].
- Gertjan van Noord. Self-trained Bilexical Preferences to Improve
Disambiguation Accuracy. In: Harry Bunt, Paola Merlo and Joakim Nivre
(editors), Trends in Parsing Technology. Dependency Parsing, Domain
Adaptation, and Deep Parsing. Springer Verlag. pp 183-200. 2010.
[draft pdf;
book
page of publisher]
2009
- Daniël de Kok, Jianqiang Ma and Gertjan van Noord, A generalized
method for iterative error mining in parsing results. In: ACL2009
Workshop Grammar Engineering Across Frameworks (GEAF), Singapore,
2009. [pdf]
- Kostadin Cholakov and Gertjan van Noord. Combining Finite State
and Corpus-based Techniques for Unknown Word Prediction. In:
RANLP 2009. [pdf]
- Gertjan van Noord and Gosse Bouma. Parsed Corpora for Linguistics.
In: Proceedings of EACL Workshop The Interaction between
Linguistics and Computational Linguistics: Virtuous, Vicious or
Vacuous? Athens, 2009. pp 33-39. [pdf]
- Gertjan van Noord, Learning Efficient Parsing. In: EACL 2009. The
12th Conference of the European Chapter of the Association for
Computational Linguistics. 30 March - 3 April 2009, Athens, Greece. pp
817-825. [pdf]
- Gertjan van Noord et al., Lassy Syntactische Annotatie. In
preparation. pdf of draft
- Gertjan van Noord. Huge Parsed Corpora in LASSY. In: Frank van
Eynde, Anette Frank, Koenraad de Smedt, Gertjan van Noord (editors),
Proceedings of the Seventh International Workshop on Treebanks and
Linguistic Theories (TLT 7). January 23-24, 2009, Groningen, The
Netherlands. LOT Occasional Series.
[LOT site]
- Frank van Eynde, Anette Frank, Koenraad de Smedt, Gertjan van
Noord (editors), Proceedings of the Seventh International Workshop on
Treebanks and Linguistic Theories (TLT 7). January 23-24, 2009,
Groningen, The Netherlands. LOT Occasional Series.
[LOT site]
2008
- Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas,
Jörg Tiedemann. Question Answering with Joost at CLEF 2008. CLEF 2008
Working Notes. Aarhus Denmark.
[pdf]
- Barbara Plank and Gertjan van Noord. Exploring An Auxiliary
Distribution based approach to Domain Adaptation of a Syntactic
Disambiguation Model. In: Coling Workhop 'Cross Framework and
Cross Domain Parser Evaluation'. [pdf]
- N. Oostdijk, M. Reynaert, P. Monachesi, G. van Noord,
R. Ordelman, I. Schuurman, V. Vandeghinste. From D-Coi to SoNaR: A
reference corpus for Dutch. In: LREC 2008. [pdf]
- Gosse Bouma, Geert Kloosterman, Jori Mur, Gertjan van Noord,
Lonneke van der Plas, and Jörg Tiedemann. Question Answering with
Joost at CLEF 2007. In: Carol Peters, Valentin Jijkoun, Thomas Mandl,
Henning Mueller, Douglas W. Oard, Anselmo Penas, Vivien Petras, Diana
Santos (editors), Advances in Multilingual and Multimodal Information
Retrieval, 8th workshop of the Cross-Language Evaluation Form, CLEF
2007, Budapest, Hungary, September 19-21, 2007, Revised Selected
Papers. Lecture Notes in Computer Science 5152, Springer 2008. pp 257-260.
2007
- Gosse Bouma, Geert Kloosterman, Jori Mur, Gertjan van Noord,
Lonneke van der Plas, and Jörg Tiedemann. Question Answering with
Joost at CLEF 2007, CLEF 2007 Working Notes. [pdf]
- Gertjan van Noord. Using Self-Trained Bilexical Preferences to Improve
Disambiguation Accuracy. In: Proceedings of the Tenth International
Conference on Parsing Technologies. IWPT 2007, Prague. Pages 1--10.
[pdf]
- Timothy Baldwin, Mark Dras, Julia Hockenmaier, Tracy Holloway
King, Gertjan van Noord. The Impact of Deep Linguistic Processing on
Parsing Technology. In: Proceedings of the Tenth International
Conference on Parsing Technologies. IWPT 2007, Prague. Pages 36--38. [pdf from ACL Anthology]
-
Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der Plas,
Jörg Tiedemann. Using Syntactic Knowledge for QA. In: C. Peters,
P. Clough, F. Gey, J. Karlgren, B. Magnini, D.W. Oard, M. de Rijke,
M. Stempfhuber (editors) Evaluation of
Multilingual and Multi-modal Information Retrieval. Lecture Notes in
Computer Science 4730/2007. Springer. pp 318--327.
[concept pdf]
- Martijn Wieling, Mark-Jan Nederhof, Gertjan van Noord. Parsing Partially
Bracketed Input. In: CLIN 2005. Proceedings of the 16th Meeting of
Computational Linguistics in the Netherlands. Pages 1--16.
[pdf]
- Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der
Plas, Jörg Tiedemann. Linguistic Knowledge and Question Answering.
In: Traitement Automatique des Langues 46 (3) 2005. Pages
15--39. Appeared in 2007. [concept pdf]
2006
- Gosse Bouma, Ismail Fahmi, Jori Mur, Gertjan van Noord, Lonneke van der
Plas, Jörg Tiedemann. The University of Groningen at QA@CLEF2006. Using
Syntactic Knowledge for QA.
[pdf,
website]
- Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann.
Question Answering for Dutch using Dependency Relations. In:
C. Peters, F. Gey, J. Gonzalo, H. Mueller, G. Jones, M. Kluck,
B. Magnini, M. de Rijke (editors), Accessing Multilingual Information
Repositories. Lecture Notes in Computer Science
Vol. 4022/2006. Springer. Pages 370-379.
[pdf]
- Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste, Syntactic
Annotation of Large Corpora in STEVIN. In: LREC 2006.
[pdf]
- Gertjan van Noord. At Last Parsing Is
Now Operational. In: Piet Mertens, Cedrick Fairon, Anne Dister,
Patrick Watrin (editors): TALN06. Verbum Ex Machina. Actes de la 13e
conference sur le traitement automatique des langues naturelles. Page 20--42.
[pdf]
2005
- Gertjan van Noord and Valia Kordoni. A Raising Analysis of the Dutch
Passive. In: Proceedings of HPSG2005. [website,pdf]
- Gosse Bouma, Jori Mur, Gertjan van Noord, Lonneke van der Plas, and Jörg Tiedemann.
Question Answering for Dutch using Dependency Relations. In: CLEF2005
workshop. [pdf]
- Gosse Bouma, Jori Mur, Gertjan van Noord. Reasoning over
Dependency Relations for QA. In: KRAQ05. [pdf]
- Gertjan van Noord, Robert Malouf. Wide Coverage Parsing with
Stochastic Attribute Value Grammars. Draft. [pdf]
- Frederik Fouvry, Valia Kordoni, Gertjan van Noord. Object-to-Subject
Raising: An Analysis of the Dutch Passive. In: HPSG05. See above for a more
recent version, in the HPSG2005 proceedings.
2004
- Gertjan van Noord. Error Mining for Wide-Coverage Grammar
Engineering. In: ACL 2004, Barcelona.
[pdf,
ps] ACL Anthology
- Robert Malouf, Gertjan van Noord. Wide Coverage Parsing with
Stochastic Attribute Value Grammars. In: IJCNLP-04 Workshop Beyond
Shallow Analyses - Formalisms and statistical modeling for deep
analyses. [pdf,web page]
- Jan Daciuk, Gertjan van Noord. Finite Automata for Compact
Representation of Tuple Dictionaries.
Theoretical Computer Science. Volume 313, Issue 1,
Pages 45--56.
[postscript,
pdf]
- Robbert Prins and Gertjan van Noord. Reinforcing Parser
Preferences through Tagging. In special issue on
Evolutions in Parsing of the journal Traitement Automatique
des Langues volume 44(3) 2003, pages 121-139.
[postscript,
pdf]
2003
- Gertjan van Noord, Finite State Processing. In: Lynn Nadel
(editor-in-chief), Encyclopedia of Cognitive Science.
pp 130--134. (Originally: Nature Publishing Group; Now: Wiley).
[homepage]
- Lauri Karttunen, Kimmo Koskenniemi, Gertjan van Noord. Special
issue: Finite State Methods in Natural Language Processing. Natural
Language Engineering. Volume 9, Part 1, March 2003.
[postscript,
pdf]
2002
- Leonoor van der Beek, Gosse Bouma, Jan Daciuk, Tanja Gaustad,
Robert Malouf, Gertjan van Noord, Robbert Prins, Begona
Villada. Algorithms for Linguistic Processing. NWO PIONIER
Progress Report.
[postscript,
pdf]
- Leonoor van der Beek, Gosse Bouma, Robert Malouf, Gertjan van
Noord. The Alpino Dependency Treebank. In: Mariët Theune,
Anton Nijholt, Hendri Hondorp (editors). Computational
Linguistics in the Netherlands CLIN 2001. Selected papers from
the Twelfth CLIN Meeting. Rodopi 2002.
[postscript,
pdf,
html,
ordering info]
- Leonoor van der Beek, Gosse Bouma, Gertjan van Noord. Een brede
computationele grammatica voor het
Nederlands. Nederlandse Taalkunde, jaargang 7,
2002-4. [in Dutch]. 353--374.
[postscript,
pdf]
- Jan Daciuk and Gertjan van Noord, Finite Automata for Compact
Representation of Language Models in NLP. In: Bruce Watson, Derick
Wood (eds), Implementation and Application of Automata. Lecture Notes
in Computer Science. 65--73.
2001
- Lauri Karttunen, Kimmo Koskenniemi, Gertjan van Noord (editors),
Finite State Methods in Natural Language Processing. FSMNLP 2001.
Extended Abstracts. ESSLLI Workshop, Helsinki 2001.
[homepage with links to on-line versions]
- Tony Mullen, Robert Malouf and Gertjan van Noord, Statistical
Parsing of Dutch using Maximum Entropy Models with Feature
Merging. NLPRS01, Tokyo.
[postscript,
pdf,
html
]
- Robbert Prins, Gertjan van Noord, Unsupervised POS-tagging
Improves Parsing Accuracy and Parsing Efficiency. IWPT,
Beijing. See above for the article entitled Reinforcing
Parser Preferences through Tagging
- Jan Daciuk, Gertjan van Noord, Finite Automata for Compact
Representation of Language Models in NLP. CIAA 2001,
Pretoria. See above for the article entitled Finite
Automata for Compact Representation of Tuple Dictionaries.
- Gosse Bouma, Gertjan van Noord, Robert Malouf. Alpino: Wide
Coverage Computational Analysis of Dutch. In: Computational
Linguistics in the Netherlands CLIN 2000.
[postscript,
pdf,
html,
ordering info
]
- Gertjan van Noord and Dale Gerdemann. Finite State Transducers
with Predicates and Identity. Grammars 4 (3).
[postscript,
pdf,
html
]
- Jean-Claude Junqua and Gertjan van Noord (editors), Robustness in
Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1
[home page]
- Jean-Claude Junqua and Gertjan van Noord, Introduction. In:
Jean-Claude Junqua and Gertjan van Noord (editors), Robustness in
Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1
[home page]
- Gertjan van Noord, Robust Parsing of Word Graphs. In: Jean-Claude
Junqua and Gertjan van Noord (editors), Robustness in
Language and Speech Technology. Kluwer. ISBN 0-7923-6790-1
[home page]
- Gertjan van Noord, Dale Gerdemann, An Extendible Regular Expression
Compiler for Finite-state Approaches in Natural Language Processing.
In: O.Boldt, H.Juergensen (eds), Automata Implementation. 4th
International Workshop on Implementing Automata, WIA '99,
Potsdam Germany, July 1999, Revised Papers. Springer
Lecture Notes in Computer Science 2214, 2000.
[postscript,
pdf,
html,
order
]
2000
- Dale Gerdemann, Gertjan van Noord. Approximation and Exactness in
Finite State Optimality Theory. In: Jason Eisner, Lauri
Karttunen, Alain Thériault (editors), SIGPHON 2000, Finite
State Phonology. Proceedings of the Fifth Workshop of the ACL
Special Interest Group in Computational Phonology. August
2000, Luxembourg.
[postscript,
pdf,
html,
cmp-lg full proceedings,
cmp-lg
]
- Gertjan van Noord, Treatment of Epsilon Moves in Subset
Construction. Computational Linguistics, 26 (1).
[postscript,
pdf,
html
] ACL Anthology
- Gertjan van Noord, Grammar-based Natural Language Understanding.
Priority Programme Language and Speech Technology,
Technical Report 121.
[postscript,
pdf]
- Gertjan van Noord. FSA6 Reference Manual
[html,
home page]
1999
- Gertjan van Noord, Algorithms for Linguistic Processing. PIONIER
project proposal (accepted by NWO in spring 1999).
[html,
postscript,
pdf
]
- Gert Veldhuijzen van Zanten, Gosse Bouma, Khalil Sima'an, Gertjan van
Noord, Remko Bonnema. Evaluation of the NLP Components of the OVIS2
Spoken Dialogue System. In: van Eynde, Schuurman and Schelkens (eds),
Computational Linguistics in the Netherlands 1998, Rodopi Amsterdam,
1999, pages 213--229.
[postscript,
pdf,
html,
cmp-lg]
- Gertjan van Noord, Dale Gerdemann, An Extendible Regular Expression
Compiler for Finite-state Approaches in Natural Language Processing.
WIA 99, Potsdam Germany. See above for later version.
- Dale Gerdemann, Gertjan van Noord, Transducers from Rewrite Rules
with Backreferences. EACL 99, Bergen Norway.
[html,
postscript,
pdf,
cmp-lg] ACL Anthology
- Gertjan van Noord, Efficient Approximation to a Word Graph Search
Algorithm. Priority Programme Language and Speech Technology,
Technical Report 78.
- Gertjan van Noord, Techniques for Fast and Accurate Reduction of
Word-graph Size. Priority Programme Language and Speech Technology,
Technical Report 77.
- Gertjan van Noord, Hdrug Reference Manual.
[html,
home page
]
- Gertjan van Noord, Gosse Bouma, Rob Koeling, Mark-Jan
Nederhof. Robust Grammatical Analysis for Spoken Dialogue
Systems. Natural Language Engineering, 5(1), 1999,
pages 45--93 (written 1997).
[ postscript,
html,
pdf
cmp-lg
]
1998
- Gertjan van Noord. Fsa Utilities User Manual Version 5. See above
for more recent version.
- Remko Bonnema, Gertjan van Noord, Gert Veldhuizen van Zanten,
Evaluation Results NLP Components OVIS2. Priority Programme
Language and Speech Technology, Technical Report 57. See above
for newer version.
- Gertjan van Noord, Treatment of Epsilon-moves in Subset
Construction. Appears in FSMNLP98. An
improved version has been published in Computational
Linguistics. See above.
- Gosse Bouma, Gertjan van Noord, Word Order Constraints on
Verb Clusters in German and Dutch.
In: Erhard Hinrichs, Tsuneko Nakazawa,
Andreas Kathol (editor) Complex predicates in Nonderivational Syntax.
Academic Press.
[postscript,
html,
pdf
] (completed in 1996).
- Gertjan van Noord, Guenter Neumann, Syntactic
Generation.
[html,
postscript
pdf].
This is (a local copy of)
chapter 4 of the
Survey of the State of the Art in Human Language Technology
edited by Ronald A. Cole, Joseph Mariani, Hans Uszkoreit, Annie Zaenen
and Victor Zue. This report is sponsored by the National Science Foundation,
Directorate XIII-E of the Commission of the European Communities, and
the Center for Spoken Language Understanding, Oregon Graduate
Institute. (completed in 1995).
1997
- Gertjan van Noord, Prolog(Elex): a New Tool to Generate Prolog
Tokenizers. Draft.
[ postscript,
html
]
- Gosse Bouma and Gertjan van Noord, Natuurlijke-taal
Interfaces. In: Petra Hendriks, Niels Taatgen, Tjeerd Andringa,
Breinmakers en Breinbrekers; Inleiding Cognitiewetenschap.
Longman 1997.
[ html
]
- Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord,
Priority Programme Language and Speech Technology, Research Plan
1997-2000.
[ html
]
- Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord,
Priority Programme Language and Speech Technology, Progress Report
1995-1996.
[ html
]
- Mark-Jan Nederhof, Gosse Bouma, Rob Koeling, Gertjan van Noord,
Grammatical analysis in the OVIS spoken-dialogue system.
Proceedings of the ACL/EACL Workshop on Spoken Dialog Systems,
Madrid, Spain - July 11-12, 1997.
[ postscript,
dvi,
html,
comp-lg
] ACL Anthology
- Gertjan van Noord, Gosse Bouma. HDRUG. A Flexible and Extendible
Development Environment for Natural Language Processing.
Proceedings of the EACL/ACL Workshop ENVGRAM,
Computational Environments for Grammar
Development and Linguistic Engineering.
[ postscript,
html,
the software
] ACL Anthology
- Gertjan van Noord. An Efficient Implementation of the Head-Corner
Parser. Computational Linguistics, volume 23, number 3, 1997.
[html,
postscript,
dvi,
cmp-lg,
the software
].
- Gertjan van Noord. FSA Utilities: A Toolbox to Manipulate
Finite-state Automata. In: Darrell Raymond, Derick Wood and Sheng Yu
(eds), Automata Implementation. Lecture Notes in Computer
Science 1260, Springer Verlag.
[official version from Springer,
postscript,
draft dvi,
html,
the software
]. (completed in 1996).
1996
- Gertjan van Noord and Gosse Bouma, Dutch Verb Clustering without
Verb Clusters. In: Patrick Blackburn, Maarten de Rijke
(editors), Specifying Syntactic Structures.
[postscript,
dvi,
html,
ordering information
] (completed in 1995).
- Gertjan van Noord, Gosse Bouma, Rob Koeling, Mark-Jan Nederhof,
Conventional Natural Language Processing in OVIS2: October 1996 Deliverables.
NWO Priority Programme Language and Speech Technology -
technical report: 28.
[postscript]
- Gosse Bouma and Gertjan van Noord. Word Order Constraints on
German Verb Clusters. Conference on Formal Grammar. Prague 1996.
[postscript,
dvi,
html
]
- Gertjan van Noord. Robust Parsing with the Head-corner Parser.
Robust Parsing Workshop (during ESSLLI 1996), 1996 Prague.
[postscript,
dvi,
html
]
- Gertjan van Noord, Mark-Jan Nederhof, Rob Koeling, Gosse Bouma,
Conventional Natural Language Processing in the Priority Programme on
Language and Speech Technology: January 1996 Deliverables.
NWO Priority Programme Language and Speech Technology -
technical report: 22.
[postscript].
- Gosse Bouma, Rob Koeling, Mark-Jan Nederhof, Gertjan van Noord,
Grammatical Analysis in a Spoken Dialog System. Appears in CLCG
Yearbook 1996. Also known as technical report 21 of the NWO Priority
Programme Language and Speech Technology. [postscript, html ]
1995
- Lou Boves, Jan Landsbergen, Remko Scha and Gertjan van Noord, Language and Speech
Technology. Project plan for the
NWO Priority Programme `Language and Speech Technology'.
- Gertjan van Noord and Gosse Bouma, Delayed Evaluation of Lexical
Rules. 3 page abstract for the Aquilex Workshop on Lexical Rules.
[ html,
postscript,
dvi
]
- Gertjan van Noord, The
Intersection of Finite State Automata and Definite Clause
Grammars. ACL 1995 Boston.
[postscript,
dvi,
html,
cmp-lg
] ACL Anthology
1994
- On the Intersection of Finite
State Automata and Definite Clause Grammars. Paper for TWLT 8, Twente.
[html,
postscript,
dvi
]
- Head-corner parsing for TAG. In:
Computational Intelligence, volume 10 number 4, page 525 - 534.
[html,
postscript,
dvi
]
- Head-corner Parsing. In: C.J. Rupp, M.A. Rosner, R.L. Johnson (editors),
Constraints, Language and Computation. Academic Press 1994.
[postscript,
dvi,
html
] (completed in 1991).
- Gosse Bouma and Gertjan van Noord, A lexicalist
account of the Dutch verb cluster. In: Gosse Bouma and Gertjan
van Noord (editor),
Papers from the Fourth Clin Meeting.
[postscript,
dvi,
html]
-
Gosse Bouma and Gertjan van Noord,
Constraint-based Categorial Grammar.
In: Proceedings of the ACL, New Mexico 1994.
[html,
postscript,
dvi,
cmp-lg
] ACL Anthology
-
Gertjan van Noord and Gosse Bouma,
The Scope of Adjuncts and the Processing of Lexical Rules
In: Proceedings of Coling, 1994, Kyoto Japan.
[html,
postscript,
dvi,
cmp-lg
] ACL Anthology
-
Gosse Bouma and Gertjan van Noord (editors), CLIN IV
Papers from the fourth CLIN meeting.
- HDRUG: A Graphical User Environment
for Natural Language Processing in Prolog. See above for more recent
version.
1993
-
Reversibility in Natural Language Processing. Dissertation University
of Utrecht 1993.
[html,
Printed version,
postscript (700K),
dvi
]
- Gosse Bouma and Gertjan van Noord,
Head-driven Parsing for Lexicalist Grammars: Experimental Results. In:
Proceedings of the EACL, 1993, Utrecht.
[html,
postscript,
dvi
] ACL Anthology
- Gunter Neumann and Gertjan van Noord,
Reversibility and Self-Monitoring in Natural Language Generation.
In: Strzalkovski (editor) Reversible Grammars in NLP, 1993.
[html,
postscript,
dvi
]
1992
- Guenter Neumann and Gertjan van Noord,
Self-monitoring with
Reversible Grammars. In: Proceedings of Coling, 1992, Nantes.
[html,
postscript,
dvi
] ACL Anthology
1991
- Head-corner parsing for discontinuous constituency. (in: ACL 1991
Berkeley). ACL Anthology
- Towards Uniform Processing of Constraint-based Categorial Grammars. In:
ACL workshop Reversible Grammar in Natural Language Processing. Berkely
1991. ACL Anthology
-
Gertjan van Noord, Joke Dorrepaal, Pim van der Eijk, Maria Florenza,
Herbert Ruessink, and Louis des Tombe
An overview of MiMo2. In:
Machine Translation 1991. The postscript version is here.
The DVI version is here.
-
Gertjan van Noord, Morphology in MiMo2.
[dvi,
postscript].
1990
-
An Overview of
Head driven Bottom-up Generation. In: Robert Dale, Chris Mellish,
Michael Zock (eds) Current Research in Natural
Language Generation. 1990.
The postscript version is
here. The DVI version is here.
-
Herbert Ruessink, Gertjan van Noord, Remarks on the Bottom-up
Generation Algorithm.
[dvi,
postscript].
-
Gertjan van Noord, Joke Dorrepaal, Pim van der Eijk, Maria Florenza,
and Louis des Tombe
The MiMo2 Research System. (In:
MT conference 1990 Austin Texas).
The postscript version is here.
The DVI version is here.
- Stuart Shieber, Gertjan van Noord, Fernando Pereira and Robert
Moore. Semantic-head-driven Generation. In: Computational Linguistics
1990. [postscript]. ACL Anthology
- Reversible Unification
Based Machine Translation. (in: Coling 1990, Helsinki). The
postscript version is here.
The DVI version is here. ACL Anthology
- Joke Dorrepaal and Gertjan van Noord, Anaforische Relaties in het
automatisch vertaalsysteem MiMo. In Anneke Neijt and Dik Bakker
(editors), Computerlinguistiek, een overzicht in artikelen. 1990. Not
available in electronic form.
1989
- Stuart Shieber, Gertjan van Noord, Fernando Pereira and Robert
Moore A Semantic-head-driven
Generation Algorithm for Unification-based Formalisms. (In: ACL
1989, Vancouver). The postscript version is here.
The DVI version is here. ACL Anthology
- Gertjan van Noord, Joke Dorrepaal, Doug Arnold, Steven Krauwer,
Louisa Sadler and Louis des Tombe, An approach to sentence-level
anaphora in Machine Translation. (in: EACL 1989 Manchester). The
postscript version is here.
The DVI version is here. ACL Anthology
- BUG: A Directed Bottom Up Generator
for Unification Based Formalisms. Utrecht/Leuven working papers in
Natural Language Processing 1989. The postscript version is here. The DVI version is here.