Data sets in Excel format. May freely be used, but you are kindly requested to identify the source, and to cite the paper for which they were collected.
Long distance extraction. Data from Dutch and English
Dutch data for Ankelien Schippers & Jack Hoeksema, 2020, Langeafstandsverplaatsing in het Nederlands, Engels en Duits: de sandwich ontleed. [Long-distance movement in Dutch, English and German: the deconstructed sandwich]. To appear in Nederlandse Taalkunde.
English data for the above paper
This is a slight superset of the data that were used for the paper
Jack Hoeksema, 2006, "Pseudogapping: its syntactic analysis and cumulative effects on its acceptability", Research on Language and Computation, vol. 4, no. 4, 335-352.
Long-distance movement in Dutch
Jack Hoeksema and Ankelien Schippers, Diachronic changes in long-distance dependencies: the case of Dutch. Proceedings of ICHL Nijmegen (2009).
Performative van with adverbs of polarity
These data were used for
Jack Hoeksema, 2006, "Hij zei van niet, maar knikte van ja: Distributie en diachronie van bijwoorden van polariteit ingeleid door van" Tabu 35, 3-4, 135-158.
Occurrences of polarity sensitive enig
These data were used for
Jack Hoeksema, 2010, "Dutch ENIG: from nonveridicality to downward entailment." Natural Language and Linguistic Theory, 28(4):837–859.
Occurrences of the discourse particle best
These data were used for Jack Hoeksema, 2008, "The emergence of particle clusters in Dutch: Grammaticalization under adverse conditions", in: Elena Seoane and Maria José Lopez-Couso (eds.), Theoretical and Empirical Issues in Grammaticalization, John Benjamins, Amsterdam and Philadelphia, 131-149.
Occurrences of the swarm-construction
Data are a superset of those used for Jack Hoeksema, 2009, "The swarm alternation revisited" in Erhard Hinrichs and John Nerbonne, eds., Theory and Evidence in Semantics, CSLI, Stanford, 53-80.