... words (14K sentences, 946 documents), thetest set contains 46K words (3.5K sentences, 231documents), and the development set contains51K words (3.3K sentences, 216 documents).We also evaluated ... of160 million word tokens with a vocabulary size Wof 70K word types. There are 2·W types of context(columns): The first or second W are counted if theword c occurs within a window of 10 to the ... parsing with semi-supervisedword clustering. IWPT (pp. 138–141).Collobert, R., & Weston, J. (2008). A unifiedarchitecture for natural language processing:Deep neural networks with multitask...