... of rule c → e, and K is the total number of ways to rewrite c, we now take into account our DP(α_c, P_0(· | c)) prior in (1), which, when truncated to a finite grammar, reduces to a K-dimensional ... prior and selects a compact representation of the data (grammar sizes ranged from 4K-7K for GS compared to a grammar of about 35K rules for EM); (2) GS does not commit to a precomputed grammar and ... 2009; Post and Gildea, 2009) and machine translation (DeNero et al., 2008; Cohn and Blunsom, 2009; Liu and Gildea, 2009). Segmentation is achieved by introducing a prior bias towards grammars that...
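
As a rough illustration of how a DP(α_c, P_0(· | c)) prior can enter a rule probability, the sketch below computes the standard Dirichlet-process (Chinese restaurant process) predictive, (n_{c→e} + α_c P_0(e | c)) / (n_c + α_c). This is a minimal sketch under stated assumptions, not necessarily the exact sampling equation of the excerpt: the names `dp_rule_probability`, `counts`, `alpha_c`, and `base_prob`, the toy counts, and the uniform base distribution over K = 4 rewrites are all hypothetical.

```python
from collections import Counter


def dp_rule_probability(e, counts, alpha_c, base_prob):
    """Predictive probability of rewriting c as e under a DP(alpha_c, P0) prior.

    counts    -- Counter of how often each right-hand side e was used for c
                 (hypothetical sufficient statistics of a sampler state)
    alpha_c   -- DP concentration parameter for the left-hand side c
    base_prob -- function returning P0(e | c), the base distribution
    """
    n_ce = counts[e]              # times rule c -> e has been used
    n_c = sum(counts.values())    # total rewrites of c
    return (n_ce + alpha_c * base_prob(e)) / (n_c + alpha_c)


# Toy example (assumed values): c has K = 4 possible rewrites, P0 is uniform.
K = 4
rewrites = ["a b", "a", "b", "b a"]
counts = Counter({"a b": 3, "a": 1})


def uniform_p0(e):
    return 1.0 / K


probs = {e: dp_rule_probability(e, counts, 1.0, uniform_p0) for e in rewrites}
```

With these toy values, frequently used rules keep most of the mass while unseen rewrites retain a small share proportional to α_c P_0(e | c); a smaller `alpha_c` concentrates the distribution on fewer rules, which is one way a prior of this kind can favour compact grammars.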