... Czech Republic, June 2007.
© 2007 Association for Computational Linguistics
A Fully Bayesian Approach to Unsupervised Part-of-Speech Tagging
Sharon Goldwater
Department of Linguistics
Stanford ... parameters.
We show using part-of-speech tagging that
a fully Bayesian approach can greatly improve
performance. Rather than estimating
a single set of parameters, the Baye...
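As a reminder of what this contrast amounts to, the two estimation strategies can be written side by side. The notation is assumed here for illustration and is not taken from the paper: $\mathbf{w}$ denotes the observed words, $\mathbf{t}$ a tag sequence, and $\theta$ the model parameters.

```latex
% Point estimation: commit to a single parameter setting, then tag.
\hat{\theta} = \arg\max_{\theta} P(\mathbf{w} \mid \theta), \qquad
\hat{\mathbf{t}} = \arg\max_{\mathbf{t}} P(\mathbf{t} \mid \mathbf{w}, \hat{\theta})

% Fully Bayesian: integrate over all parameter settings instead.
P(\mathbf{t} \mid \mathbf{w}) = \int P(\mathbf{t} \mid \mathbf{w}, \theta)\, P(\theta \mid \mathbf{w})\, d\theta
```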
... converted to dependency trees using
the Stanford Parser (Marneffe et al., 2006). We convert
the tokens in the training data to lower case and
re-tokenize the sentences using the same tokenizer
from ... sensitive to parser errors;
on the other hand, the integrated model is forced
to use a longer distortion limit, which leads to more
search errors at decoding time. It is possible to
...
... state of a simple finite-state
automaton that only has two states. The automaton
is set to the initial state ($q_0$) at the top of a message. It
makes a transition to state ($q_1$) when it encounters ... is to
enable the quality and nature of discussions that
occur within an on-line discussion board to be
communicated in a summary to a potential newcomer
or group moderators.
We p...
... approach to unsupervised
parsing. Unsupervised DOP models assign
all possible binary trees to a set of sentences
and next use (a large random subset of) all
subtrees from these binary trees to ... extends DOP1 to unsupervised parsing
(Bod 2006). Its key idea is to assign all unlabeled
binary trees to a set of sentences and to next use (in
principle) all subtrees from the...
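
The first step described above, assigning every unlabeled binary tree to a sentence, can be illustrated with a minimal sketch. The function names `binary_trees` and `catalan` and the nested-tuple tree representation are assumptions made for this illustration, not taken from the paper. The sketch covers only tree enumeration; it does not extract or sample subtrees.

```python
from functools import lru_cache
from math import comb


def catalan(n):
    """n-th Catalan number; a sentence of n + 1 words has this many binary trees."""
    return comb(2 * n, n) // (n + 1)


def binary_trees(words):
    """Return every unlabeled binary bracketing of `words` as nested tuples."""
    words = tuple(words)

    @lru_cache(maxsize=None)
    def build(i, j):
        # All trees spanning words[i:j]; a single word is its own (leaf) tree.
        if j - i == 1:
            return (words[i],)
        trees = []
        for k in range(i + 1, j):  # every possible split point
            for left in build(i, k):
                for right in build(k, j):
                    trees.append((left, right))
        return tuple(trees)

    return list(build(0, len(words)))


if __name__ == "__main__":
    for tree in binary_trees("the dog barks".split()):
        print(tree)
```

The number of trees grows as the Catalan numbers (a 7-word sentence already has 132), which is why using only a random subset of the resulting subtrees, as the text mentions, becomes attractive in practice.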
... the
future, we plan to explore phonological context and
use more flexible topological structures to model
acoustic units within our framework.
Acknowledgements
The authors would like to thank Hung-an ... Computational Linguistics
A Nonparametric Bayesian Approach to Acoustic Model Discovery
Chia-ying Lee and James Glass
Computer Science and Artificial Intelligence Laboratory
Massach...
... a tool for
predicting metabolic engineering strategies to optimize plant freezing tolerance.
We confirm that a significant improvement in freezing tolerance in
plants involves multiple regulatory ... demonstrated this [12]. However, it was shown
that fully cold-acclimated transformants of C24 did
not differ from the wild type with regard to freezing
tolerance and, at the same time, diff...
... Moreover, we
also intend to perform a user study on our visualization
prototype to see if it increases the productivity of
post-editors.
Acknowledgements
We would like to thank Christoph Tillmann and ... confidence score, extend it to
the sentence level, then apply it to the n-best list reranking
task to improve MT quality, and finally design a
visualization prototype. We try to a...
... solution
to the modelling and scaling problems of
previous approaches. We describe our Bayesian
SCFG model in Section 4 and a Gibbs sampler
to explore its posterior. We apply this sampler
to build ... Figure 4. The
update equations are analogous to those used for
the Split/Join operator in Figure 3. In order for this
operator to be effective we need to allow greater
than binar...
... approximation to the joint
probability.
Lastly, re-ranking of POS sequences is expected
to predict reanalysis of lexical categories. This is
because re-ranking in the tagger is parallel to
reanalysis ... disambiguating word, to compared
to “on”.
4.3 Probability Re-ranking
The probability re-ranking reported in Corley and
Crocker (2000) was replicated. The tagger
successfully res...
... on the other. We propose to demonstrate
a prototype system instantiating this
architecture, which has been built on top
of the Open Source REGULUS 2 platform.
The prototype translates spoken ... Translator (Rayner et al., 2000a)).
At run-time, the system behaves essentially like a
phrasal translator which allows some variation in the
input language. This is close in spirit to the appro...