... Association for Computational Linguistics, pages 139–144,
Jeju, Republic of Korea, 8-14 July 2012.
© 2012 Association for Computational Linguistics
A Graphical Interface for MT Evaluation and Error ... typologies of
the errors produced by MT systems (Vilar et al.,
2006; Farrús et al., 2011; Kirchhoff et al., 2007) and
graphical interfaces for human classification...
... grammatical and processing
framework for handling the repairs,
hesitations, and other interruptions in natural
human dialog. The proposed framework
has proved adequate for a collection ... (urn) and speech repairs
(I mean) and give meta-comments on the utterance
(right).
specify how speech repairs should be handled
by the parser. (Hindle, 1983) and (Bear et
al...
... general information about
the system’s components, a description and timestamp
for each system event and user event, names
and timestamps for the system-recorded sound
files, and timestamps for ... straightforward graphical
interface similar to those found in current systems
but with the addition of icons for actors and directors
that can be used both for unimod...
... display general Web pages and
documents. Palm and Pocket PC devices, whose
screens commonly display 10–15 lines, are candidates.
Schofield and Kubin (2002) argue that for
such devices question-answering ... Introduction
This paper demonstrates a multimodal interface for
asking questions and retrieving a set of likely answers.
Such an interface is particularly appropriate...
...
Abstract
GernEdiT (short for: GermaNet Editing Tool)
offers a graphical interface for the lexicographers
and developers of GermaNet to access
and modify the underlying GermaNet ... Princeton WordNet
for English. The traditional lexicographic
development of GermaNet was error-prone
and time-consuming, mainly due to a complex
underlying data format and no opp...
... word alignment information.
3 Experiments
3.1 PORT as an Evaluation Metric
We studied PORT as an evaluation metric on
WMT data; test sets include WMT 2008, WMT
2009, and WMT 2010 all-to-English, ... as speed,
requirements for linguistic resources, and
optimization difficulty, they have not been
widely adopted for tuning. This paper
presents PORT, a new MT evaluat...
... values for the family of metrics AEv(α,N), for adequacy scores in MT evaluation
scores from the human evaluation, for summaries
of 200 and 400 words, respectively (the values of
R² for summaries ...
score (see Lin and Hovy (2003) for more
information on the human summary evaluation).
From the publicly available data for this evaluation
(DUC 2001), we compute...
... & Ananiadou
2000) and standard statistical significance tests
such as the t-test, the chi-squared test, and log-likelihood
(Church and Hanks 1990, Dunning
1993), and information-based methods, ... relative entropy and mutual
information). In Krenn & Evert 2001, frequency
outperformed mutual information though not the t-test,
while in Evert and Krenn 2001, log-likeliho...
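
The association measures named in this passage can be illustrated for a single word pair. The following is a minimal sketch using textbook formulations (raw frequency, pointwise mutual information, the t-score approximation, and the log-likelihood ratio over a 2x2 contingency table). The function name `association_scores` and its count arguments are hypothetical and are not taken from the cited studies, which may define the measures differently.

```python
import math


def association_scores(c12, c1, c2, n):
    """Score a word pair (w1, w2) from corpus counts.

    c12: count of the pair, c1: count of w1, c2: count of w2,
    n: total number of pairs in the corpus (all hypothetical inputs).
    """
    expected = c1 * c2 / n  # pair count expected under independence

    # Pointwise mutual information (log base 2).
    pmi = math.log2(c12 / expected)

    # t-score, using the common approximation variance ~ observed count.
    t_score = (c12 - expected) / math.sqrt(c12)

    # Log-likelihood ratio over the 2x2 contingency table.
    observed = [[c12, c1 - c12],
                [c2 - c12, n - c1 - c2 + c12]]
    row_totals = [c1, n - c1]
    col_totals = [c2, n - c2]
    llr = 0.0
    for i in range(2):
        for j in range(2):
            cell_expected = row_totals[i] * col_totals[j] / n
            if observed[i][j] > 0:  # treat 0 * log(0) as 0
                llr += observed[i][j] * math.log(observed[i][j] / cell_expected)
    llr *= 2

    return {"frequency": c12, "pmi": pmi, "t_score": t_score, "llr": llr}
```

Ranking candidate pairs by each returned score and comparing the rankings against a manually annotated list is one way such measures are typically compared; the counts would come from whatever corpus is being studied.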
... to stems to form a word that may correspond
to an entire phrase in a language like English.
For instance, in Turkish, word formation is
based on suffixation of derivational and inflectional ...
the knowledge about the type of process and the
type of morpheme. We adopt a representation similar
to Hoeksema and Janda's (1988) notation for
the operator. The 3-tuple
&l...
... the tags and lemmas of bouncer
and bounce will detect a problem with these assignments
and will be able to correct the tagging and
lemmatization error for bouncer.
The main source of information ... tag-sets and top three lemmas for each
tag for training.
for word w_i, for j = 1 . . . k. Also, let l_i(t)_j denote
the top lemmas for word w_i given tag t. An
assignm...