... June 2007.
c
2007 Association for Computational Linguistics
GLEU: Automatic Evaluation of Sentence-Level Fluency
Andrew Mutton
∗
Mark Dras
∗
Stephen Wan
∗,†
Robert Dale
∗
∗
Centre for Language ... grammar, rhythm and flow,
appropriateness of tone, and several other specific
characteristics of good text.
In terms of automatic evaluation, we are not aware
of any technique that...
... construction of N-best translation
lexicons from parallel text. Melamed (1995) used
the ratio (LCSR) between the length of the LCS of
two words and the length of the longer word of the
two ... section, we present the evaluations of
ROUGE-L, ROUGE-S, and compare their per-
formance with other automatic evaluation meas-
ures.
5 Evaluations
One of the goals of developing...
... (possible)
medical conditions.
The importance of the task of negation and spec-
ulation (a.k.a. hedge) detection is attested by a num-
ber of research initiatives. The creation of the Bio-
Scope corpus (Vincze ... Statistics of the BioScope corpus. The 2nd and 3d
columns show the total number of cues within the datasets; the
4th and 5th columns show the percentage of negated and...
... ex-
amples of the previous section. From the point of
view of bag -of- word methods, the pairs (T
1
, H
1
)
and (T
1
, H
2
) have both the same intra-pair simi-
larity since the sentences of T
1
and ... rules that describe a non trivial set
of entailment cases. The experiments with
the data sets of the RTE 2005 challenge
show an improvement of 4.4% over the
state -of- the-art me...
... there are two sentences in each of the
454
(1) kono software-no riten-ha hayaku ugoku koto
this software-POST advantage-POS T quickly run to
The advantage
of this software is to run quickly.
(2) ... the polarity of words
There are some works that discuss learning the po-
larity of words instead of sentences.
Hatzivassiloglou and McKeown proposed a
method of learning the polarity...
... specific and tangible
features. Also, there are somewhat a fixed set of
features of a specific type of product, for exam-
ple, ease of use, durability, battery life, photo
quality, and shutter lag ... examples of sen-
tences that our system identified as reasons of
complaints.
(1) Unfortunately, I find that
I am no longer comfortable in
your establishment because of
the...
... the word senses numbered i of
the word x. I
x
is the word sense indexing function
of x that gives an index to each sense of the word
x. All contextual words x
i
±j
of a central word x have ... 1
shows the average number of clusters with each
clustering method shown chapter 3 by the part of
speech. WC and WF are the average number of
senses by the part of speech.
In Ta...
... application of the method is auto-
matic or semi-automatic compilation of a glossary or
technical-term dictionary for a certain domain. Re-
cursive application of the method enables to collect a
list of ... based on
search engine hits. An evaluation result
shows that the precision of the method is
85%.
1 Introduction
This study aims to realize an automatic method of
collecting t...
... The completeness of the output
list increases monotonically with the total number
of occurrences of each verb in the corpus. False
positive rates are one to three percent of observa-
tions. ... architecture of the system, and that of this pa-
per, directly reflects the three challenges described
above. The system consists of three modules:
1. Verb detection: Finds some occ...
... a mi-
nority of all instances of it. Evans (2001) reports
that his corpus of approx. 370.000 words from the
SUSANNE corpus and the BNC contains 3.171
examples of it, approx. 29% of which are ... variety of genres. They count
2.337 instances of it, 646 of which (28%) are non-
referential. Finally, Clemente et al. (2004) report
that in the GENIA corpus of medical abstracts the...