... WORD,
PHRASE AND SENTENCE
Robert F. Simmons
Univ. of Texas, Austin
Among the relative verities of natural language
processing are the facts that morphemes and words are
primary semantic units, and that their co-occurrence in
phrases and sentences provides cues for selecting sense
meanings. In this sess...
... number of
other nodes (Barabási and Albert, 1999). In the
semantic networks, such “hub” nodes correspond to
basic and highly polysemous words such as make
and money, and these words are likely to ... applications
associated with semantic processing (Widdows,
2004) and for human modeling in cognitive science
(Gärdenfors, 2000; Landauer and Dumais,
1997). There are also good r...
... results and
Table 3 shows the final NER F1 results. We compare
to the state-of-the-art methods of Ando and Zhang
(2005), Suzuki and Isozaki (2008), and, for
NER, Lin and Wu (2009). Tables 2 and 3 ... 81.44
Brown+Gaz 93.25 89.41 82.71
Lin and Wu (2009), 3.4B - 88.44 -
Ando and Zhang (2005), 27M 93.15 89.31 -
Suzuki and Isozaki (2008), 37M 93.66 89.36 -
Suzuki and Isozaki (200...
... 1.03
1
(Vogel
et al., 1996; Och and Ney, 2003), and HM-BiTAM
(Zhao and Xing, 2008) implemented by
us. GIZA++ is an implementation of IBM Model
4 and HMM, and HM-BiTAM corresponds to ζ =
0 ... GIZA++ and HM-
BiTAM with the SRH in the lines entitled “with
SRH” in Table 1. The GIZA++ and HM-BiTAM
with the SRH slightly outperformed the standard
GIZA++ and HM-BiTAM for the 10k...
... looking for phrases
that contain other phrases and replacing the subphrases
with nonterminal symbols, it gets hierarchical
rules. Hierarchical rules are more powerful
than conventional phrases ... approach,
$$p(e \mid f \to f') = \frac{\mathrm{count}(e, f \to f')}{\sum_{e'} \mathrm{count}(e', f \to f')} \tag{1}$$
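The relative-frequency estimate in Equation (1) can be illustrated with a minimal sketch. The data layout is an assumption made only for illustration: each observation is a hypothetical pair of a source-side context `(f, f_prime)` and a target-side item `e`, and the function name `relative_frequency` and the toy counts are not taken from the original work.

```python
from collections import Counter, defaultdict


def relative_frequency(observations):
    """Estimate p(e | f -> f') as count(e, f -> f') / sum over e' of count(e', f -> f').

    `observations` is an iterable of ((f, f_prime), e) pairs; this layout is
    a hypothetical choice for the sketch, not the original data format.
    """
    joint = Counter(observations)          # count(e, f -> f')
    totals = defaultdict(int)              # sum over e' of count(e', f -> f')
    for (context, _e), c in joint.items():
        totals[context] += c
    return {(context, e): c / totals[context] for (context, e), c in joint.items()}


# Toy observations (invented for illustration only).
observations = [
    (("f", "f2"), "e1"),
    (("f", "f2"), "e1"),
    (("f", "f2"), "e2"),
    (("g", "g2"), "e3"),
]
probs = relative_frequency(observations)
```

By construction, the estimates for a fixed context `(f, f_prime)` sum to one, which is the normalization expressed by the denominator of Equation (1).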
Given a phrase pair f, e and word alignment
a, and the dependent relation of the source sentence
$d_1^J$ (J...
...
classification task on web blog corpora using
Support Vector Machine (SVM) and Conditional
Random Field (CRF) and the observed results
have shown that the CRF classifiers outperform
SVM ... module. The remaining
200 and 100 sentences, verified by language experts
for evaluation, have been used
as development and test data, respectively.
4.1 Feature Selection and Training...
... special
phenomena in SMS texts, e.g. the unique relaxed
and creative writing style and the frequent use of
unconventional and not yet standardized short-forms.
Direct modeling of these special ... correction center on typographic and
cognitive/orthographic errors (Kukich, 1992) and
use approaches (M.D. Kernighan, Church and
2
http://www.etranslator.ro and http://www...
... models has been
proposed, the phrase-based (PB) approach (Tomás
and Casacuberta, 2001; Marcu and Wong, 2002;
Zens et al., 2002). The principal innovation of the
phrase-based alignment model ... $p(\tilde{s} \mid \tilde{t})$ estimates the probability
of translating the phrase $\tilde{t}$ into the phrase $\tilde{s}$.
A phrase can consist of a single word (but
empty phrases are not allowed). Thus, the con-...
... researchers build
alignment links with bilingual corpora (Wu,
1997; Och and Ney, 2003; Cherry and Lin, 2003;
Zhang and Gildea, 2005). In order to achieve
satisfactory results, all of these ... corpora in L1-L3 and L2-L3 are available.
Using these two additional bilingual corpora, we
train two word alignment models for language
pairs L1-L3 and L2-L3, respectively. And then,...
... sentences, and the phrase extraction
method does not limit the length
of the extracted target-side phrase. For each source
phrase ranging from positions to the target
phrase is given by and
, ... be more
accurate. Now that the candidate pool has been generated,
it needs to be scored and pruned to reflect relative
confidence between candidate translations and
to remove...