Tài liệu Báo cáo khoa học: "WORD, PHRASE AND SENTENCE" pptx

... WORD, PHRASE AND SENTENCE Kob't F. Sinnnons Univ. of Texas, Austin Among the relative verities of natural language processing are the facts that morphemes and words are primary ... processing are the facts that morphemes and words are primary semantic units, and that their co-ocurrence in phrases and sentences provides cues for selecting sense meanings. In this sess...

Ngày tải lên : 21/02/2014, 20:20

2
381
0

Tài liệu Báo cáo khoa học: "Word Vectors and Two Kinds of Similarity" pptx

... number of other nodes (Barab´asi and Albert, 1999). In the semantic networks, such “hub” nodes correspond to basic and highly polysemous words such as make and money, and these words are likely to ... applications associated with semantic processing (Widdows, 2004) and for human modeling in cognitive sci- ence (G¨ardenfors, 2000; Landauer and Dumais, 1997). There are also good r...

Ngày tải lên : 20/02/2014, 12:20

8
473
0

Tài liệu Báo cáo khoa học: "Word representations: A simple and general method for semi-supervised learning" doc

... results and Ta- ble 3 shows the ﬁnal NER F1 results. We compare to the state-of-the-art methods of Ando and Zhang (2005), Suzuki and Isozaki (2008), and for NER—Lin and Wu (2009). Tables 2 and 3 ... 81.44 Brown+Gaz 93.25 89.41 82.71 Lin and Wu (2009), 3.4B - 88.44 - Ando and Zhang (2005), 27M 93.15 89.31 - Suzuki and Isozaki (2008), 37M 93.66 89.36 - Suzuki and Isozaki (200...

Ngày tải lên : 20/02/2014, 04:20

11
687
0

Tài liệu Báo cáo khoa học: "Word Alignment with Synonym Regularization" doc

... 1.03 1 (Vo- gel et al., 1996; Och and Ney, 2003), and HM- BiTAM (Zhao and Xing, 2008) implemented by us. GIZA++ is an implementation of IBM-model 4 and HMM, and HM-BiTAM corresponds to ζ = 0 ... GIZA++ and HM- BiTAM with the SRH in the lines entitled “with SRH” in Table 1. The GIZA++ and HM-BiTAM with the SRH slightly outperformed the standard GIZA++ and HM-BiTAM for the 10k...

Ngày tải lên : 20/02/2014, 04:20

5
470
2

Tài liệu Báo cáo khoa học: "Better Filtration and Augmentation for Hierarchical Phrase-Based Translation Rules" pdf

... looking for phrases that contain other phrases and replacing the sub- phrases with nonterminal symbols, it gets hierarchical rules. Hierarchical rules are more powerful than conventional phrases ... approach, p(e|f → f ′ ) = count(e, f → f ′ )  e ′ count(e ′ , f → f ′ ) (1) Given a phrase pair f , e and word alignment a, and the dependent relation of the source sentence d J 1 (J...

Ngày tải lên : 20/02/2014, 04:20

5
416
0

Tài liệu Báo cáo khoa học: "Word to Sentence Level Emotion Tagging for Bengali Blogs" doc

... classification task on web blog corpora using Support Vector Machine (SVM) and Conditional Random Field (CRF) and the observed results have shown that the CRF classifiers outperform SVM ... module. Rest 200 and 100 sentences, verified by language ex- perts to perform evaluation have been considered as development and test data respectively. 4.1 Feature Selection and Training...

Ngày tải lên : 20/02/2014, 09:20

4
429
0

Tài liệu Báo cáo khoa học: "A Phrase-based Statistical Model for SMS Text Normalization" ppt

... special phenomena in SMS texts, e.g. the unique relaxed and creative writing style and the frequent use of unconventional and not yet standardized short- forms. Direct modeling of these special ... correction centralize on typographic and cognitive/orthographic errors (Kukich, 1992) and use approaches (M.D. Kernighan, Church and 2 http://www.etranslator.ro and http://www...

Ngày tải lên : 20/02/2014, 12:20

8
399
0

Tài liệu Báo cáo khoa học: "Statistical phrase-based models for interactive computer-assisted translation" pdf

... models has been proposed, the phrase- based (PB) approach (Tom ´ as and Casacuberta, 2001; Marcu and Wong, 2002; Zens et al., 2002). The principal innovation of the phrase- based alignment model ... p(˜s| ˜ t) estimates the probabil- ity of translating the phrase ˜ t into the phrase ˜s. A phrase can be comprised of a single word (but empty phrases are not allowed). Thus, the con-...

Ngày tải lên : 20/02/2014, 12:20

7
308
0

Tài liệu Báo cáo khoa học: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

... researchers build alignment links with bilingual corpora (Wu, 1997; Och and Ney, 2003; Cherry and Lin, 2003; Zhang and Gildea, 2005). In order to achieve satisfactory results, all of these ... corpora in L1-L3 and L2-L3 are available. Using these two additional bilingual corpora, we train two word alignment models for language pairs L1-L3 and L2-L3, respectively. And then,...

Ngày tải lên : 20/02/2014, 12:20

8
359
0

Tài liệu Báo cáo khoa học: "Effective Phrase Translation Extraction from Alignment Models" ppt

... sentences, and the phrase extraction method does not have limits on the length of extracted target side phrase. For each source phrase ranging from positions to the target phrase is given by and , ... be more accurate. Now that the candidate pool has been gen- erated, it needs to be scored and pruned to reﬂect relative conﬁdence between candidate translations and to remove...

Ngày tải lên : 20/02/2014, 16:20

8
323
0