acquiring lexical generalizations from corpora

Tài liệu Báo cáo khoa học: "Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations" pdf

Tài liệu Báo cáo khoa học: "Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations" pdf

Ngày tải lên : 20/02/2014, 19:20
... percentages are the aver- age of the judges' individual classifications. 399 Acquiring Lexical Generalizations from Corpora: A Case Study for Diathesis Alternations Maria Lapata School of Cognitive ... threshold values varied from frame to flame but not from verb to verb and were determined by taking into account for each frame its overall frame frequency which was es- timated from the COMLEX subcategorization ... alternating verbs from large balanced corpora by using partial- parsing methods and taxonomic information, and discuss how corpus data can be used to quantify lin- guistic generalizations. ...
  • 8
  • 483
  • 0
Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx

Báo cáo khoa học: "Automatic Acquisition of Adjectival Subcategorization from Corpora" docx

Ngày tải lên : 08/03/2014, 04:22
... automatic acquisition of lexical in- formation from large repositories of unannotated text (such as the web, corpora of published text, etc.) is starting to produce large scale lexical re- sources ... paper describes a novel system for acquiring adjectival subcategorization frames (SCFs) and associated frequency information from English corpus data. The system incorporates a decision-tree classifier ... frames from untagged text. In Meet- ing of the Association for Computational Linguistics, pages 209–214. E. J. Briscoe and J. Carroll. 1997. Automatic Extraction of Subcategorization from Corpora. ...
  • 8
  • 390
  • 0
A STUDY ON LEXICAL COHESION IN VIETNAMESE AND ENGLISH CORPORATE ADVERTISINGS

A STUDY ON LEXICAL COHESION IN VIETNAMESE AND ENGLISH CORPORATE ADVERTISINGS

Ngày tải lên : 29/01/2014, 10:43
... CHAPTER 3: A COMPARATIVE STUDY ON LEXICAL COHESIVE DEVICES IN ENGLISH AND VIETNAMESE CORPORATE ADVERTISEMENTS 1. General picture of lexical cohesive devices in Corporate advertisements Understanding ... clearly and in details. However, in Vietnamese corporate advertisements, the copywriters hardly notice the equivalent lexical items that are rendered from Vietnamese into English or vice versus. ... description of lexical cohesion features in English - figure out how these devices are used in texts - make comparative analysis of lexical cohesion between English and Vietnamese corporate advertisements...
  • 41
  • 1.1K
  • 2
Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

Tài liệu Báo cáo khoa học: "Extracting Comparative Sentences from Korean Text Documents Using Comparative Lexical Patterns and Machine Learning Techniques" doc

Ngày tải lên : 20/02/2014, 09:20
... Singapore, 4 August 2009. c 2009 ACL and AFNLP Extracting Comparative Sentences from Korean Text Documents Us- ing Comparative Lexical Patterns and Machine Learning Techniques Seon Yang Department ... comparative sentences from text documents. This paper first investigates many comparative sentences referring to pre- vious studies and then defines a set of compar- ative keywords from them. A sentence ... to eliminate non- comparative sentences only from comparative sentence candidates with a CKL2 keyword. 4 Eliminating Non-comparative Sen- tences from the Candidates 3 As you can see in...
  • 4
  • 536
  • 0
Tài liệu Báo cáo khoa học: "Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora" ppt

Tài liệu Báo cáo khoa học: "Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora" ppt

Ngày tải lên : 20/02/2014, 12:20
... (Cucerzan and Yarowsky, 1999) and (Collins and Singer, 1999) present algorithms to obtain NEs from untagged corpora. However, they focus on the classification stage of already segmented entities, and ... feature vector from this example in the following manner: First, we split both words into all possible substrings of up to size two: We build a feature vector by coupling sub- strings from the two ... Computational Linguistics Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora Alexandre Klementiev Dan Roth Dept. of Computer Science University of Illinois Urbana,...
  • 8
  • 391
  • 0
Tài liệu Báo cáo khoa học: "Mining metalinguistic activity in corpora to create lexical resources using Information Extraction techniques: the MOP system" doc

Tài liệu Báo cáo khoa học: "Mining metalinguistic activity in corpora to create lexical resources using Information Extraction techniques: the MOP system" doc

Ngày tải lên : 20/02/2014, 15:20
... information from free-text has been successfully carried out in the past (Hearst, 1999; Manning, 1993), automatically ex- tracting lexical resources (including terminologi- cal definitions) from text ... information from a machine-readable dic- tionary. 3 Locating metalinguistic information in text: two approaches When implementingan IE application to mine metalinguistic information from text, ... tackle is how to obtain a reliable set of can- didate sentences from free text for input into the next phases of extraction. From our initial corpus analysis we selected 44 patterns that showed...
  • 8
  • 459
  • 0
Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx

Tài liệu Báo cáo khoa học: "Bilingual Terminology Acquisition from Comparable Corpora and Phrasal Translation to Cross-Language Information Retrieval" pptx

Ngày tải lên : 20/02/2014, 16:20
... Japanese-English language pair, especially if involving the comparable corpora. Re-scoring through the Comparable Corpora Comparable corpora could be considered for the disambiguation of translation ... comparable corpora- based techniques, re- spectively compared to the hybrid two-stages com- parable corpora and linguistics-based pruning. The proposed approach based on bi-directional comparable corpora ... TR2-007. P. Fung. 2000. A Statistical View of Bilingual Lexi- con Extraction: From Parallel Corpora to Non-Parallel Corpora. In Jean Veronis, Ed. Parallel Text Process- ing. G. Grefenstette. 1999....
  • 4
  • 377
  • 0
Tài liệu Báo cáo khoa học: "INSIDE-OUTSIDE REESTIMATION FROM PARTIALLY BRACKETED CORPORA" ppt

Tài liệu Báo cáo khoa học: "INSIDE-OUTSIDE REESTIMATION FROM PARTIALLY BRACKETED CORPORA" ppt

Ngày tải lên : 20/02/2014, 21:20
... ( (from SF0) (to San Francisco))))).) GR (Tell ((me (((about the) public) transportation)) ( (from SF0) ((to San) (Francisco .))))) GB ((Tell (me (about (((the public) transportation) ( (from ... corpus, the inside prob- abilities of longer spans of c are computed from INSIDE-OUTSIDE REESTIMATION FROM PARTIALLY BRACKETED CORPORA Fernando Pereira 2D-447, AT~zT Bell Laboratories PO Box ... inferred from raw text. In addition, the number of iterations needed to reach a good grammar can be reduced; in extreme cases, a good solution is found from parsed text but not from raw text....
  • 8
  • 285
  • 0
Tài liệu Báo cáo khoa học: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora" doc

Tài liệu Báo cáo khoa học: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora" doc

Ngày tải lên : 20/02/2014, 22:20
... nouns or proper nouns is converted from their positions in the text into a vector. 3. Match pairs of positional difference vec- tors~ giving scores. All vectors from English and Chinese are matched ... dim(V2) 240 A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Pascale Fung Computer Science Department Columbia University New York, NY ... in the texts. For every word pair from this lexicon, we had ob- tained a DTW score and a DTW path. If we plot the points on the DTW paths of all word pairs from the lexicon, we get a graph...
  • 8
  • 426
  • 0
Tài liệu Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" docx

Tài liệu Báo cáo khoa học: "Creating a Multilingual Collocation Dictionary from Large Text Corpora" docx

Ngày tải lên : 22/02/2014, 02:20
... Data from Bilingual Texts. In Pro- ceedings of the First International Lexical Acquisition Workshop, Detroit. Church, K., Gale, W., Hanks, P., and Hindle, D. (1991). Using Statistics in Lexical ... linguistic analysis. The originality of our approach comes from the fact that collocations are not extracted from raw texts, but rather from syntactically parsed texts. The lin- guistic analysis ... textual corpora from the World Trade Organisation (WTO), which consist in parallel documents in three languages: English, French and Spanish. All the examples given in this paper are taken from...
  • 4
  • 479
  • 0
Tài liệu Báo cáo khoa học: "Effect of Cross-Language IR in Bilingual Lexicon Acquisition from Comparable Corpora" pot

Tài liệu Báo cáo khoa học: "Effect of Cross-Language IR in Bilingual Lexicon Acquisition from Comparable Corpora" pot

Ngày tải lên : 22/02/2014, 02:20
... translation knowledge acquisition from WWW news sites, this paper studies issues on the effect of cross-language retrieval of relevant texts in bilingual lexicon ac- quisition from comparable corpora. We experimentally ... parallel/comparative corpora. However, the sizes as well as the domain of existing parallel/comparative corpora are lim- ited, while it is very expensive to manually col- lect parallel/comparative corpora. ... translation knowledge acquisition from parallel/comparative corpora, various kinds of translation knowledge are acquired. Within this framework of translation knowledge acquisition from WWW news sites, this...
  • 8
  • 477
  • 0
Báo cáo khoa học: "Prototyping virtual instructors from human-human corpora" pdf

Báo cáo khoa học: "Prototyping virtual instructors from human-human corpora" pdf

Ngày tải lên : 07/03/2014, 22:20
... this paper we presented a novel algorithm for rapidly prototyping virtual instructors from human- human corpora without manual annotation. Using our algorithm and the GIVE corpus we have gener- ated ... sum, this paper presents a novel way of au- tomatically prototyping task-oriented virtual agents from corpora who are able to effectively and natu- rally help a user complete a task in a virtual ... world. References Sudeep Gandhe and David Traum. 2007. Creating spo- ken dialogue characters from corpora without annota- tions. In Proceedings of Interspeech, Belgium. Andrew Gargett, Konstantina...
  • 6
  • 220
  • 0