0

a method for word sense

Báo cáo khoa học:

Báo cáo khoa học: "A Method for Word Sense Disambiguation of Unrestricted Text" potx

Báo cáo khoa học

... Computational Linguistics. J. Stetina, S. Kurohashi, and M. Nagao. 1998. General word sense disambiguation method based on a full sentential context. In Us- age of WordNet in Natural Language ... Mihalcea and D.I. Moldovan. 1999. An au- tomatic method for generating sense tagged corpora. In Proceedings of AAAI-99, Or- lando, FL, July. (to appear). G. Miller, M. Chodorow, S. Landes, ... verb and noun appear. First, Algorithm 1 was applied and search the Internet using AltaVista, for all possi- ble pairs V-N that may be created using re- vise and the words from the similarity...
  • 7
  • 378
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "SenseRelate::TargetWord – A Generalized Framework for Word Sense Disambiguation" doc

Báo cáo khoa học

... lexical sample format, which is anXML–based format that has been used for both theSENSEVAL-2 and SENSEVAL-3 exercises. A file inthis format includes a number of instances, each onemade up ... Poster and Demonstration Sessions,pages 73–76, Ann Arbor, June 2005.c2005 Association for Computational LinguisticsSenseRelate::TargetWord – A Generalized Framework for Word Sense DisambiguationSiddharth ... disambiguation that com-putes the intended sense of a target word, using WordNet-based measures of seman-tic relatedness (Patwardhan et al., 2003).SenseRelate::TargetWord is a Perl pack-age that implements...
  • 4
  • 349
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text" docx

Báo cáo khoa học

... ambigu-ous word in a given context. As a fundamentaltask in natural language processing (NLP), WSDcan benefit applications such as machine transla-tion (Chan et al., 200 7a; Carpuat and Wu, 2007)and ... systems are publicly available – the onlyother publicly available WSD system that we areaware of is SenseLearner (Mihalcea and Csomai,2005). Therefore, for applications which employWSD as a component, ... 200megabytes.3 EvaluationIn our experiments, we evaluate our IMS systemon SensEval and SemEval tasks, the benchmarkdata sets for WSD. The evaluation on both lexical-sample and all-words tasks...
  • 6
  • 355
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Combination of Active Learning and Semi-supervised Learning Starting with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query" pdf

Báo cáo khoa học

... Electric Corporation 5-1-1 Ofuna, Kamakura, Kanagawa, Japan {Imamura.Makoto@bx,Takayama.Yasu hiro@ea}.MitsubishiElectric.co.jp Nobuhiro Kaji, Masashi Toyoda and Masaru Kitsuregawa Institute ... with Positive and Unlabeled Examples for Word Sense Disambiguation: An Empirical Study on Japanese Web Search Query Makoto Imamura and Yasuhiro Takayama Information Technology R&D Center, ... train WSD systems we need a large amount of positive and negative examples. In the real Web mining application, how to acquire training data for a various target of analysis has become a major...
  • 4
  • 441
  • 1
Tài liệu The Cost of a Military Person-Year - A Method for Computing Savings from Force Reductions pptx

Tài liệu The Cost of a Military Person-Year - A Method for Computing Savings from Force Reductions pptx

Khoa học xã hội

... of a military person-year is equal to a metric called regular military compensation (RMC). RMC includes average basic pay for each military grade, basic allowance for housing, basic allowance ... on data for the last 50 years would be only a starting point. e nature of modern warfare and modern casualty treatment options have changed the ratio and cost of deaths and disabilities drastically. ... two boards of actuaries after analysis of past data trends and comparisons to similar assump-tions in other relevant federal programs and private plans. In cases where the current rate is...
  • 153
  • 396
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Method for Measuring Machine Translation Confidence" docx

Báo cáo khoa học

... that this process also refers to the inability of the multinational naval forceswydyf an hdhh alamlyt ayda tshyr aly adm qdrt almtaddt aljnsyt alqwat albhryt (a) Source phraseSource POS and ... high-levellinguistic structures are likely to transfer across certainlanguage pairs. For example, prepositional phrases(PP) in Arabic and English are similar in a sense that PPs generally appear at the end of ... hdhh alamlyt ayda tshyr aly adm qdrt almtaddt aljnsyt alqwat albhrytHe adds that this process also refers to the inability of the multinational naval forcesMT outputSource POSSourceTarget...
  • 9
  • 543
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "ParaSense or How to Use Parallel Corpora for Word Sense Disambiguation" pdf

Báo cáo khoa học

... their aligned translations (and probabil-319algorithm parameters in machine learning of language.Machine Learning, pages 84–95.I. Dagan and A. Itai. 1994. Word sense disambiguationusing a second ... state-of-the-art systems for all languages, ex-cept for Spanish where the results are very similar.As all steps are run automatically, this multilingualapproach could be an answer for the acquisition ... bot-tleneck, as long as there are parallel corpora avail-able for the targeted languages. Although large mul-tilingual corpora are still rather scarce, we stronglybelieve there will be more parallel...
  • 6
  • 537
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence" pptx

Báo cáo khoa học

... grammatical and n-gram based statistical language constraints, and uses a robust parsing technique to apply the grammatical constraints described by context-free grammar (Tsukada et aL, 97). ... the Error-Pattem-Database and String-Database can be mechanically prepared, which reduces the effort required to prepare the databases and makes it possible to apply this method to a new recognition ... correcting accuracy by changing algorithms and will also try to improve translation performance by combining our method with Wakita's method. References T. Araki et al., 93. A Method for Detecting...
  • 5
  • 588
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation" pdf

Báo cáo khoa học

... thetwo SENSEVAL tasks. This gave a set of 6 nouns for SENSEVAL-2 and 9 nouns for SENSEVAL-3. For each noun, we gathered a maximum of 500parallel text examples as training data, similar towhat ... sampling with incomplete infor-mation. Annals of Mathematical Statistics, 26(4).Yee Seng Chan and Hwee Tou Ng. 200 5a. Scalingup word sense disambiguation via parallel texts. InProc. of AAAI05.Yee ... on data whichwas automatically gathered from the Internet. Theauthors reported a 14% improvement in accuracyif they have an accurate estimate of the sense pri-ors in the evaluation data and...
  • 8
  • 268
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Learning Expressive Models for Word Sense Disambiguation" pot

Báo cáo khoa học

... be available for many examples. The problem of data sparse-ness increases as more knowledge is exploited and this can cause problems for the machine learning algorithms. A final disadvantage ... 1st_prep_right, back). Rule_2. sense (A, chegar) :- has_rel (A, subj, B), has_bigram (A, today, B), has_bag_trans (A, hoje). Rule_3. sense (A, chegar) :- satisfy_restriction (A, [animal, human], [concrete]); ... In-troduction to Machine Translation. Academic Press, Great Britain. Abolfazl K. Lamjiri, Osama El Demerdash, Leila Kos-seim. 2004. Simple features for statistical Word Sense Disambiguation. Proceedings...
  • 8
  • 380
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Domain Adaptation with Active Learning for Word Sense Disambiguation" pdf

Báo cáo khoa học

... and accuracy improvement is less than1% after all the available WSJ adaptation examples are addedas additional training data. To obtain a clearer picture of theadaptation process, we discard ... in BC andWSJ, average MFS accuracy, average number of BCtraining, and WSJ adaptation examples per noun.data, and the rest of the WSJ examples are desig-nated as in-domain adaptation data. The ... pos-teriori (MAP) estimation, and successfully used it for probabilistic context-free grammar domain adap-tation (Roark and Bacchiani, 2003) and languagemodel adaptation (Bacchiani and Roark, 2003).Count-merging...
  • 8
  • 363
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study" potx

Báo cáo khoa học

... training data so that we can do a fair comparison between the accuracy of the parallel text alignment approach versus the manual sense- tagging approach. After training a WSD classifier for w ... However, large-scale, good-quality parallel corpora have recently become available. For ex-ample, six English-Chinese parallel corpora are GIZA++. For two of the corpora, Hong Kong Han-sards and ... corpora. To ensure a fairer comparison, for each of the 10-trial manually sense- tagged training data that gave rise to the ac-curacy figure M2 of a noun w, we extracted a new subset of 10-trial...
  • 8
  • 380
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Method for Relating Multiple Newspaper Articles by Using Graphs, and Its Application to Webcasting" pptx

Báo cáo khoa học

... Shimo-tsuruma, Yamato-shi, Kanagawa-ken 242 Japan { uramoto, takeda } @trl. ibm. co.j p Abstract This paper describes methods for relating (thread- ing) multiple newspaper articles, and for visualizing ... quantity of information available today makes it difficult to search for and understand the information that we want. If there are many related documents about a topic, it is important to capture ... news- paper articles automatically, and its application for a Webcasting application. A set of article on a par- I htt p://www.pointcast.com ticular topic is ordered chronologically, and the...
  • 7
  • 419
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "An Exact A* Method for Deciphering Letter-Substitution Ciphers" doc

Báo cáo khoa học

... re-trieval and data mining, in which case it is impor-tant to be able to read through them automatically,without resorting to a human annotator. The holygrail in this area would be an application ... gray are unreachable.The cell at (d) is filled using the trigram probabilities and the probability of the path at starting at (a) .In all of the data considered, the frequency ofspaces was far ... 48th Annual Meeting of the Association for Computational Linguistics, pages 1040–1047,Uppsala, Sweden, 11-16 July 2010.c2010 Association for Computational LinguisticsAn Exact A* Method for...
  • 8
  • 350
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Topic Models for Word Sense Disambiguation and Token-based Idiom Detection" pdf

Báo cáo khoa học

... were discarded.tions caused by tagging or lemmatization errors,we manually corrected any bad tags and lemmas for the target instances.4 Sense Paraphrases For word sense disam-biguation tasks, ... Boyd-Graberet al. (2007) enhance the basic LDA algorithm byincorporating WordNet senses as an additional la-tent variable. Instead of generating words directlyfrom a topic, each topic is associated ... in sense paraphrasesincreases performance. Longer paraphrases con-tain more information, and they are statisticallymore stable for inference.We find that nouns get the greatest perfor-mance...
  • 10
  • 371
  • 0

Xem thêm

Tìm thêm: hệ việt nam nhật bản và sức hấp dẫn của tiếng nhật tại việt nam xác định các mục tiêu của chương trình khảo sát các chuẩn giảng dạy tiếng nhật từ góc độ lí thuyết và thực tiễn xác định thời lượng học về mặt lí thuyết và thực tế tiến hành xây dựng chương trình đào tạo dành cho đối tượng không chuyên ngữ tại việt nam điều tra đối với đối tượng giảng viên và đối tượng quản lí điều tra với đối tượng sinh viên học tiếng nhật không chuyên ngữ1 khảo sát thực tế giảng dạy tiếng nhật không chuyên ngữ tại việt nam nội dung cụ thể cho từng kĩ năng ở từng cấp độ xác định mức độ đáp ứng về văn hoá và chuyên môn trong ct các đặc tính của động cơ điện không đồng bộ đặc tuyến hiệu suất h fi p2 đặc tuyến mômen quay m fi p2 đặc tuyến tốc độ rôto n fi p2 đặc tuyến dòng điện stato i1 fi p2 thông tin liên lạc và các dịch vụ phần 3 giới thiệu nguyên liệu từ bảng 3 1 ta thấy ngoài hai thành phần chủ yếu và chiếm tỷ lệ cao nhất là tinh bột và cacbonhydrat trong hạt gạo tẻ còn chứa đường cellulose hemicellulose chỉ tiêu chất lượng theo chất lượng phẩm chất sản phẩm khô từ gạo của bộ y tế năm 2008 chỉ tiêu chất lượng 9 tr 25