0

bootstrapping named entity recognition by means of active machine learning

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt

Báo cáo khoa học

... English.408Proceedings of ACL-08: HLT, pages 407–415,Columbus, Ohio, USA, June 2008.c2008 Association for Computational LinguisticsInducing Gazetteers for Named Entity Recognition by Large-scale Clustering of ... applications in speech recognition. Proceedings of the IEEE, 77(2):257–286.E. Riloff and R. Jones. 1999. Learning dictionaries forinformation extraction by multi-level bootstrapping. In 16th ... of a gazetteer and its effect. We think this is one of theimportant directions of future research.Parallelization has recently regained attention inthe machine learning community because of...
  • 9
  • 428
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition" pdf

Báo cáo khoa học

... label of a named entity is “O”,which indicates a non -named entity. For 98.0% of the named entities in the training data of the sharedtask in the 2004 JNLPBA, the label of the preced-ing entity ... End-Word” capture the tendency of the length of a named entity. “Count feature” captures the ten-dency for named entities to appear repeatedly inthe same sentence.“Preceding Entity and Prev Word” are ... N is the length of sentence andK is the size of label set. And that of training infirst order semi-CRFs is O(K2LN). The increase of the cost is used to transfer non-adjacent entity information.To...
  • 8
  • 527
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Joint Inference of Named Entity Recognition and Normalization for Tweets" doc

Báo cáo khoa học

... number of organizationsthat are incorrectly labeled as PERSON by SBR, arenow correctly recognized by our method.532 by recognition errors. Another challenge of NENis the dearth of information ... misconceptions in named entity recognition. InCoNLL, pages 147–155.Alan Ritter, Sam Clark, Mausam, and Oren Etzioni.2011. Named entity recognition in tweets: An ex-perimental study. In Proceedings of the ... nature of tweets, there are rich variations of named enti-ties in them. According to our investigation on thedata set provided by Liu et al. (2011), every named entity in tweets has an average of...
  • 10
  • 444
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Incorporating speech recognition confidence into discriminative named entity recognition of speech data" ppt

Báo cáo khoa học

... 1999. A Maximum Entropy Ap-proach to Named Entity Recognition. Ph.D. thesis,New York University.Hai Leong Chieu and Hwee Tou Ng. 2003. Named en-tity recognition with a maximum entropy approach.In ... NERNER is a kind of chunking problem that canbe solved by classifying words into NE classesthat consist of name categories and such chunk-ing states as PERSON-BEGIN (the beginning of a person’s ... ConclusionWe proposed a method for NER of speech datathat incorporates ASR confidence as a feature of discriminative NER, where the NER model623 by a set of binary values, the same as with anSVM-based...
  • 8
  • 311
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Mining Wiki Resources for Multilingual Named Entity Recognition" pdf

Báo cáo khoa học

... this paper, we describe a system by which the multilingual characteristics of Wikipedia can be utilized to annotate a large corpus of text with Named Entity Recognition (NER) tags requiring ... detail the process by which we use the Category structure inherent to Wikipedia to determine the named entity type of a proposed entity. We further describe the methods by which English language ... trained on up to 40,000 words of human-annotated newswire. 1 Introduction Named Entity Recognition (NER) has long been a major task of natural language processing. Most of the research in the field...
  • 9
  • 429
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Acceptability Prediction by Means of Grammaticality Quantification" doc

Báo cáo khoa học

... adequacy of the PG grammatical-ity indices to the measurements was investigated by means of resultant analysis. We adapted theparameters of the model in order to arrive at agood fit based on half of ... grammaticality of theinput. In other words, instead of deciding on thegrammaticality of the input, we can give an indica-tion of its grammaticality, quantified on the basis of the description of the ... ConstNP{Det, AP, N, Pro}(set of possible constituents of NP)In PG, each category of the grammar is de-scribed with a set of properties. A grammar is thenmade of a set of properties. Parsing an...
  • 8
  • 303
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" docx

Báo cáo khoa học

... this kind of systems, aset of rules is automatically learned and revised by an expert. An alternative can be the dynamicextension of an existing set of core rulespreviously defined by the expert, ... entropy approachfor named entity recognition. PhD Thesis, NewYork University.Collins M. and Singer Y. (1999) Unsupervisedmodels for named entity classification. InProceedings of EMNLP/WVLC, 1999, ... languagetechnology is not much developed for most of them. This has a big consequence for named entity recognition: for certain languages likemost of the European languages, we benefitfrom already...
  • 4
  • 279
  • 0
Pricing Portfolio Credit Derivatives by Means of Evolutionary Algorithms doc

Pricing Portfolio Credit Derivatives by Means of Evolutionary Algorithms doc

Quản trị kinh doanh

... assistance of the Deutsche Forschungsgemeinschaft by funding my research at the University of Tübingen, and of the Stiftung LandesbankBaden-Württemberg by supporting the publication of this dissertation. ... the exchange of creditrisk is an interesting means of risk management, as long as it allows for maintenance of the client relationship. Eliminating the credit risk of a client by simply selling ... approach, which incorporates dependence by means of copula func-tions, allows the modeling of the dependence structure to be separated from the modeling of individual defaults. Li (2000) introduced...
  • 176
  • 378
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Japanese Named Entity Recognition based on a Simple Rule Generator and Decision Tree Learning" pdf

Báo cáo khoa học

... memt,January.Manabu Sassano and Takehito Utsuro. 2000. Named entity chunking techniques in supervised learning for Japanese named entity recognition. In Proceed-ings of the International Conference on Computa-tional ... tree learning forclassification of a noun phrase by assuming that named entities are noun phrases. Gallippi (1996)employs hundreds of hand-crafted templates asfeatures for decision tree learning. ... rule is refined by decision tree learning. By applying the refined recognition rules to a newdocument, we get NE candidates. Then, non-overlapping candidates are selected by a kind of longest match...
  • 8
  • 530
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "The Multilingual Named Entity Recognition Framework" ppt

Báo cáo khoa học

... resources and tools for named entity recognition. A team of computational linguist students develops thisThe members of the INaLCO Named Entity Groupare: A. Acoulon, C. Avaux, L. Beroff-Beneat-,A. ... this kind of systems, aset of rules is automatically learned and revised by an expert. An alternative can be the dynamicextension of an existing set of core rulespreviously defined by the expert, ... interesting classification of named entity recognition systems.•Manually created rule-based systems. Inthis kind of system, developers initiallyelaborate a set of patterns that will be applied...
  • 4
  • 283
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Framework for Unifying Named Entity Recognition and Disambiguation Extraction Tools" pot

Báo cáo khoa học

... the comparison of the perfor-mance of these services as well as their pos-sible combination. We address this problem by proposing NERD, a framework whichunifies 10 popular named entity extractorsavailable ... extract the list of Named Entity, their classification and the URIs that dis-ambiguate these entities. The main purpose of thisinterface is to enable a human user to assess thequality of the extraction ... Evaluating Named Entity Recognition Tools inthe Web of Data. 10thInternational Semantic WebConference (ISWC’11), Demo Session, Bonn, Ger-many.Rizzo G. and Troncy R. 2011. NERD: Evaluat-ing Named...
  • 4
  • 466
  • 0
Báo cáo khoa học:

Báo cáo khoa học: " Named Entity Recognition using an HMM-based Chunk Tagger" pptx

Báo cáo khoa học

... Proceedings of the 40th Annual Meeting of the Association forattractive in that it is trainable and adaptable and the maintenance of a machine- learning system is much cheaper than that of a rule-based ... the performance of a machine- learning system is always poorer than that of a rule-based one by about 2% [Chinchor95b] [Chinchor98b]. This may be because current machine- learning approaches ... gazetteers: lists of names of persons, organizations, locations and other kinds of named entities. This sub-feature can be determined by finding a match in the gazetteer of the corresponding...
  • 8
  • 473
  • 1
Báo cáo khoa học:

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Báo cáo khoa học

... NgDepartment of Computer ScienceSchool of ComputingNational University of Singapore3 Science Drive 2Singapore 117543nght@comp.nus.edu.sgAbstractThis paper describes how a machine- learning named entity ... the named entity task consists of labeling named entities withthe classes PERSON, ORGANIZATION, LOCA-TION, DATE, TIME, MONEY, and PERCENT. Weconducted experiments on upper case named entity recognition, ... work on un-supervised learning for mixed case named entity recognition (Collins and Singer, 1999; Cucerzanand Yarowsky, 1999). Collins and Singer (1999)investigated named entity classification...
  • 8
  • 285
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Named Entity Recognition for Catalan Using Spanish Resources" potx

Báo cáo khoa học

... Language-Independent Named Entity Recognition. In Proceedings of CoNLL-2002, pages 155-158. Taipei, Taiwan.E. Tjong Kim Sang. 2002b. Memory-Based Named Entity Recognition. In Proceedings of CoNLL-2002,pages ... Sassano. 2002. Learning with Multiple Stacking for Named Entity Recognition. In Proceedings of CoNLL-2002, pages191-194. Taipei, Taiwan.R.Weischedel. 1995. BBN: Description of the PLUMSystem ... BarcelonaIcarreras,lluism,padroWsi.upc.esAbstractThis work studies Named Entity Recog-nition (NER) for Catalan without mak-ing use of annotated resources of thislanguage. The approach presented isbased on machine learning techniquesand...
  • 8
  • 288
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Multilingual Named Entity Recognition using Parallel Data and Metadata from Wikipedia" potx

Báo cáo khoa học

... sourceand target entity by computing Levenshtein andother distance metrics between the source entity and the closest transliteration of the target (out of a10-best list of transliterations). ... foreign candidate entity strings (sequences of tokens) and best corre-sponding English candidate entities. The candidateEnglish entities are defined by the union of entitiesproposed by the Wiki-based ... followed the approach of Richman and Schone(2008) to derive named entity annotations of bothEnglish and foreign phrases in Wikipedia, usingWikipedia metadata. The following sources of in-formation...
  • 9
  • 333
  • 0

Xem thêm

Tìm thêm: hệ việt nam nhật bản và sức hấp dẫn của tiếng nhật tại việt nam xác định các mục tiêu của chương trình khảo sát các chuẩn giảng dạy tiếng nhật từ góc độ lí thuyết và thực tiễn xác định thời lượng học về mặt lí thuyết và thực tế tiến hành xây dựng chương trình đào tạo dành cho đối tượng không chuyên ngữ tại việt nam điều tra đối với đối tượng giảng viên và đối tượng quản lí điều tra với đối tượng sinh viên học tiếng nhật không chuyên ngữ1 khảo sát các chương trình đào tạo theo những bộ giáo trình tiêu biểu xác định mức độ đáp ứng về văn hoá và chuyên môn trong ct phát huy những thành tựu công nghệ mới nhất được áp dụng vào công tác dạy và học ngoại ngữ mở máy động cơ lồng sóc mở máy động cơ rôto dây quấn các đặc tính của động cơ điện không đồng bộ hệ số công suất cosp fi p2 đặc tuyến mômen quay m fi p2 đặc tuyến dòng điện stato i1 fi p2 động cơ điện không đồng bộ một pha sự cần thiết phải đầu tư xây dựng nhà máy thông tin liên lạc và các dịch vụ từ bảng 3 1 ta thấy ngoài hai thành phần chủ yếu và chiếm tỷ lệ cao nhất là tinh bột và cacbonhydrat trong hạt gạo tẻ còn chứa đường cellulose hemicellulose