combining a statistical language model

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Ngày tải lên : 20/02/2014, 15:20
... NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... Recognition Evaluation (LRE) data. The database was intended to establish a baseline of performance capability for language recognition of conversational tele- phone speech. The database contains recorded ... by a chan- nel noise. The n-gram language model has achieved equal amounts of success in both tasks, e.g. n-character slice for text categorization by lan- guage (Cavnar and Trenkle, 1994) and...
  • 8
  • 436
  • 0
Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Ngày tải lên : 20/02/2014, 15:20
... measures are inadequate due to their reliance on vocabulary lists and/or a superfi- cial representation of syntax. Our approach uses n- gram language models as a low-cost automatic ap- proximation of ... syntactic and semantic analy- sis. Statistical language models (LMs) are used suc- cessfully in this way in other areas of NLP such as speech recognition and machine translation. We also use a ... categories relative to each other. 4.1 Statistical Language Models Statistical LMs predict the probability that a partic- ular word sequence will occur. The most commonly used statistical language...
  • 8
  • 446
  • 0
Tài liệu Báo cáo khoa học: "Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model " pptx

Tài liệu Báo cáo khoa học: "Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model " pptx

Ngày tải lên : 20/02/2014, 18:20
... Statistical Language Model Masaaki NAGATA NTT Information and Communication Systems Laboratories 1-1 Hikari-no-oka Yokosuka-Shi Kanagawa, 239-0847 Japan nagata@nttnly, isl. ntt. co. jp Abstract ... approxi- mate word matching method using character shape similarity, and a word segmentation algorithm us- ing a statistical language model. By using a sta- tistical OCR model and character shape ... present a novel OCR error correction method for languages without word delimiters that have a large character set, such as Japanese and Chinese. It consists of a statistical OCR model, an approxi-...
  • 7
  • 472
  • 0
Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Ngày tải lên : 22/02/2014, 02:20
... Gram- matical Framework (GF) (Ranta, 2004). We create a statistical language model (SLM) directly from our interpretation grammar and compare recognition per- formance of this model against a ... of Functional Programming., Vol. 14, No. 2, pp. 145– 189. Ranta A. Grammatical Framework Homepage http://www.cs.chalmers.se/ ˜ aarne/GF, as of May 2005. Raux A. , Langner B., Black A. and Eskenazi M. ... Structure into Statistical Language Models. In Philosophical Transactions of the Royal Society of London A, 358. Solsona R., Fosler-Lussier E., Kuo H.J., Potamianos A. and Zitouni I. 2002. Adaptive Language...
  • 8
  • 381
  • 0
Tài liệu Báo cáo khoa học: "A Structured Language Model" ppt

Tài liệu Báo cáo khoa học: "A Structured Language Model" ppt

Ngày tải lên : 22/02/2014, 03:20
... Proceedings of the Human Language Technology Workshop, 272-277. ARPA. Raymond Lau, Ronald Rosenfeld, and Salim Roukos. 1993. Trigger-based language models: a maximum entropy approach. In Proceedings ... University, Baltimore, MD. Frederick Jelinek, John Lafferty, David M. Mager- man, Robert Mercer, Adwait Ratnaparkhi, Salim Roukos. 1994. Decision Tree Parsing using a Hid- den Derivational Model. ... those assigned man- ually in the Penn Treebank (Marcus95) after under- going headword percolation and binarization. All four LMs predict a word wk and they were implemented using the Maximum...
  • 3
  • 342
  • 0
Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

Ngày tải lên : 08/03/2014, 02:21
... that they have the dis- advantage of being computationally expensive, and not all relevant features can be included. A discriminative language model (DLM) assigns a score to a sentence , measuring ... spe- cific applications and therefore were able to obtain real negative examples easily. For example, Roark (2007) proposed a discriminative language model, in which a model is trained so that a correct ... June. Brian Roark, Murat Saraclar, and Michael Collins. 2007. Discriminative n-gram language modeling. computer speech and language. Computer Speech and Lan- guage, 21(2):373–392. Roni Rosenfeld, Stanley...
  • 8
  • 315
  • 0
Tài liệu Báo cáo khoa học: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining" pptx

Tài liệu Báo cáo khoa học: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining" pptx

Ngày tải lên : 19/02/2014, 19:20
... International Language Resources and Evaluation (LREC’10), Val- letta, Malta. Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, and Grzegorz Kondrak. ... system learns this as a non-transliteration but it is wrongly annotated as a transliteration in the gold standard. Arabic nouns have an article “al” attached to them which is translated in English as ... uses Hidden Markov Models (Nabende, 2010; Darwish, 2010; Jiampojamarn et al., 2010), Finite State Au- tomata (Noeman and Madkour, 2010) and Bayesian learning (Kahki et al., 2011) to learn transliteration pairs...
  • 9
  • 521
  • 0
Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Ngày tải lên : 20/02/2014, 04:20
... signif- icantly. Bear in mind that Charniak et al. (2003) in- tegrated Charniak’s language model with the syntax- based translation model Yamada and Knight pro- posed (2001) to rescore a tree-to-string ... Stochastic analysis of lexical and semantic enhanced structural language model. The 8th International Colloquium on Grammatical Inference (ICGI), 97-111. K. Yamada and K. Knight. 2001. A syntax-based ... (EMNLP), 858-867. E. Charniak. 2001. Immediate-head parsing for language models. The 39th Annual Conference on Association of Computational Linguistics (ACL), 124-131. E. Charniak, K. Knight and K. Yamada. 2003....
  • 10
  • 567
  • 0
Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx

Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx

Ngày tải lên : 20/02/2014, 07:20
... parts randomly: 5K as the adaptation corpus and 5K as the testing set. We show the ASR char- acter accuracy results after lexicon adaptation by the proposed approach in Table 3. LAICA-1 LAICA-2 A ... replaced by characters, we can treat words as a means to enhance character recog- nition accuracy. Such arguments stand at least for Chinese ASR since they evaluate on character error rate and ... total path probability mass. This can be amended by involving the discriminative language model adaptation in the iteration, which results in a unified language model and lexicon adaptation framework....
  • 9
  • 466
  • 0
Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Ngày tải lên : 20/02/2014, 09:20
... and Linda C. Bauman Peto. 1995. A hierarchical Dirichlet language model. Natural Lan- guage Engineering, 1(3):1–19. Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor processes. ... n-grams: C(ab) − C(ab∗). A( ab) = max(1, K(C(ab) − C(ab∗))) A different K constant is chosen for each n-gram order. Using this formulation as an interpolated 5- gram language model gives a cross ... Speech and Language. R. Kneser and H. Ney. 1995. Improved backing-off for m-gram language modeling. In International Confer- ence on Acoustics, Speech, and Signal Processing. David J. C. Mackay and...
  • 4
  • 425
  • 1
Tài liệu Báo cáo khoa học: "A Succinct N-gram Language Model" ppt

Tài liệu Báo cáo khoa học: "A Succinct N-gram Language Model" ppt

Ngày tải lên : 20/02/2014, 09:20
... com- pression tasks achieved a significant com- pression rate without any loss. 1 Introduction There has been an increase in available N -gram data and a large amount of web-scaled N-gram data has been ... the ACL-IJCNLP 2009 Conference Short Papers, pages 341–344, Suntec, Singapore, 4 August 2009. c 2009 ACL and AFNLP A Succinct N-gram Language Model Taro Watanabe Hajime Tsukada Hideki Isozaki NTT ... Communication Science Laboratories 2-4 Hikaridai Seika-cho Soraku-gun Kyoto 619-0237 Japan {taro,tsukada,isozaki}@cslab.kecl.ntt.co.jp Abstract Efficient processing of tera-scale text data is an important...
  • 4
  • 457
  • 0
Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt

Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt

Ngày tải lên : 20/02/2014, 15:20
... set of candidates. This computational advantage is the main reason that we adopt the local model in this paper. 3.3 Global versus Local Models Both the global and the localized log-linear models ... paper, we present a block-based model for statis- tical machine translation. A block is a pair of phrases which are translations of each other. For example, Fig. 1 shows an Arabic-English translation ... Boston, MA, May. Christoph Tillmann and Fei Xia. 2003. A Phrase-based Unigram Model for Statistical Machine Translation. In Companian Vol. of the Joint HLT and NAACL Confer- ence (HLT 03), pages...
  • 8
  • 578
  • 0
Tài liệu Báo cáo khoa học: "SIMULATING CHILDREN''''S NULL SUBJECTS: A NEARLY LANGUAGE GENERATION MODEL" ppt

Tài liệu Báo cáo khoa học: "SIMULATING CHILDREN''''S NULL SUBJECTS: A NEARLY LANGUAGE GENERATION MODEL" ppt

Ngày tải lên : 20/02/2014, 21:20
... Universal Grammar and American Sign Language: Setting the Null Argument Parameters. Dordrecht: Kluwer Academic Publishers. MacWhinney, B., & Snow, C. (1985). The Child Language Data Exchange ... form a 'maximal' phrase or XP. Lexical items are inserted as soon as the appropriate X ° heads (or XPs, for pro-forms) become available. Each time a structural unit is built, and each ... while leaving the NPL and NPI parameters set at the default (negative) values. FELICITY can also be used to address theories pertaining to other aspects of language acquisition that appear slightly...
  • 3
  • 372
  • 0
Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Ngày tải lên : 07/03/2014, 22:20
... Philadelphia, Pennsylva- nia, USA, July. Matt Post and Daniel Gildea. 2008. Parsers as language models for statistical machine translation. In Proceed- ings of AMTA. Sylvain Raybaud, Caroline Lavecchia, ... prediction ability, we present two ex- tensions to standard n-gram language mod- els in statistical machine translation: a back- ward language model that augments the con- ventional forward language model, ... that a language model that embraces a larger context provides better pre- diction ability, we learn additional information from training data to enhance conventional n-gram lan- guage models and...
  • 10
  • 415
  • 0

Xem thêm