... measure shows substantial im- provement in structural disambiguation over a syntax-based approach. 1. Introduction In a large natural language processing system, such as a machine translation ... R&D Road II, Science-Based Industrial Park Hsinchu, TAIWAN 30077, R.O.C. ABSTRACT In natural language processing, ambiguity res- olution is a central issue, and can be regarded as a ... information. Hence, we will show how to annotate a syntax tree so that various interpretations can be characterized differently. Semantic Tagging A popular linguistic approach to annotate a...
... variouslanguage-pairs, one issue is that matching syn-tactic analysis can not always guarantee a goodtranslation, and violating syntactic structure doesnot always induce a bad translation. Marton andResnik ... Singapore, 2-7 August 2009.c2009 ACL and AFNLP A Syntax-Driven Bracketing Modelfor Phrase-Based TranslationDeyi Xiong, Min Zhang, Aiti Aw and Haizhou LiHuman Language TechnologyInstitute for ... Reordering Model for Statistical Machine Translation. In Proceedings ofACL-COLING 2006.Deyi Xiong, Min Zhang, Aiti Aw, and Haizhou Li.2008. Linguistically Annotated BTG for StatisticalMachine Translation....
... Communication Re-search Centre, University of Edinburgh.John Hale, Izhak Shafran, Lisa Yung, Bonnie Dorr, MaryHarper, Anna Krasnyanskaya, Matthew Lease, YangLiu, Brian Roark, Matthew Snover, and ... 1 and 2, in which the samerepair fragment is shown in a standard state such asmight be used to train aprobabilistic context freegrammar, and after the right-corner transform. Fig-ure 1 also ... modified for use in a specialrepair grammar, which not only reduces the amountof available training data, but violates our intuitionthat most reparanda are fluent up until the actual editoccurs.The...
... groups and domains can be modeled separately without accessing and adapting the language model of the MT system for each SMS application. Another advantage is that the normalization module can ... normalization as a translation problem from the SMS language to the English language1 and we propose to adapt a phrase-based statistical MT modelfor the task. Evaluation by 5-fold cross validation ... a consensus translation technique to bootstrap parallel data using off-the-shelf translation sys-tems for training a hierarchical statistical transla-tion modelfor general domain instant...
... & Takahara, J. (1999)Evidence fora potential role for HDL as an important source ofcholesterol in human adrenocortical tumors via the CLA-1 path-way. Endocr. J. 46, 27–34.63. Cherradi,N.,Bideau,M.,Arnaudeau,S.,Demaurex,N.,James,R.W., ... proliferation andinvasion. Choriocarcinoma is a malignant neoplasm thatrepresents the early trophoblast of the attachment phase oras later invasive stage [46–48]. Thus, in most cases,choriocarcinoma ... choriocarcinomacells were incubated as described above. Total RNA was isolated, andNorthern blot analysis was performed using radiolabeled SR-BIcDNA probe for each cell line (top panel) in the absence...
... bridge and New York. Weinberg, Amy, Joseph Garman. Jeffery Martin. and Paola Merlo. 1995. Principle-Based Parser for Foreign Language Training in German and Arabic. In Melissa Holland, Jonathan ... representations, as in the examples provided from machine transla- tion and foreign language tutoring applications. We are aware of no attempt in the literature to represent and access aspect on a ... of Lezical and Grammat- ical Aspect. Garland, New York. Passoneau, Rebecca. 1988. A Computational Model of the Semantics of Tense and Aspect. Compu- tational Linguistics: Special Issue...
... Ducharme, P. Vincent, and C. Jauvin. 2003. A Neural Probabilistic Language Model. Journal ofMachine Learning Research, 3:1137–1155. A. Berger, V. Della Pietra, and S. Della Pietra. 1996. A Maximum ... CategorizationResearch. Journal of Machine Learning Research,5:361–397. A. Mnih and G. Hinton. 2008. A Scalable HierarchicalDistributed Language Model. In Advances in NeuralInformation Processing ... model whose relative performance compared to N-Grammodels gets better as one incorporates richer fea-ture sets. It scales almost as well to large datasetsas standard N-Gram models: training requires...
... is substantial re-search focusing on syntactic tree alignment model for machine translation. For example, (Wu 1997; Alshawi, Bangalore, and Douglas, 2000; Yamada and Knight, 2001) have studied ... for Machine Translation in the Americas. Munteanu D. S, A. Fraser, and D. Marcu. D., 2002. Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora. ... three features, the maximum en-tropy model is trained on 1,000 pairs of web pages manually labeled as parallel or non-parallel. The Iterative Scaling algorithm (Pietra, Pietra and Lafferty...
... measure of relatedness does (lowy values for small x values and high y values for high x). The same pattern applies in the M&C and353-C data sets.4.2 Evaluation of the GVSM For the evaluation ... query, are computed similarly. A GVSM model aims at being able to retrieve documentsthat not necessarily contain exact matches of thequery terms, and this is its great advantage. Thisnew space ... Linguistics A Generalized Vector Space Modelfor Text RetrievalBased on Semantic RelatednessGeorge Tsatsaronis and Vicky PanagiotopoulouDepartment of InformaticsAthens University of Economics and Business,76,...
... ring. Assuming that the data Chordis being used to locate is cryptographically authenticated, this is a threat to availability of data rather than to authenticity. The sameapproach used above ... de-sired authentication, caching, replication, and user-friendly namingof data. Chord’s flat key space eases the implementation of thesefeatures. For example, an application could authenticate data ... mechanism also helps higher layer softwarereplicate data. A typical application using Chord might store repli-cas of the data associated with a key at the nodes succeeding thekey. The fact that...
... exponential translation modelfor targetlanguage morphology. In ACL-HLT.C. Tillmann. 2004. A unigram orientation modelfor statisticalmachine translation. In NAACL.K. Toutanova, H. Suzuki, and A. ... similar agreement phenom-ena as probabilistic sequences.Factored Translation ModelsFactored transla-tion models (Koehn and Hoang, 2007) facilitate a more data-oriented approach to agreement modeling.Words ... phrase ta-ble annotations and can be easily implementedas a feature in many phrase-based decoders.1 IntroductionLanguages vary in the degree to which surface formsreflect grammatical relations....
... information for each character.Each character can be assigned one of two possi-ble boundary tags: “B” fora character that begins a word and “I” fora character that occurs in the mid-dle of a word. ... representa-tion (Ramshaw and Marcus, 1995) and the Start/Endrepresentation (Kudo and Matsumoto, 2001) arepopular. For example, the label B-NN indicates that a character is located at the begging of a noun. ... POS information is allowed to inter-act with segmentation. Note that word segmentationcan also be formulated as a sequential classificationproblem to predict whether a character is located atthe...
... implementation is availableas the open-source splitSVM Java library.1 IntroductionOver the last decade, many natural language pro-cessing tasks are being cast as classification prob-lems. These are ... achieved by taking into account the Zipfian natureof natural language data, and structuring the compu-tation accordingly. On a sample application (replac-ing the libsvm classifier used by MaltParser ... Changingexisting Java code to accommodate our fast SVMclassifier is done by loading a different model, andchanging a single function call.4.1 Evaluation: Speeding up MaltParserWe evaluate...
... thank Emily Pitler and Samarth Ke-shava for making available the code of the RePortSalgorithm, and Stefan Bordag and Delphine Bern-hard for running their algorithms on the Germandata. Many ... presentedhere have been shown to improve accuracy (Kurimoet al., 2006).Another motivation for evaluating the system on a task rather than on manually annotated data isthat linguistically motivated morphological ... semantic and syntactic informa-tion is very attractive because it adds an additionaldimension, but these approaches have to cope withmore severe data sparseness issues than approachesthat...
... Linguistics A Hierarchical Phrase-Based Modelfor Statistical Machine TranslationDavid ChiangInstitute for Advanced Computer Studies (UMIACS)University of Maryland, College Park, MD 20742, USAdchiang@umiacs.umd.eduAbstractWe ... USAdchiang@umiacs.umd.eduAbstractWe present a statistical phrase-based transla-tion model that uses hierarchical phrases—phrases that contain subphrases. The model is formally a synchronous ... in NaturalLanguage Processing (EMNLP), pages 388–395.Shankar Kumar, Yonggang Deng, and William Byrne.2005. A weighted finite state transducer transla-tion template modelfor statistical machine...