DSpace About DSpace Software

DSpace at Cochin University >
Department of Computer Science >
Faculty >
G. Santhosh Kumar >
Publications >
International Conferences >

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/984

Title: Alignment Model and Training Technique in SMT from English to Malayalam
Authors: Sebastian, Mary Priya
Kurian, Sheena
Kumar, G Santhosh
Keywords: Parallel Corpus
PoS Tagging
statistical machine translation
Issue Date: 30-Aug-2010
Abstract: This paper investigates certain methods of training adopted in the Statistical Machine Translator (SMT) from English to Malayalam. In English Malayalam SMT, the word to word translation is determined by training the parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments of all sentence pairs present in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of the parts of speech tagger has brought around better training results with improved accuracy.
URI: http://hdl.handle.net/123456789/984
Appears in Collections:International Conferences

Files in This Item:

File Description SizeFormat
finalCRC.pdf379.74 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback