|
DSpace at Cochin University >
Department of Computer Science >
Faculty >
G. Santhosh Kumar >
Publications >
International Conferences >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/984
|
| Title: | Alignment Model and Training Technique in SMT from English to Malayalam |
| Authors: | Sebastian, Mary Priya Kurian, Sheena Kumar, G Santhosh |
| Keywords: | Parallel Corpus PoS Tagging SMT statistical machine translation malayalam english |
| Issue Date: | 30-Aug-2010 |
| Abstract: | This paper investigates certain methods of training adopted in the Statistical Machine Translator (SMT) from English to Malayalam. In English Malayalam SMT, the word to word translation is determined by training the
parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments of all sentence pairs present in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of the parts of speech tagger has brought around better
training results with improved accuracy. |
| URI: | http://hdl.handle.net/123456789/984 |
| Appears in Collections: | International Conferences
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|