| Download | - View final version: Bilingual Sense Similarity for Statistical Machine Translation (PDF, 614 KiB)
|
|---|
| Author | Search for: Chen, Boxing1; Search for: Foster, George1; Search for: Kuhn, Roland1 |
|---|
| Affiliation | - National Research Council Canada. NRC Institute for Information Technology
|
|---|
| Format | Text, Article |
|---|
| Conference | 48th Annual Meeting of the Association for Computational Linguistics, July 11–16, 2010, Uppsala, Sweden |
|---|
| Subject | Information and Communications Technologies |
|---|
| Abstract | This paper proposes new algorithms to compute the sense similarity between two units (words, phrases, rules, etc.) from parallel corpora. The sense similarity scores are computed by using the vector space model. We then apply the algorithms to statistical machine translation by computing the sense similarity between the source and target side of translation rule pairs. Similarity scores are used as additional features of the translation model to improve translation performance. Significant improvements are obtained over a state-of-the-art hierarchical phrase-based machine translation system. |
|---|
| Publication date | 2010 |
|---|
| Access condition | |
|---|
| In | |
|---|
| Language | English |
|---|
| Peer reviewed | Yes |
|---|
| NPARC number | 15736685 |
|---|
| Export citation | Export as RIS |
|---|
| Report a correction | Report a correction (opens in a new tab) |
|---|
| Record identifier | c5a3a3fe-7d31-4f97-9d07-8dda09a155b9 |
|---|
| Record created | 2010-07-05 |
|---|
| Record modified | 2020-05-27 |
|---|