| Download | - View final version: Gᵢ2Pᵢ: rule-based, index-preserving grapheme-to-phoneme transformations (PDF, 494 KiB)
|
|---|
| Author | Search for: Pine, Aidan1; Search for: Littell, Patrick1; Search for: Joanis, Eric1; Search for: Huggins-Daines, David; Search for: Cox, Christopher; Search for: Davis, Fineen; Search for: Santos, Eddie Antonio; Search for: Srikanth, Shankhalika; Search for: Torkornoo, Delasie; Search for: Yu, Sabrina |
|---|
| Affiliation | - National Research Council Canada. Digital Technologies
|
|---|
| Format | Text, Article |
|---|
| Conference | Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages, June 26-27, 2022, |
|---|
| Abstract | This paper describes the motivation and implementation details for a rule-based, index preserving grapheme-to-phoneme engine ‘Gᵢ2Pᵢ’ implemented in pure Python and released under the open source MIT license8 . The engine and interface have been designed to prioritize the developer experience of potential contributors without requiring a high level of programming knowledge. Gᵢ2Pᵢ already provides mappings for 30 (mostly Indigenous) languages, and the package is accompanied by a web-based interactive development environment, a RESTful API, and extensive documentation to encourage the addition of more mappings in the future. We also present three downstream applications of Gᵢ2Pᵢ and show results of a preliminary evaluation. |
|---|
| Publication date | 2022-05 |
|---|
| Publisher | Association for Computational Linguistics |
|---|
| Licence | |
|---|
| In | |
|---|
| Language | English |
|---|
| Peer reviewed | Yes |
|---|
| Export citation | Export as RIS |
|---|
| Report a correction | Report a correction (opens in a new tab) |
|---|
| Record identifier | de4b961d-54bf-4187-a3fc-d875ac285e79 |
|---|
| Record created | 2022-07-11 |
|---|
| Record modified | 2022-07-12 |
|---|