Go back to tatoeba.org.
This article explains how to request a new language to be added in Tatoeba. This is about languages of the sentences in the corpus. For languages of the interface, please refer to the article about the interface translation.
How to request a language
Step 1: Identify your language in the ISO 693-3 standard
Search for your language in the ISO 639-3 list of languages.
The language must have a 3-letter code in the column "639-3" (the first column). If not, then it is not valid to us.
The language must be defined as an individual language. It must not be not a macrolanguage.
Please understand that if your language is not recognized in the ISO 639-3 standard, we cannot support it in Tatoeba. Language classification is a complex task and it is not part of Tatoeba's mission. We rely on the ISO 639-3 standard to define what is a valid language.
There are some exceptions due to legacy reasons, but we will not make more exceptions.
If your language is missing in this standard, please contact the ISO 639-3 Registration Authority from their website: https://iso639-3.sil.org/.
Step 2: Create a list with sentences in your language
Warning: If you are new to Tatoeba, be aware that before you can add any sentence at all, you need add at least one language in your profile. You don't have to mention all the languages you know, just one is enough. Ideally, pick your strongest language among the ones that are available.
Create a list with the name of the language in English and the ISO code in parenthesis. For example:
You can add new sentences directly from the list or you can find a few sentences to translate into your language.
- When you add your translations, choose the option "Auto detect" in the language dropdown.
- After the sentence/translation is added, if the language was not identified as "unknown" (you will see an icon with a question mark if it is), then click on the language icon and select "other language".
If you have trouble understanding how to use lists, don't hesitate to ask for help on the Wall.
Step 3: Find an image which we can use to create the language icon
Just like members of Tatoeba have profile pictures, languages in Tatoeba also have their "profile picture" (language icon). We will create an icon for your language based on the image you will send us.
First of all, do not feel pressured to find an image. If you are unable to find something good, we can always use our default language icon template: the icon will have the 3-letter ISO code written in black on a white background.
If you wish to find a more colorful icon to represent your language, please check our policies regarding language icons.
Step 4: Submit your request
Contact the language team via private message or via email at firstname.lastname@example.org. Make sure you provide the following information:
- The link to the list that you created.
- The image that you suggest we use for the language icon (if you have found any), or a link to it if you upload it to an image-hosting site.
- You can craft yourself a 30x20 pixel icon but make sure you link the original image. It is possible we decide to readjust a few things from your icon and need the original image to achieve these adjustments.
Feel free to provide any other information that you think is relevant.
How to follow the progress of your request
Once you have submitted your request, here are the remaining steps before you can see your language available in Tatoeba.
Issue created in GitHub
After sending your request, you will have to wait for it to be reported in GitHub, where we officially track all the language requests.
Before adding your request to GitHub, we will check the following:
- The sentences you have added in the list are indeed written in the language you request to add.
- The image you have suggested for the language icon is appropriate.
This should take no longer than two weeks. If your request is not valid, we will let you know within two weeks. If your request looks valid to us, we will send you the link to the GitHub issue within two weeks.
Pull request opened
After the issue has been created, the next step is to create a pull request. This is a request to change the code source of Tatoeba, to include the new language.
This is not something you have to do. Ricardo14 is in charge of creating pull requests for languages.
This should take no longer than two weeks as well. Once the pull request is created, you can see it in the pull requests page on GitHub.
Issue assigned to milestone
If there is controversy around the language you wish to add, you may need to wait until we come to an agreement or to a compromise. This can stall the addition of your language for several weeks.
If nothing is wrong, your language request will be scheduled for to be deployed as soon as possible.
You will have an idea on which date your language will be available on Tatoeba by checking the milestone of the GitHub issue (displayed in the right-side column). For instance if the milestone says
2018-10-23, it means the language is scheduled for October 23rd, 2018.
Pull request merged
Merging a pull request means that we integrate the changes into the main codebase. As long as the pull request is not merged, the source code doesn't contain yet the necessary modifications to support your language.
A pull request will usually be merged shortly after it is assigned to a milestone. However, if we discover last minute issues, we may delay the merge and as a result, we may move the language request to a later milestone.
This is the last step. Your language will be available on Tatoeba once the website is updated.
You can check the milestones on GitHub to see when the next Tatoeba update is scheduled, if there's any scheduled yet.
This article was last updated on April 14, 2019.