Version at: 17/05/2020, 17:28 vs. version at: 05/03/2021, 22:17

Version at: 05/03/2021, 22:17

[Go back to tatoeba.org](http://tatoeba.org/)

# FAQ

## What should I do if I have trouble logging in?

Try ticking the "Remember Me" checkbox before you type in your username and password. If you still cannot log in, try clearing your browser's cache and cookies. Here are some guides:

* [Google Chrome](https://support.google.com/chrome/answer/95582)
* [Mozilla Firefox](https://support.mozilla.org/en-US/kb/how-clear-firefox-cache)
* [Safari](http://support.apple.com/kb/ph11920)

If you still have problems, send an email to Team Tatoeba (team@tatoeba.org) telling us which browser you use and any other details that might be relevant.


## Why are some translations in grey?

Grey translations are indirect translations. In other words, they are translations of the translations, and not translations of the main sentence (the main sentence is the sentence in big letters).

We display them because they can be useful, but you should be careful. Their meaning may differ a little from the main sentence.
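
To make the distinction concrete, here is a minimal Python sketch of how direct and indirect translations could be derived from a list of translation links. It is only an illustration, not a description of how Tatoeba computes them: the function names `build_graph` and `direct_and_indirect` are hypothetical, the sentence IDs are made up, and the sketch assumes that links can be treated as symmetric.

```python
from collections import defaultdict


def build_graph(links):
    """Build an undirected graph from (sentence_id, translation_id) pairs.

    Assumption: a link between two sentences counts in both directions.
    """
    graph = defaultdict(set)
    for a, b in links:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def direct_and_indirect(graph, main_id):
    """Return (direct, indirect) translation IDs for the main sentence.

    Direct translations are linked to the main sentence itself.
    Indirect translations are linked to a direct translation, but not
    to the main sentence.
    """
    direct = set(graph.get(main_id, ()))
    indirect = set()
    for translation_id in direct:
        indirect |= graph.get(translation_id, set())
    indirect -= direct          # already shown as direct translations
    indirect.discard(main_id)   # the main sentence is not its own translation
    return direct, indirect


if __name__ == "__main__":
    # Hypothetical IDs: 1 is translated by 2, and 2 is translated by 3.
    graph = build_graph([(1, 2), (2, 3)])
    print(direct_and_indirect(graph, 1))
```

In this example, sentence 3 is only reachable from sentence 1 through sentence 2, so it would be the one shown in grey.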


## When a sentence has several possible translations, should I translate it with all the possibilities?

It's really up to you.

If you think one translation is a lot more likely than the others, you can add only that translation. If you think it makes sense to add all the translations, then you can add all the possible translations.

If you think all the translations make sense but it gets boring to add them all every time (and it surely will), then you can just choose randomly, or based on your personal preference, which translation to add.

We don't need to have *every* sentence translated in *every* possible way. What is important to us is that the corpus *as a whole* covers all the possible ways. 

For example, we have English sentences with the word "you" and this word can be translated in different ways in other languages. For the sake of simplicity, we'll assume two forms: singular and plural. In this case, we just want to make sure that at least one sentence is translated with the singular form and at least one sentence is translated with the plural form. So if you notice no one translated with the singular form, then you can add it (and vice versa for the plural form). If you feel that one form is under-represented compared to the other, then you can favor translations with the form that is under-represented so that we have a good balance between the two.

## Why do I not see all the translations I expect to see?

If you have listed a series of language codes in your settings, Tatoeba will only display translations in the languages you indicated. Leave the field empty to display translations in all languages.


## Why are some sentences in red?

Red sentences are not approved. They raise copyright issues or are otherwise problematic.

You should not translate them.


## A sentence is not marked with the right language. How do I fix it?

If you are using the old interface design, click on the language icon (usually a flag) to the left of the sentence and select the correct language from the drop-down list. If you are using the new interface design, click on the "edit" icon (pen). If you are not sure whether you are using the old or new interface, go to your "Settings" page and see whether the option next to the text "Display sentences with the new design" is checked.


## How can I add tags to a sentence?

To add tags, you must be an advanced contributor.

=>  [See article: advanced-contributors](http://en.wiki.tatoeba.org/articles/show/advanced-contributors)


## How can I request a new language?

=> [See article: new-language-request](http://en.wiki.tatoeba.org/articles/show/new-language-request)


## When contributing in Chinese, should I use simplified or traditional characters?

You can use whichever you like. We have a tool that will automatically convert simplified into traditional, and traditional into simplified.

When browsing sentences, if you set the Chinese sentence as the main sentence, you will see an additional icon at the top of the sentence.

* ![traditional](http://tatoeba.org/img/traditional_chinese.png) traditional
* ![simplified](http://tatoeba.org/img/simplified_chinese.png) simplified

Below each Chinese sentence, you will also see the transcription in pinyin, and below the pinyin, the conversion into simplified or traditional.

You can browse the Chinese sentences to see what they look like.


## How do I delete my account?

=> [See article: delete-account](http://en.wiki.tatoeba.org/articles/show/delete-account)


## Does Tatoeba provide an API?

No, it does not (yet).

We unfortunately do not have the proper infrastructure to host a public API. Nonetheless, please do not hesitate to [contact us](https://tatoeba.org/eng/contact) to let us know that you would be interested in this. 

As more and more people ask us, we will eventually start working on one, and we would be happy to hear more details about the needs of your application or project.

Meanwhile, what you can do is download our sentences from the [Downloads](https://tatoeba.org/eng/downloads) page, then build your own API from there.

## I would like to use Tatoeba's data for my project. How do I give proper attribution?

**For the textual data**

Basically you just need to write somewhere that some/all of your sentences are from Tatoeba, with a link to [https://tatoeba.org](https://tatoeba.org), and mention that Tatoeba's data is released under [CC-BY 2.0 FR](https://creativecommons.org/licenses/by/2.0/fr/).

Here's an example of good attribution:
[https://www.clozemaster.com/about#where-are-the-sentences-from](https://www.clozemaster.com/about#where-are-the-sentences-from)

**For the audio data**

Our audio corpus has a wider range of licenses and isn't just restricted to CC-BY. You should therefore be more careful about which audio you are using, especially if your project/app is commercial.

You can check the license of each audio recording from the file we release under "Sentences with audio" on our [Downloads](https://tatoeba.org/eng/downloads) page.

We recommend that you mention the username of each member whose audio you are reusing, as well as the license they chose.

Here's an example of attribution:

    All the audio comes from Tatoeba (https://tatoeba.org), more specifically from the following members of Tatoeba:
      - userA (license: CC-BY-SA)
      - userB (license: CC-BY-NC)
      - userC (license: CC-BY)
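
If you reuse many recordings, an attribution text like the one above could be generated with a short script. The following Python sketch is only an illustration under stated assumptions: it supposes that the "Sentences with audio" file is tab-separated with the sentence ID in the first column, the username in the second and the license in the third. Please check the file description on the Downloads page, since the actual layout may differ. The function name `audio_attribution` and the sample rows are hypothetical.

```python
import csv
import io


def audio_attribution(audio_file, used_ids, user_col=1, license_col=2):
    """Build an attribution text for the recordings whose sentence IDs are in used_ids.

    Assumption: audio_file yields tab-separated rows whose first column is
    the sentence ID; user_col and license_col give the other columns.
    """
    licenses_by_user = {}
    reader = csv.reader(audio_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) <= max(user_col, license_col) or row[0] not in used_ids:
            continue
        # An empty license field is reported as such rather than guessed.
        license_name = row[license_col] or "no license listed"
        licenses_by_user.setdefault(row[user_col], set()).add(license_name)

    lines = [
        "All the audio comes from Tatoeba (https://tatoeba.org), "
        "more specifically from the following members of Tatoeba:"
    ]
    for user in sorted(licenses_by_user):
        licenses = ", ".join(sorted(licenses_by_user[user]))
        lines.append(f"  - {user} (license: {licenses})")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up rows in the assumed layout: id, username, license, attribution URL.
    sample = io.StringIO(
        "1\tuserA\tCC-BY-SA\t\n"
        "2\tuserB\tCC-BY-NC\t\n"
        "3\tuserA\tCC-BY-SA\t\n"
    )
    print(audio_attribution(sample, {"1", "3"}))
```

Only the members whose recordings you actually use are listed, and each member appears once even if you reuse several of their recordings.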

## Where can I download Tatoeba's audio data?

Currently the only way you can download audio is by fetching each audio file one by one. We don't have one big ZIP file that contains all our audio.

We only have one ZIP file for English, namely [tatoeba_audio_eng.zip](https://downloads.tatoeba.org/audio/tatoeba_audio_eng.zip), a 3.8 GB file generated in November 2017, upon request from the Common Voice project, which wanted to mention our data on their [Datasets](https://voice.mozilla.org/en/data) page.

If you'd like something more up-to-date or in other languages, you would have to do some scripting, using the files on our [Downloads](https://tatoeba.org/eng/downloads) page, more specifically under the sections:

- "Sentences with audio": to get the IDs of all the sentences that have audio
- "Sentences": to know the language of each sentence

Once you have the language and the ID, the URL to download the audio file is:
`https://audio.tatoeba.org/sentences/{lang}/{id}.mp3`

For instance:
[https://audio.tatoeba.org/sentences/eng/7347611.mp3](https://audio.tatoeba.org/sentences/eng/7347611.mp3)
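
The scripting described above could look like the following minimal Python sketch, which combines the two files to build the audio URLs. It assumes that both files are tab-separated, that the sentence ID is the first column of each, and that the language code is the second column of the "Sentences" file. Please check the file descriptions on the Downloads page, since the actual layout may differ. The function names `load_languages` and `audio_urls` are hypothetical, and the sample rows are made up apart from the sentence ID used in the example above.

```python
import csv
import io

# URL pattern given in this FAQ.
AUDIO_URL = "https://audio.tatoeba.org/sentences/{lang}/{id}.mp3"


def load_languages(sentences_file):
    """Map each sentence ID to its language code.

    Assumption: tab-separated rows of the form id, language, text.
    """
    reader = csv.reader(sentences_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    return {row[0]: row[1] for row in reader if len(row) >= 2}


def audio_urls(sentences_file, audio_file):
    """Return one audio URL per sentence listed in the audio file.

    Assumption: the first column of audio_file is the sentence ID.
    Sentences with several recordings are only listed once.
    """
    languages = load_languages(sentences_file)
    reader = csv.reader(audio_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    urls = []
    seen = set()
    for row in reader:
        if not row:
            continue
        sentence_id = row[0]
        if sentence_id in languages and sentence_id not in seen:
            seen.add(sentence_id)
            urls.append(AUDIO_URL.format(lang=languages[sentence_id], id=sentence_id))
    return urls


if __name__ == "__main__":
    # Made-up rows standing in for the two downloaded files.
    sentences = io.StringIO("7347611\teng\tAn example sentence.\n42\tfra\tUne phrase.\n")
    audio = io.StringIO("7347611\tuserA\tCC-BY\t\n")
    for url in audio_urls(sentences, audio):
        print(url)
```

With real data, you would open the downloaded files instead of using `io.StringIO`, then fetch each URL one by one, ideally with a pause between requests.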

Note: if you are going to use this data in one of your projects/apps, please be mindful of the license!


## How can I download all sentences and translations in specific languages?

From the [Downloads](https://tatoeba.org/eng/downloads) page, you can download all sentences in all languages, or all sentences in a specific language. You can also download the translation links for all sentences, but this will probably only be of use to you if you are comfortable writing code.
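
As an illustration of what such code might look like, here is a minimal Python sketch that combines the sentences file and the translation links file to list the translation pairs between two languages. It assumes that the sentences file has tab-separated rows of the form ID, language code, text, and that the links file has tab-separated rows of the form sentence ID, translation ID. Please check the file descriptions on the Downloads page, since the actual layout may differ. The function names `read_sentences` and `translation_pairs` are hypothetical, and the sample data is made up.

```python
import csv
import io


def read_sentences(sentences_file, wanted_langs):
    """Map sentence ID to (language, text), keeping only the wanted languages.

    Assumption: tab-separated rows of the form id, language, text.
    """
    reader = csv.reader(sentences_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    return {
        row[0]: (row[1], row[2])
        for row in reader
        if len(row) >= 3 and row[1] in wanted_langs
    }


def translation_pairs(sentences_file, links_file, source_lang, target_lang):
    """Return (source text, target text) pairs for the two given languages.

    Assumption: tab-separated link rows of the form sentence id, translation id.
    """
    sentences = read_sentences(sentences_file, {source_lang, target_lang})
    pairs = []
    for row in csv.reader(links_file, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(row) < 2:
            continue
        source = sentences.get(row[0])
        target = sentences.get(row[1])
        # Keep the link only if it goes from the source to the target language.
        if source and target and source[0] == source_lang and target[0] == target_lang:
            pairs.append((source[1], target[1]))
    return pairs


if __name__ == "__main__":
    # Made-up rows standing in for the two downloaded files.
    sentences = io.StringIO("1\teng\tHello.\n2\tfra\tBonjour.\n3\tdeu\tHallo.\n")
    links = io.StringIO("1\t2\n2\t1\n1\t3\n3\t1\n")
    for source_text, target_text in translation_pairs(sentences, links, "eng", "fra"):
        print(source_text, "->", target_text)
```

The sketch keeps only the sentences in the two requested languages in memory, which matters because the full sentences file is large.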

At the site [ManyThings.org](http://www.manythings.org/anki/), you will find sets of English sentences and their translations into other languages compiled by a Tatoeba member.

If you are comfortable with modifying Python code, you can use the [Tatoeba Playground](https://mybinder.org/v2/gh/agrodet/Tatoeba-playground/master) project on GitHub to obtain sentences and translations satisfying the conditions you specify.

You may also be able to find scripts that already do what you want. Here are some examples of Google searches you can do to find these scripts:

- [script tatoeba csv](https://www.google.com/search?q=script+tatoeba+csv)
- [script extract tatoeba translations](https://www.google.com/search?q=script+extract+tatoeba+translations)

You can also check our [related Google Group thread](https://groups.google.com/d/topic/tatoebaproject/WIGoAakTvKE/discussion). If you have written a script that you want to share, feel free to post a reply to this thread.


---

### Related Articles

[Quick Start Guide](quick-start)  
[Rules and Guidelines](guidelines) 

