Version at: 05/03/2020, 09:55 vs. version at: 05/03/2020, 10:25
11<a href="http://tatoeba.org/">Go back to tatoeba.org</a>
22
33# FAQ
44
55## What should I do if I have trouble logging in?
66
77Try clicking the "Remember Me" checkbox before you type in your username and password. If you still have problems logging in when you do that, try clearing your cache and cookies. Here are some guides:
88
99* [Google Chrome](https://support.google.com/chrome/answer/95582)
1010* [Mozilla Firefox](https://support.mozilla.org/en-US/kb/how-clear-firefox-cache)
1111* [Safari](http://support.apple.com/kb/ph11920)
1212
1313If you still have problems, send an email to Team Tatoeba (team@tatoeba.org) telling us your browser and other details that might be important.
1414
1515
1616## Why are some translations in grey?
1717
1818Grey translations are indirect translations. In other words, they are translations of the translations, and not translations of the main sentence (the main sentence is the sentence in big letters).
1919
2020We display them because they can be useful, but you should be careful. Their meaning may differ a little from the main sentence.
2121
22
23## When a sentence has several possible translations, should I translate it with all the possibilities?
24
25It's really up to you.
26
27If you think one translation is a lot more likely than the other, you can add only that translation. If you think it makes sense to add all the translations, then you can add all the possible translations.
28
29If you think all the translations make sense but it gets boring to add them all every time (and surely you will), then you can just choose randomly, or based on your personal preference, which translation to add.
30
31We don't need to have *every* sentence translated in *every* possible ways. What is important to us is that the corpus *in its whole* carries all the possible ways.
32
33For example, we have English sentences with the word "you" and this word can be translated in different ways in other languages. For the sake of simplicity, we'll assume two forms: singular and plural. In this case, we just want to make sure that at least one sentence is translated with the singular form and at least one sentence is translated with the plural form. So if you notice no one translated with the singular form, then you can add it (and vice versa for the plural form). If you feel that one form is under-represented compared to the other, then you can favor translations with the form that is under-represented so that we have a good balance between the two.
2234
2335## Why do I not see all the translations I expect to see?
2436
2537If you have listed a series of language codes in your settings, Tatoeba will only display translations in the languages you indicated. Leave the field empty to display translations in all languages.
2638
2739
2840## Why are some sentences in red?
2941
3042Red sentences are not approved. They raise copyright issues or are otherwise problematic.
3143
3244You should not translate them.
3345
3446
3547## A sentence is not marked with the right language. How do I fix it?
3648
3749Click on the language icon to the right of the sentence and select the correct language from the list.
3850
3951
4052## How can I add tags to a sentence?
4153
4254To add tags, you must be an advanced contributor.
4355
4456=> [See article: advanced-contributors](http://en.wiki.tatoeba.org/articles/show/advanced-contributors)
4557
4658
4759## How can I request a new language?
4860
4961=> [See article: new-language-request](http://en.wiki.tatoeba.org/articles/show/new-language-request)
5062
5163
5264## When contributing in Chinese, should I use simplified or traditional characters?
5365
5466You can use whichever you like. We have a tool that will automatically convert simplified into traditional, and traditional into simplified.
5567
5668When browsing sentences, if you set the Chinese sentence as the main sentence, you will see an additional icon at the top of the sentence.
5769
5870* ![traditional](http://tatoeba.org/img/traditional_chinese.png) traditional
5971* ![simplified](http://tatoeba.org/img/simplified_chinese.png) simplified
6072
6173Below each Chinese sentence, you will also see the transcription in pinyin, and below the pinyin, the conversion into simplified or traditional.
6274
6375You can browse the Chinese sentences to see what they look like.
6476
6577
6678## How do I delete my account?
6779
6880=> [See article: delete-account](http://en.wiki.tatoeba.org/articles/show/delete-account)
6981
7082
7183## Does Tatoeba provide an API?
7284
7385No, it does not (yet).
7486
7587We unfortunately do not have the proper infrastructure to host a public API. Nonetheless, please do not hesitate to [contact us](https://tatoeba.org/eng/contact) to let us know that you would be interested in this.
7688
7789With more and more people asking us, we will eventually start something and we will be happy to hear more details about the needs of your application/project.
7890
7991Meanwhile, what you can do is download our sentences from the [Downloads](https://tatoeba.org/eng/downloads) page, then build your own API from there.
8092
8193## I would like to use Tatoeba's data for my project, how do I give proper attribution?
8294
8395**For the textual data**
8496
8597Basically you just need to write somewhere that some/all of your sentences are from Tatoeba, with a link to [https://tatoeba.org](https://tatoeba.org), and mention that Tatoeba's data is released under [CC-BY 2.0 FR](https://creativecommons.org/licenses/by/2.0/fr/).
8698
8799Here's an example of good attribution:
88100[https://www.clozemaster.com/about#where-are-the-sentences-from](https://www.clozemaster.com/about#where-are-the-sentences-from)
89101
90102**For the audio data**
91103
92104Our audio corpus has a wider range of licenses and isn't just restricted to CC-BY. You should therefore be more careful about which audio you are using, especially if your project/app is commercial.
93105
94106You can check the license of each audio recording from the file we release under "Sentences with audio" on our [Downloads](https://tatoeba.org/eng/downloads) page.
95107
96108We recommend that you mention the username of each member whose audio you are reusing, as well as the license they chose.
97109
98110Here's an example of attribution:
99111
100112 All the audio comes from Tatoeba (https://tatoeba.org), more specifically from the following members of Tatoeba:
101113 - userA (license: CC-BY-SA)
102114 - userB (license: CC-BY-NC)
103115 - userC (license: CC-BY)
104116
105117## Where can I download Tatoeba's audio data?
106118
107119Currently the only way you can download audio is by fetching each audio file one by one. We don't have one big ZIP file that contains all our audio.
108120
109121We only have one ZIP file for English [tatoeba_audio_eng.zip](https://downloads.tatoeba.org/audio/tatoeba_audio_eng.zip) 3.8 GB which was generated back in November 2017, upon request from the Common Voice project who wanted to mention our data on their [Datasets](https://voice.mozilla.org/en/data) page.
110122
111123If you'd like something more up-to-date or in other languages, you would have to do some scripting, using the files on our [Downloads](https://tatoeba.org/eng/downloads) page, more specifically under the sections:
112124
113125- "Sentences with audio": to have the ID's of all the sentences that have audio
114126- "Sentences": to know what is the language of each sentence
115127
116128Once you have the language and the ID, the URL to download the audio file is:
117129`https://audio.tatoeba.org/sentences/{lang}/{id}.mp3`
118130
119131For instance:
120132[https://audio.tatoeba.org/sentences/eng/7347611.mp3](https://audio.tatoeba.org/sentences/eng/7347611.mp3)
121133
122134Note: if you are going to use this data in one of your projects/apps, please be mindful about the license!
123135
124136
125137## How can I download all sentences and translations in specific languages?
126138
127139We don't provide yet such a feature. You will need to write your own script and process the "Sentences" and "Links" files that can be found on our [Downloads](https://tatoeba.org/eng/downloads) page.
128140
129141If you are looking for sentences and translations from/to English, you may find what you need on this page: [http://www.manythings.org/anki/](http://www.manythings.org/anki/)
130142
131143Otherwise, some people may have already written scripts that do what you want to do. Here are some examples of Google searches you can do to find these scripts:
132144
133145- [script tatoeba csv](https://www.google.com/search?q=script+tatoeba+csv)
134146- [script extract tatoeba translations](https://www.google.com/search?q=script+extract+tatoeba+translations)
135147
136148You can also check our [related Google Group thread](https://groups.google.com/d/topic/tatoebaproject/WIGoAakTvKE/discussion). If you have written a script that you want to share, feel free to post a reply to this thread.
137149
138150
139151---
140152
141153### Related Articles
142154
143155[Quick Start Guide](quick-start)
144156[Rules and Guidelines](guidelines)
145157
146158---
147159
148160<a href="http://tatoeba.org/">Go back to tatoeba.org</a>
diff view generated by jsdifflib

Version at: 05/03/2020, 09:55

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

# FAQ

## What should I do if I have trouble logging in?

Try clicking the "Remember Me" checkbox before you type in your username and password. If you still have problems logging in when you do that, try clearing your cache and cookies. Here are some guides:

* [Google Chrome](https://support.google.com/chrome/answer/95582)
* [Mozilla Firefox](https://support.mozilla.org/en-US/kb/how-clear-firefox-cache)
* [Safari](http://support.apple.com/kb/ph11920)

If you still have problems, send an email to Team Tatoeba (team@tatoeba.org) telling us your browser and other details that might be important.


## Why are some translations in grey?

Grey translations are indirect translations. In other words, they are translations of the translations, and not translations of the main sentence (the main sentence is the sentence in big letters).

We display them because they can be useful, but you should be careful. Their meaning may differ a little from the main sentence.


## Why do I not see all the translations I expect to see?

If you have listed a series of language codes in your settings, Tatoeba will only display translations in the languages you indicated. Leave the field empty to display translations in all languages.


## Why are some sentences in red?

Red sentences are not approved. They raise copyright issues or are otherwise problematic.

You should not translate them.


## A sentence is not marked with the right language. How do I fix it?

Click on the language icon to the right of the sentence and select the correct language from the list.


## How can I add tags to a sentence?

To add tags, you must be an advanced contributor.

=>  [See article: advanced-contributors](http://en.wiki.tatoeba.org/articles/show/advanced-contributors)


## How can I request a new language?

=> [See article: new-language-request](http://en.wiki.tatoeba.org/articles/show/new-language-request)


## When contributing in Chinese, should I use simplified or traditional characters?

You can use whichever you like. We have a tool that will automatically convert simplified into traditional, and traditional into simplified.

When browsing sentences, if you set the Chinese sentence as the main sentence, you will see an additional icon at the top of the sentence.

* ![traditional](http://tatoeba.org/img/traditional_chinese.png) traditional
* ![simplified](http://tatoeba.org/img/simplified_chinese.png) simplified

Below each Chinese sentence, you will also see the transcription in pinyin, and below the pinyin, the conversion into simplified or traditional.

You can browse the Chinese sentences to see what they look like.


## How do I delete my account?

=> [See article: delete-account](http://en.wiki.tatoeba.org/articles/show/delete-account)


## Does Tatoeba provide an API?

No, it does not (yet).

We unfortunately do not have the proper infrastructure to host a public API. Nonetheless, please do not hesitate to [contact us](https://tatoeba.org/eng/contact) to let us know that you would be interested in this. 

With more and more people asking us, we will eventually start something and we will be happy to hear more details about the needs of your application/project.

Meanwhile, what you can do is download our sentences from the [Downloads](https://tatoeba.org/eng/downloads) page, then build your own API from there.

## I would like to use Tatoeba's data for my project, how do I give proper attribution?

**For the textual data**

Basically you just need to write somewhere that some/all of your sentences are from Tatoeba, with a link to [https://tatoeba.org](https://tatoeba.org), and mention that Tatoeba's data is released under [CC-BY 2.0 FR](https://creativecommons.org/licenses/by/2.0/fr/).

Here's an example of good attribution:
[https://www.clozemaster.com/about#where-are-the-sentences-from](https://www.clozemaster.com/about#where-are-the-sentences-from)

**For the audio data**

Our audio corpus has a wider range of licenses and isn't just restricted to CC-BY. You should therefore be more careful about which audio you are using, especially if your project/app is commercial.

You can check the license of each audio recording from the file we release under "Sentences with audio" on our [Downloads](https://tatoeba.org/eng/downloads) page.

We recommend that you mention the username of each member whose audio you are reusing, as well as the license they chose.

Here's an example of attribution:

    All the audio comes from Tatoeba (https://tatoeba.org), more specifically from the following members of Tatoeba:
      - userA (license: CC-BY-SA)
      - userB (license: CC-BY-NC)
      - userC (license: CC-BY)

## Where can I download Tatoeba's audio data?

Currently the only way you can download audio is by fetching each audio file one by one. We don't have one big ZIP file that contains all our audio.

We only have one ZIP file for English [tatoeba_audio_eng.zip](https://downloads.tatoeba.org/audio/tatoeba_audio_eng.zip) 3.8 GB which was generated back in November 2017, upon request from the Common Voice project who wanted to mention our data on their [Datasets](https://voice.mozilla.org/en/data) page.

If you'd like something more up-to-date or in other languages, you would have to do some scripting, using the files on our [Downloads](https://tatoeba.org/eng/downloads) page, more specifically under the sections:

- "Sentences with audio": to have the ID's of all the sentences that have audio
- "Sentences": to know what is the language of each sentence

Once you have the language and the ID, the URL to download the audio file is:
`https://audio.tatoeba.org/sentences/{lang}/{id}.mp3`

For instance:
[https://audio.tatoeba.org/sentences/eng/7347611.mp3](https://audio.tatoeba.org/sentences/eng/7347611.mp3)

Note: if you are going to use this data in one of your projects/apps, please be mindful about the license!


## How can I download all sentences and translations in specific languages?

We don't provide yet such a feature. You will need to write your own script and process the "Sentences" and "Links" files that can be found on our [Downloads](https://tatoeba.org/eng/downloads) page.

If you are looking for sentences and translations from/to English, you may find what you need on this page: [http://www.manythings.org/anki/](http://www.manythings.org/anki/)

Otherwise, some people may have already written scripts that do what you want to do. Here are some examples of Google searches you can do to find these scripts:

- [script tatoeba csv](https://www.google.com/search?q=script+tatoeba+csv)
- [script extract tatoeba translations](https://www.google.com/search?q=script+extract+tatoeba+translations)

You can also check our [related Google Group thread](https://groups.google.com/d/topic/tatoebaproject/WIGoAakTvKE/discussion). If you have written a script that you want to share, feel free to post a reply to this thread.


---

### Related Articles

[Quick Start Guide](quick-start)  
[Rules and Guidelines](guidelines) 

---

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

version at: 05/03/2020, 10:25

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

# FAQ

## What should I do if I have trouble logging in?

Try clicking the "Remember Me" checkbox before you type in your username and password. If you still have problems logging in when you do that, try clearing your cache and cookies. Here are some guides:

* [Google Chrome](https://support.google.com/chrome/answer/95582)
* [Mozilla Firefox](https://support.mozilla.org/en-US/kb/how-clear-firefox-cache)
* [Safari](http://support.apple.com/kb/ph11920)

If you still have problems, send an email to Team Tatoeba (team@tatoeba.org) telling us your browser and other details that might be important.


## Why are some translations in grey?

Grey translations are indirect translations. In other words, they are translations of the translations, and not translations of the main sentence (the main sentence is the sentence in big letters).

We display them because they can be useful, but you should be careful. Their meaning may differ a little from the main sentence.


## When a sentence has several possible translations, should I translate it with all the possibilities?

It's really up to you.

If you think one translation is a lot more likely than the other, you can add only that translation. If you think it makes sense to add all the translations, then you can add all the possible translations. 

If you think all the translations make sense but it gets boring to add them all every time (and surely you will), then you can just choose randomly, or based on your personal preference, which translation to add.

We don't need to have *every* sentence translated in *every* possible ways. What is important to us is that the corpus *in its whole* carries all the possible ways. 

For example, we have English sentences with the word "you" and this word can be translated in different ways in other languages. For the sake of simplicity, we'll assume two forms: singular and plural. In this case, we just want to make sure that at least one sentence is translated with the singular form and at least one sentence is translated with the plural form. So if you notice no one translated with the singular form, then you can add it (and vice versa for the plural form). If you feel that one form is under-represented compared to the other, then you can favor translations with the form that is under-represented so that we have a good balance between the two.

## Why do I not see all the translations I expect to see?

If you have listed a series of language codes in your settings, Tatoeba will only display translations in the languages you indicated. Leave the field empty to display translations in all languages.


## Why are some sentences in red?

Red sentences are not approved. They raise copyright issues or are otherwise problematic.

You should not translate them.


## A sentence is not marked with the right language. How do I fix it?

Click on the language icon to the right of the sentence and select the correct language from the list.


## How can I add tags to a sentence?

To add tags, you must be an advanced contributor.

=>  [See article: advanced-contributors](http://en.wiki.tatoeba.org/articles/show/advanced-contributors)


## How can I request a new language?

=> [See article: new-language-request](http://en.wiki.tatoeba.org/articles/show/new-language-request)


## When contributing in Chinese, should I use simplified or traditional characters?

You can use whichever you like. We have a tool that will automatically convert simplified into traditional, and traditional into simplified.

When browsing sentences, if you set the Chinese sentence as the main sentence, you will see an additional icon at the top of the sentence.

* ![traditional](http://tatoeba.org/img/traditional_chinese.png) traditional
* ![simplified](http://tatoeba.org/img/simplified_chinese.png) simplified

Below each Chinese sentence, you will also see the transcription in pinyin, and below the pinyin, the conversion into simplified or traditional.

You can browse the Chinese sentences to see what they look like.


## How do I delete my account?

=> [See article: delete-account](http://en.wiki.tatoeba.org/articles/show/delete-account)


## Does Tatoeba provide an API?

No, it does not (yet).

We unfortunately do not have the proper infrastructure to host a public API. Nonetheless, please do not hesitate to [contact us](https://tatoeba.org/eng/contact) to let us know that you would be interested in this. 

With more and more people asking us, we will eventually start something and we will be happy to hear more details about the needs of your application/project.

Meanwhile, what you can do is download our sentences from the [Downloads](https://tatoeba.org/eng/downloads) page, then build your own API from there.

## I would like to use Tatoeba's data for my project, how do I give proper attribution?

**For the textual data**

Basically you just need to write somewhere that some/all of your sentences are from Tatoeba, with a link to [https://tatoeba.org](https://tatoeba.org), and mention that Tatoeba's data is released under [CC-BY 2.0 FR](https://creativecommons.org/licenses/by/2.0/fr/).

Here's an example of good attribution:
[https://www.clozemaster.com/about#where-are-the-sentences-from](https://www.clozemaster.com/about#where-are-the-sentences-from)

**For the audio data**

Our audio corpus has a wider range of licenses and isn't just restricted to CC-BY. You should therefore be more careful about which audio you are using, especially if your project/app is commercial.

You can check the license of each audio recording from the file we release under "Sentences with audio" on our [Downloads](https://tatoeba.org/eng/downloads) page.

We recommend that you mention the username of each member whose audio you are reusing, as well as the license they chose.

Here's an example of attribution:

    All the audio comes from Tatoeba (https://tatoeba.org), more specifically from the following members of Tatoeba:
      - userA (license: CC-BY-SA)
      - userB (license: CC-BY-NC)
      - userC (license: CC-BY)

## Where can I download Tatoeba's audio data?

Currently the only way you can download audio is by fetching each audio file one by one. We don't have one big ZIP file that contains all our audio.

We only have one ZIP file for English [tatoeba_audio_eng.zip](https://downloads.tatoeba.org/audio/tatoeba_audio_eng.zip) 3.8 GB which was generated back in November 2017, upon request from the Common Voice project who wanted to mention our data on their [Datasets](https://voice.mozilla.org/en/data) page.

If you'd like something more up-to-date or in other languages, you would have to do some scripting, using the files on our [Downloads](https://tatoeba.org/eng/downloads) page, more specifically under the sections:

- "Sentences with audio": to have the ID's of all the sentences that have audio
- "Sentences": to know what is the language of each sentence

Once you have the language and the ID, the URL to download the audio file is:
`https://audio.tatoeba.org/sentences/{lang}/{id}.mp3`

For instance:
[https://audio.tatoeba.org/sentences/eng/7347611.mp3](https://audio.tatoeba.org/sentences/eng/7347611.mp3)

Note: if you are going to use this data in one of your projects/apps, please be mindful about the license!


## How can I download all sentences and translations in specific languages?

We don't provide yet such a feature. You will need to write your own script and process the "Sentences" and "Links" files that can be found on our [Downloads](https://tatoeba.org/eng/downloads) page.

If you are looking for sentences and translations from/to English, you may find what you need on this page: [http://www.manythings.org/anki/](http://www.manythings.org/anki/)

Otherwise, some people may have already written scripts that do what you want to do. Here are some examples of Google searches you can do to find these scripts:

- [script tatoeba csv](https://www.google.com/search?q=script+tatoeba+csv)
- [script extract tatoeba translations](https://www.google.com/search?q=script+extract+tatoeba+translations)

You can also check our [related Google Group thread](https://groups.google.com/d/topic/tatoebaproject/WIGoAakTvKE/discussion). If you have written a script that you want to share, feel free to post a reply to this thread.


---

### Related Articles

[Quick Start Guide](quick-start)  
[Rules and Guidelines](guidelines) 

---

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

Note

The lines in green are the lines that have been added in the new version. The lines in red are those that have been removed.