Version at: 01/09/2020, 13:11 vs. version at: 01/09/2020, 14:13
11# Language exceptions
22
33Language classification is a complex task and it is not part of Tatoeba's mission. We rely on the [ISO 639-3 standard](https://iso639-3.sil.org/) to define what is a valid language. In other words, if a language is not defined in this standard, we do not support it in Tatoeba.
44
55There are however some exceptions. This article lists these exceptions and explains the reasons behind it.
66
77## How our language request requirements evolved
88
99To understand why we have exceptions, it is important to understand how Tatoeba's requirement evolved.
1010
1111At the very beginning of Tatoeba (that is in 2006), languages were associated to a 2-letter code. The codes were not explicitly following any standards, they were just chosen based on common sense.
1212
1313Our decision to follow the ISO 639-3 standard started in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/f2f136ddae2c822c05c47a1b8a4523bfc4424518). This was motivated by the desire to [support Shanghainese in Tatoeba](https://github.com/Tatoeba/tatoeba2/commit/278f7c101430ffd840f94b0ff55ad14c78683480). We realized we needed a linguistic framework to categorize languages and we chose to rely on the ISO 639-3 standard.
1414
1515Our decision to follow more strictly this standard was made official on [February 2011](https://github.com/Tatoeba/tatoeba2/commit/dc9d8981cfea5e7c68d1d9b161d937d0d3616463#diff-37e8f945736d0858f122ad4a31197068). We started to notice users adding sentences in unknown/new constructed languages and realized that we could not support all the constructed languages that people may come up with. We added the following warning in our instructions for language requests:
1616
1717 IMPORTANT: We cannot add your language if it does not have an ISO 639-3 code. At this point we already have a lot of languages to deal with, and it's a bit too complicated to deal with languages that are not "officially" recognized.
1818
1919On more recent events, we decided to restrict language requests to individual languages. This was made official in [October 2018](https://en.wiki.tatoeba.org/history/show-diff-between/18/2206/2223). We realized that Tatoeba does not properly take into account [macrolanguages](https://github.com/Tatoeba/tatoeba2/issues/1673) and until we come up with a technical solution for this issue, we will avoid adding more macrolanguages.
2020
2121## Why do we have exceptions?
2222
2323We do not consider the ISO 639-3 standard to be the holy grail of language categorization. It is only a tool that helps us make decisions on how we organize the data collected through Tatoeba, but Tatoeba has its own history and we take this history into account more than anything else.
2424
2525Whenever we introduce changes in our rules, the new rules applies to new language requests, not to existing languages.
2626
2727We will not make any change on already supported languages unless there has been a discussion and an agreement with the contributors on what to do with their language.
2828
2929If we are not facing any major issues to keep supporting a language even though it does not fit the new policies, then we will not bother removing or changing it.
3030
3131
3232## Will we make more exceptions?
3333
3434No.
3535
3636If we receive a language request that cannot fit into our current rules, we can review the rules so that we can add the language without making it an exception.
3737
3838But if we cannot agree on new rules that would support the requested language, then the language will not be supported.
3939
4040
4141## Our current exceptions
4242
4343*This list may not be exhaustive. If you notice other exceptions, please contact [Trang](https://tatoeba.org/eng/private_messages/write/TRANG).*
4444
4545### [Arabic (ara)](https://tatoeba.org/sentences/show_all_in/ara/none/none/indifferent)
4646
4747Added in [April 2009](https://github.com/Tatoeba/tatoeba2/commit/db90a4fb19af04167e907c7998b45cbe80e64002).
4848
4949Arabic is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ara). Our current requirements state that we only accept individual languages.
5050
51
5251### [Aymara (aym)](https://tatoeba.org/sentences/show_all_in/aym/none/none/indifferent)
5352
5453Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/f6c524a82b32087bad9b834952fc874d21ee5a3c).
5554
5655Aymara is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aym). Our current requirements state that we only accept individual languages.
57
5856
5957### [Azerbaijani (aze)](https://tatoeba.org/sentences/show_all_in/aze/none/none/indifferent)
6058
6159Added in [May 2013](https://github.com/Tatoeba/tatoeba2/commit/7e5f887d6ef347ad4c01756e2b8b17c7fec9d69c).
6260
6361Azerbaijani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aze). Our current requirements state that we only accept individual languages.
6462
65
6663### [Baluchi (bal)](https://tatoeba.org/sentences/show_all_in/bal/none/none/indifferent)
6764
6865Added in [July 2020](https://github.com/Tatoeba/tatoeba2/commit/a0efa1cf2863aaa154e53d16b70b47d485ce5f85).
6966
7067Baluchi is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bal). Our current requirements state that we only accept individual languages.
7168
72
7369### [Ber (ber)](https://tatoeba.org/sentences/show_all_in/ber/none/none/indifferent)
7470
7571Added in [June 2012](https://github.com/Tatoeba/tatoeba2/commit/318c4da005310de53e0b5f6fa4bc2b75da687e26).
7672
77Berber is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collective language](https://iso639-3.sil.org/code/ber) in the ISO 636-2/5 standards.
78
73Berber is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collection of languages](https://iso639-3.sil.org/code/ber) in the ISO 639-2/5 standards.
7974
8075### [Buryat (bua)](https://tatoeba.org/sentences/show_all_in/bua/none/none/indifferent)
8176
8277Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/30988eb0a88abdf3e7bde888f1371cddf925ea33).
8378
8479Buryat is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bua). Our current requirements state that we only accept individual languages.
8580
86
8781### [CycL (cycl)](https://tatoeba.org/sentences/show_all_in/cycl/none/none/indifferent)
8882
8983Added in [August 2010](https://github.com/Tatoeba/tatoeba2/commit/e0f584c66d2885941c597ce79f1a758f51b1769a).
9084
9185CycL is an exception because it is [not defined](https://iso639-3.sil.org/code/cycl) in the ISO 639-3 standard.
92
9386
9487### [Estonian (est)](https://tatoeba.org/sentences/show_all_in/est/none/none/indifferent)
9588
9689Added in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/9d1c9673ebdd18cb66f06d530b45c15f0c3362da).
9790
9891Estonian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/est). Our current requirements state that we only accept individual languages.
9992
100
10193### [Guarani (grn)](https://tatoeba.org/sentences/show_all_in/grn/none/none/indifferent)
10294
10395Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).
10496
10597Guarani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/grn). Our current requirements state that we only accept individual languages.
106
10798
10899### [Kurdish (kur)](https://tatoeba.org/sentences/show_all_in/kur/none/none/indifferent)
109100
110101Added in [April 2012](https://github.com/Tatoeba/tatoeba2/commit/618497c35d41d3d6be23677e81682094540abbbc).
111102
112103Kurdish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/kur). Our current requirements state that we only accept individual languages.
113104
114
115105### [Malagasy (mlg)](https://tatoeba.org/sentences/show_all_in/mlg/none/none/indifferent)
116106
117107Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).
118108
119109Malagasy is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mlg). Our current requirements state that we only accept individual languages.
120
121110
122111### [Mongolian (mon)](https://tatoeba.org/sentences/show_all_in/mon/none/none/indifferent)
123112
124113Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).
125114
126115Mongolian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mon). Our current requirements state that we only accept individual languages.
127116
128
129117### [Nahuatl (nah)](https://tatoeba.org/sentences/show_all_in/nah/none/none/indifferent)
130118
131119Added in [March 2014](https://github.com/Tatoeba/tatoeba2/commit/04b94c9b94f9bdd26986af42071f2e7db950db93).
132120
133Nahuatl is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collective language](https://iso639-3.sil.org/code/nah) in the ISO 636-2/5 standards.
134
121Nahuatl is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collection of languages](https://iso639-3.sil.org/code/nah) in the ISO 639-2/5 standards.
135122
136123### [Ojibwe (oji)](https://tatoeba.org/sentences/show_all_in/oji/none/none/indifferent)
137124
138125Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/6704f06f6689f42f7019bf4bc7c8dfae2d219bb2).
139126
140127Ojibwe is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/oji). Our current requirements state that we only accept individual languages.
141128
142
143129### [Odia (Oriya) (ori)](https://tatoeba.org/sentences/show_all_in/ori/none/none/indifferent)
144130
145131Added in [November 2015](https://github.com/Tatoeba/tatoeba2/commit/a6bb5fc3e430af18cd6707a82575bbd32cd817a8).
146132
147133Oriya is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ori). Our current requirements state that we only accept individual languages.
148
149134
150135### [Pashto (pus)](https://tatoeba.org/sentences/show_all_in/pus/none/none/indifferent)
151136
152137Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/2bd80e28a291f7825381b8b03908226f71bb225f).
153138
154139Pashto is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/pus). Our current requirements state that we only accept individual languages.
155140
156
157141### [Quechua (que)](https://tatoeba.org/sentences/show_all_in/que/none/none/indifferent)
158142
159143Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).
160144
161145Quechua is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/que). Our current requirements state that we only accept individual languages.
162
163146
164147### [Romani (rom)](https://tatoeba.org/sentences/show_all_in/rom/none/none/indifferent)
165148
166149Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/e99cc58727bb64f35736bbc7553da25a18caac28).
167150
168151Romani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/rom). Our current requirements state that we only accept individual languages.
169152
170
171153### [Albanian (sqi)](https://tatoeba.org/sentences/show_all_in/sqi/none/none/indifferent)
172154
173155Added in [April 2010](https://github.com/Tatoeba/tatoeba2/commit/84569292313ad82cfa354d5960aed17e35c1c5fc).
174156
175157Albanian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/sqi). Our current requirements state that we only accept individual languages.
176158
177
178159### [Sardinian (srd)](https://tatoeba.org/sentences/show_all_in/srd/none/none/indifferent)
179160
180161Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/4ebd2deac3b83788ef15d4d13aa4325c2aecedd7).
181162
182163Sardinian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/srd). Our current requirements state that we only accept individual languages.
183
184164
185165### [Toki Pona (toki)](https://tatoeba.org/sentences/show_all_in/toki/none/none/indifferent)
186166
187167Added in [November 2010](https://github.com/Tatoeba/tatoeba2/commit/243aecdbb925c478909d28805079939395cc5b55).
188168
189169Toki Pona is an exception because it is [not defined](https://iso639-3.sil.org/code/toki) in the ISO 639-3 standard.
190170
191
192171### [Uzbek (uzb)](https://tatoeba.org/sentences/show_all_in/uzb/none/none/indifferent)
193172
194173Added in [July 2010](https://github.com/Tatoeba/tatoeba2/commit/b83a564d20fa6b42a110df0a49ae5e2f7a7cc732).
195174
196175Uzbek is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/uzb). Our current requirements state that we only accept individual languages.
197176
198
199177### [Yiddish (yid)](https://tatoeba.org/sentences/show_all_in/yid/none/none/indifferent)
200178
201179Added in [May 2010](https://github.com/Tatoeba/tatoeba2/commit/8ebb4a32ac25cf4407fc4701e5753c50533d366d).
202180
203181Yiddish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/yid). Our current requirements state that we only accept individual languages.
204
205182
206183### [Zaza (zza)](https://tatoeba.org/sentences/show_all_in/zza/none/none/indifferent)
207184
208185Added in [July 2015](https://github.com/Tatoeba/tatoeba2/commit/439a46ce7e6a503d6fbb3ef6e37aebe24ecc5d4e).
209186
210187Zaza is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/zza). Our current requirements state that we only accept individual languages.
211188
212189## Related articles
213190
214191* [How to request a new language](https://en.wiki.tatoeba.org/articles/show/new-language-request)
215192
diff view generated by jsdifflib

Version at: 01/09/2020, 13:11

# Language exceptions

Language classification is a complex task and it is not part of Tatoeba's mission. We rely on the [ISO 639-3 standard](https://iso639-3.sil.org/) to define what is a valid language. In other words, if a language is not defined in this standard, we do not support it in Tatoeba.

There are however some exceptions. This article lists these exceptions and explains the reasons behind it.

## How our language request requirements evolved

To understand why we have exceptions, it is important to understand how Tatoeba's requirement evolved.

At the very beginning of Tatoeba (that is in 2006), languages were associated to a 2-letter code. The codes were not explicitly following any standards, they were just chosen based on common sense.

Our decision to follow the ISO 639-3 standard started in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/f2f136ddae2c822c05c47a1b8a4523bfc4424518). This was motivated by the desire to [support Shanghainese in Tatoeba](https://github.com/Tatoeba/tatoeba2/commit/278f7c101430ffd840f94b0ff55ad14c78683480). We realized we needed a linguistic framework to categorize languages and we chose to rely on the ISO 639-3 standard.

Our decision to follow more strictly this standard was made official on [February 2011](https://github.com/Tatoeba/tatoeba2/commit/dc9d8981cfea5e7c68d1d9b161d937d0d3616463#diff-37e8f945736d0858f122ad4a31197068). We started to notice users adding sentences in unknown/new constructed languages and realized that we could not support all the constructed languages that people may come up with. We added the following warning in our instructions for language requests:

    IMPORTANT: We cannot add your language if it does not have an ISO 639-3 code. At this point we already have a lot of languages to deal with, and it's a bit too complicated to deal with languages that are not "officially" recognized.

On more recent events, we decided to restrict language requests to individual languages. This was made official in [October 2018](https://en.wiki.tatoeba.org/history/show-diff-between/18/2206/2223). We realized that Tatoeba does not properly take into account [macrolanguages](https://github.com/Tatoeba/tatoeba2/issues/1673) and until we come up with a technical solution for this issue, we will avoid adding more macrolanguages.

## Why do we have exceptions?

We do not consider the ISO 639-3 standard to be the holy grail of language categorization. It is only a tool that helps us make decisions on how we organize the data collected through Tatoeba, but Tatoeba has its own history and we take this history into account more than anything else.

Whenever we introduce changes in our rules, the new rules applies to new language requests, not to existing languages. 

We will not make any change on already supported languages unless there has been a discussion and an agreement with the contributors on what to do with their language.

If we are not facing any major issues to keep supporting a language even though it does not fit the new policies, then we will not bother removing or changing it.


## Will we make more exceptions?

No. 

If we receive a language request that cannot fit into our current rules, we can review the rules so that we can add the language without making it an exception.

But if we cannot agree on new rules that would support the requested language, then the language will not be supported.


## Our current exceptions

*This list may not be exhaustive. If you notice other exceptions, please contact [Trang](https://tatoeba.org/eng/private_messages/write/TRANG).*

### [Arabic (ara)](https://tatoeba.org/sentences/show_all_in/ara/none/none/indifferent)    

Added in [April 2009](https://github.com/Tatoeba/tatoeba2/commit/db90a4fb19af04167e907c7998b45cbe80e64002).

Arabic is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ara). Our current requirements state that we only accept individual languages.


### [Aymara (aym)](https://tatoeba.org/sentences/show_all_in/aym/none/none/indifferent)    

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/f6c524a82b32087bad9b834952fc874d21ee5a3c).

Aymara is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aym). Our current requirements state that we only accept individual languages.


### [Azerbaijani (aze)](https://tatoeba.org/sentences/show_all_in/aze/none/none/indifferent)   

Added in [May 2013](https://github.com/Tatoeba/tatoeba2/commit/7e5f887d6ef347ad4c01756e2b8b17c7fec9d69c).

Azerbaijani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aze). Our current requirements state that we only accept individual languages.


### [Baluchi (bal)](https://tatoeba.org/sentences/show_all_in/bal/none/none/indifferent) 

Added in [July 2020](https://github.com/Tatoeba/tatoeba2/commit/a0efa1cf2863aaa154e53d16b70b47d485ce5f85).

Baluchi is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bal). Our current requirements state that we only accept individual languages.


### [Ber (ber)](https://tatoeba.org/sentences/show_all_in/ber/none/none/indifferent)   

Added in [June 2012](https://github.com/Tatoeba/tatoeba2/commit/318c4da005310de53e0b5f6fa4bc2b75da687e26).

Berber is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collective language](https://iso639-3.sil.org/code/ber) in the ISO 636-2/5 standards.


### [Buryat (bua)](https://tatoeba.org/sentences/show_all_in/bua/none/none/indifferent) 

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/30988eb0a88abdf3e7bde888f1371cddf925ea33).

Buryat is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bua). Our current requirements state that we only accept individual languages.


### [CycL (cycl)](https://tatoeba.org/sentences/show_all_in/cycl/none/none/indifferent)  

Added in [August 2010](https://github.com/Tatoeba/tatoeba2/commit/e0f584c66d2885941c597ce79f1a758f51b1769a).

CycL is an exception because it is [not defined](https://iso639-3.sil.org/code/cycl) in the ISO 639-3 standard.


### [Estonian (est)](https://tatoeba.org/sentences/show_all_in/est/none/none/indifferent) 

Added in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/9d1c9673ebdd18cb66f06d530b45c15f0c3362da).

Estonian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/est). Our current requirements state that we only accept individual languages.


### [Guarani (grn)](https://tatoeba.org/sentences/show_all_in/grn/none/none/indifferent) 

Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).

Guarani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/grn). Our current requirements state that we only accept individual languages.


### [Kurdish (kur)](https://tatoeba.org/sentences/show_all_in/kur/none/none/indifferent) 

Added in [April 2012](https://github.com/Tatoeba/tatoeba2/commit/618497c35d41d3d6be23677e81682094540abbbc).

Kurdish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/kur). Our current requirements state that we only accept individual languages.


### [Malagasy (mlg)](https://tatoeba.org/sentences/show_all_in/mlg/none/none/indifferent) 

Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).

Malagasy is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mlg). Our current requirements state that we only accept individual languages.


### [Mongolian (mon)](https://tatoeba.org/sentences/show_all_in/mon/none/none/indifferent) 

Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).   

Mongolian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mon). Our current requirements state that we only accept individual languages.


### [Nahuatl (nah)](https://tatoeba.org/sentences/show_all_in/nah/none/none/indifferent) 

Added in [March 2014](https://github.com/Tatoeba/tatoeba2/commit/04b94c9b94f9bdd26986af42071f2e7db950db93).

Nahuatl is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collective language](https://iso639-3.sil.org/code/nah) in the ISO 636-2/5 standards.


### [Ojibwe (oji)](https://tatoeba.org/sentences/show_all_in/oji/none/none/indifferent)     

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/6704f06f6689f42f7019bf4bc7c8dfae2d219bb2).

Ojibwe is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/oji). Our current requirements state that we only accept individual languages.


### [Odia (Oriya) (ori)](https://tatoeba.org/sentences/show_all_in/ori/none/none/indifferent)       

Added in [November 2015](https://github.com/Tatoeba/tatoeba2/commit/a6bb5fc3e430af18cd6707a82575bbd32cd817a8).

Oriya is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ori). Our current requirements state that we only accept individual languages.


### [Pashto (pus)](https://tatoeba.org/sentences/show_all_in/pus/none/none/indifferent)   

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/2bd80e28a291f7825381b8b03908226f71bb225f).

Pashto is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/pus). Our current requirements state that we only accept individual languages.


### [Quechua (que)](https://tatoeba.org/sentences/show_all_in/que/none/none/indifferent)   

Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).

Quechua is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/que). Our current requirements state that we only accept individual languages.


### [Romani (rom)](https://tatoeba.org/sentences/show_all_in/rom/none/none/indifferent)   

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/e99cc58727bb64f35736bbc7553da25a18caac28).

Romani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/rom). Our current requirements state that we only accept individual languages.


### [Albanian (sqi)](https://tatoeba.org/sentences/show_all_in/sqi/none/none/indifferent)    
 
Added in [April 2010](https://github.com/Tatoeba/tatoeba2/commit/84569292313ad82cfa354d5960aed17e35c1c5fc).

Albanian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/sqi). Our current requirements state that we only accept individual languages.


### [Sardinian (srd)](https://tatoeba.org/sentences/show_all_in/srd/none/none/indifferent)   
 
Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/4ebd2deac3b83788ef15d4d13aa4325c2aecedd7).

Sardinian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/srd). Our current requirements state that we only accept individual languages.


### [Toki Pona (toki)](https://tatoeba.org/sentences/show_all_in/toki/none/none/indifferent)   

Added in [November 2010](https://github.com/Tatoeba/tatoeba2/commit/243aecdbb925c478909d28805079939395cc5b55).

Toki Pona is an exception because it is [not defined](https://iso639-3.sil.org/code/toki) in the ISO 639-3 standard.


### [Uzbek (uzb)](https://tatoeba.org/sentences/show_all_in/uzb/none/none/indifferent)    

Added in [July 2010](https://github.com/Tatoeba/tatoeba2/commit/b83a564d20fa6b42a110df0a49ae5e2f7a7cc732).

Uzbek is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/uzb). Our current requirements state that we only accept individual languages.


### [Yiddish (yid)](https://tatoeba.org/sentences/show_all_in/yid/none/none/indifferent)     

Added in [May 2010](https://github.com/Tatoeba/tatoeba2/commit/8ebb4a32ac25cf4407fc4701e5753c50533d366d).

Yiddish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/yid). Our current requirements state that we only accept individual languages.


### [Zaza (zza)](https://tatoeba.org/sentences/show_all_in/zza/none/none/indifferent)       

Added in [July 2015](https://github.com/Tatoeba/tatoeba2/commit/439a46ce7e6a503d6fbb3ef6e37aebe24ecc5d4e).

Zaza is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/zza). Our current requirements state that we only accept individual languages.

## Related articles

* [How to request a new language](https://en.wiki.tatoeba.org/articles/show/new-language-request)

version at: 01/09/2020, 14:13

# Language exceptions

Language classification is a complex task and it is not part of Tatoeba's mission. We rely on the [ISO 639-3 standard](https://iso639-3.sil.org/) to define what is a valid language. In other words, if a language is not defined in this standard, we do not support it in Tatoeba.

There are however some exceptions. This article lists these exceptions and explains the reasons behind it.

## How our language request requirements evolved

To understand why we have exceptions, it is important to understand how Tatoeba's requirement evolved.

At the very beginning of Tatoeba (that is in 2006), languages were associated to a 2-letter code. The codes were not explicitly following any standards, they were just chosen based on common sense.

Our decision to follow the ISO 639-3 standard started in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/f2f136ddae2c822c05c47a1b8a4523bfc4424518). This was motivated by the desire to [support Shanghainese in Tatoeba](https://github.com/Tatoeba/tatoeba2/commit/278f7c101430ffd840f94b0ff55ad14c78683480). We realized we needed a linguistic framework to categorize languages and we chose to rely on the ISO 639-3 standard.

Our decision to follow more strictly this standard was made official on [February 2011](https://github.com/Tatoeba/tatoeba2/commit/dc9d8981cfea5e7c68d1d9b161d937d0d3616463#diff-37e8f945736d0858f122ad4a31197068). We started to notice users adding sentences in unknown/new constructed languages and realized that we could not support all the constructed languages that people may come up with. We added the following warning in our instructions for language requests:

    IMPORTANT: We cannot add your language if it does not have an ISO 639-3 code. At this point we already have a lot of languages to deal with, and it's a bit too complicated to deal with languages that are not "officially" recognized.

On more recent events, we decided to restrict language requests to individual languages. This was made official in [October 2018](https://en.wiki.tatoeba.org/history/show-diff-between/18/2206/2223). We realized that Tatoeba does not properly take into account [macrolanguages](https://github.com/Tatoeba/tatoeba2/issues/1673) and until we come up with a technical solution for this issue, we will avoid adding more macrolanguages.

## Why do we have exceptions?

We do not consider the ISO 639-3 standard to be the holy grail of language categorization. It is only a tool that helps us make decisions on how we organize the data collected through Tatoeba, but Tatoeba has its own history and we take this history into account more than anything else.

Whenever we introduce changes in our rules, the new rules applies to new language requests, not to existing languages. 

We will not make any change on already supported languages unless there has been a discussion and an agreement with the contributors on what to do with their language.

If we are not facing any major issues to keep supporting a language even though it does not fit the new policies, then we will not bother removing or changing it.


## Will we make more exceptions?

No. 

If we receive a language request that cannot fit into our current rules, we can review the rules so that we can add the language without making it an exception.

But if we cannot agree on new rules that would support the requested language, then the language will not be supported.


## Our current exceptions

*This list may not be exhaustive. If you notice other exceptions, please contact [Trang](https://tatoeba.org/eng/private_messages/write/TRANG).*

### [Arabic (ara)](https://tatoeba.org/sentences/show_all_in/ara/none/none/indifferent)    

Added in [April 2009](https://github.com/Tatoeba/tatoeba2/commit/db90a4fb19af04167e907c7998b45cbe80e64002).

Arabic is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ara). Our current requirements state that we only accept individual languages.

### [Aymara (aym)](https://tatoeba.org/sentences/show_all_in/aym/none/none/indifferent)    

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/f6c524a82b32087bad9b834952fc874d21ee5a3c).

Aymara is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aym). Our current requirements state that we only accept individual languages.

### [Azerbaijani (aze)](https://tatoeba.org/sentences/show_all_in/aze/none/none/indifferent)   

Added in [May 2013](https://github.com/Tatoeba/tatoeba2/commit/7e5f887d6ef347ad4c01756e2b8b17c7fec9d69c).

Azerbaijani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/aze). Our current requirements state that we only accept individual languages.

### [Baluchi (bal)](https://tatoeba.org/sentences/show_all_in/bal/none/none/indifferent) 

Added in [July 2020](https://github.com/Tatoeba/tatoeba2/commit/a0efa1cf2863aaa154e53d16b70b47d485ce5f85).

Baluchi is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bal). Our current requirements state that we only accept individual languages.

### [Ber (ber)](https://tatoeba.org/sentences/show_all_in/ber/none/none/indifferent)   

Added in [June 2012](https://github.com/Tatoeba/tatoeba2/commit/318c4da005310de53e0b5f6fa4bc2b75da687e26).

Berber is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collection of languages](https://iso639-3.sil.org/code/ber) in the ISO 639-2/5 standards.

### [Buryat (bua)](https://tatoeba.org/sentences/show_all_in/bua/none/none/indifferent) 

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/30988eb0a88abdf3e7bde888f1371cddf925ea33).

Buryat is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/bua). Our current requirements state that we only accept individual languages.

### [CycL (cycl)](https://tatoeba.org/sentences/show_all_in/cycl/none/none/indifferent)  

Added in [August 2010](https://github.com/Tatoeba/tatoeba2/commit/e0f584c66d2885941c597ce79f1a758f51b1769a).

CycL is an exception because it is [not defined](https://iso639-3.sil.org/code/cycl) in the ISO 639-3 standard.

### [Estonian (est)](https://tatoeba.org/sentences/show_all_in/est/none/none/indifferent) 

Added in [December 2009](https://github.com/Tatoeba/tatoeba2/commit/9d1c9673ebdd18cb66f06d530b45c15f0c3362da).

Estonian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/est). Our current requirements state that we only accept individual languages.

### [Guarani (grn)](https://tatoeba.org/sentences/show_all_in/grn/none/none/indifferent) 

Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).

Guarani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/grn). Our current requirements state that we only accept individual languages.

### [Kurdish (kur)](https://tatoeba.org/sentences/show_all_in/kur/none/none/indifferent) 

Added in [April 2012](https://github.com/Tatoeba/tatoeba2/commit/618497c35d41d3d6be23677e81682094540abbbc).

Kurdish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/kur). Our current requirements state that we only accept individual languages.

### [Malagasy (mlg)](https://tatoeba.org/sentences/show_all_in/mlg/none/none/indifferent) 

Added in [August 2012](https://github.com/Tatoeba/tatoeba2/commit/51bdfb9f5c4bfad8cb7cb52c69d09af114c375e6).

Malagasy is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mlg). Our current requirements state that we only accept individual languages.

### [Mongolian (mon)](https://tatoeba.org/sentences/show_all_in/mon/none/none/indifferent) 

Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).   

Mongolian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/mon). Our current requirements state that we only accept individual languages.

### [Nahuatl (nah)](https://tatoeba.org/sentences/show_all_in/nah/none/none/indifferent) 

Added in [March 2014](https://github.com/Tatoeba/tatoeba2/commit/04b94c9b94f9bdd26986af42071f2e7db950db93).

Nahuatl is an exception because it is not defined in the ISO 639-3 standard. It is however defined [as a collection of languages](https://iso639-3.sil.org/code/nah) in the ISO 639-2/5 standards.

### [Ojibwe (oji)](https://tatoeba.org/sentences/show_all_in/oji/none/none/indifferent)     

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/6704f06f6689f42f7019bf4bc7c8dfae2d219bb2).

Ojibwe is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/oji). Our current requirements state that we only accept individual languages.

### [Odia (Oriya) (ori)](https://tatoeba.org/sentences/show_all_in/ori/none/none/indifferent)       

Added in [November 2015](https://github.com/Tatoeba/tatoeba2/commit/a6bb5fc3e430af18cd6707a82575bbd32cd817a8).

Oriya is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/ori). Our current requirements state that we only accept individual languages.

### [Pashto (pus)](https://tatoeba.org/sentences/show_all_in/pus/none/none/indifferent)   

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/2bd80e28a291f7825381b8b03908226f71bb225f).

Pashto is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/pus). Our current requirements state that we only accept individual languages.

### [Quechua (que)](https://tatoeba.org/sentences/show_all_in/que/none/none/indifferent)   

Added in [October 2010](https://github.com/Tatoeba/tatoeba2/commit/940c0a746c25bc0ed69d735a27746f7bde03f11d).

Quechua is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/que). Our current requirements state that we only accept individual languages.

### [Romani (rom)](https://tatoeba.org/sentences/show_all_in/rom/none/none/indifferent)   

Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/e99cc58727bb64f35736bbc7553da25a18caac28).

Romani is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/rom). Our current requirements state that we only accept individual languages.

### [Albanian (sqi)](https://tatoeba.org/sentences/show_all_in/sqi/none/none/indifferent)    
 
Added in [April 2010](https://github.com/Tatoeba/tatoeba2/commit/84569292313ad82cfa354d5960aed17e35c1c5fc).

Albanian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/sqi). Our current requirements state that we only accept individual languages.

### [Sardinian (srd)](https://tatoeba.org/sentences/show_all_in/srd/none/none/indifferent)   
 
Added in [October 2015](https://github.com/Tatoeba/tatoeba2/commit/4ebd2deac3b83788ef15d4d13aa4325c2aecedd7).

Sardinian is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/srd). Our current requirements state that we only accept individual languages.

### [Toki Pona (toki)](https://tatoeba.org/sentences/show_all_in/toki/none/none/indifferent)   

Added in [November 2010](https://github.com/Tatoeba/tatoeba2/commit/243aecdbb925c478909d28805079939395cc5b55).

Toki Pona is an exception because it is [not defined](https://iso639-3.sil.org/code/toki) in the ISO 639-3 standard.

### [Uzbek (uzb)](https://tatoeba.org/sentences/show_all_in/uzb/none/none/indifferent)    

Added in [July 2010](https://github.com/Tatoeba/tatoeba2/commit/b83a564d20fa6b42a110df0a49ae5e2f7a7cc732).

Uzbek is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/uzb). Our current requirements state that we only accept individual languages.

### [Yiddish (yid)](https://tatoeba.org/sentences/show_all_in/yid/none/none/indifferent)     

Added in [May 2010](https://github.com/Tatoeba/tatoeba2/commit/8ebb4a32ac25cf4407fc4701e5753c50533d366d).

Yiddish is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/yid). Our current requirements state that we only accept individual languages.

### [Zaza (zza)](https://tatoeba.org/sentences/show_all_in/zza/none/none/indifferent)       

Added in [July 2015](https://github.com/Tatoeba/tatoeba2/commit/439a46ce7e6a503d6fbb3ef6e37aebe24ecc5d4e).

Zaza is defined in the ISO 639-3 standard but [as a macrolanguage](https://iso639-3.sil.org/code/zza). Our current requirements state that we only accept individual languages.

## Related articles

* [How to request a new language](https://en.wiki.tatoeba.org/articles/show/new-language-request)

Note

The lines in green are the lines that have been added in the new version. The lines in red are those that have been removed.