Notice

This page shows a previous version of the article.

The Tatoeba corpus is effectively a set of sentences and a set of links between them. Whenever a sentence is translated, a link is automatically created between the source and the translation. Sometimes, however, a translation already exists in the corpus. In this case, the ideal thing to do is to link to that translation.
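
The description above can be sketched as a small data model. This is only an illustration, not Tatoeba's actual schema or code: the `Corpus` class, its method names, and the choice to store each link as an unordered pair of sentence ids are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Corpus:
    """Toy model (hypothetical): sentences keyed by id, plus a set of links."""
    sentences: dict = field(default_factory=dict)  # id -> (lang, text)
    links: set = field(default_factory=set)        # frozenset({id_a, id_b})
    next_id: int = 1

    def add_sentence(self, lang, text):
        """Store a new sentence and return its id."""
        sid = self.next_id
        self.sentences[sid] = (lang, text)
        self.next_id += 1
        return sid

    def find(self, lang, text):
        """Return the id of an existing identical sentence, or None."""
        for sid, entry in self.sentences.items():
            if entry == (lang, text):
                return sid
        return None

    def link(self, a, b):
        """Record a link between two sentence ids (order does not matter)."""
        self.links.add(frozenset((a, b)))

    def translate(self, source_id, lang, text):
        """Adding a translation creates the link automatically."""
        new_id = self.add_sentence(lang, text)
        self.link(source_id, new_id)
        return new_id

    def link_to_existing(self, source_id, lang, text):
        """Preferred path: reuse the translation if it is already present."""
        existing = self.find(lang, text)
        if existing is None:
            return self.translate(source_id, lang, text)
        self.link(source_id, existing)
        return existing
```

In this sketch, `link_to_existing` adds no new sentence when the translation is already in the corpus, so no duplicate is created and only the set of links grows.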

(Another option is to create a duplicate of the sentence and wait for the deduplication script, which is run from time to time, to merge the two sentences. However, this is less desirable: merging two sentences, along with their existing links and comment threads, may cause technical problems and confusion, and the script must be run manually.)
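
To show why merging is more involved than linking, here is a minimal sketch of a deduplication pass. It is not the real script: the function name `merge_duplicates`, the rule of keeping the lowest id, and the representation of links as unordered id pairs are assumptions, and comment threads are not modeled at all.

```python
def merge_duplicates(sentences, links):
    """Merge sentences with identical (lang, text), keeping the lowest id.

    sentences: dict mapping id -> (lang, text)
    links: set of frozenset({id_a, id_b}), each with two distinct ids
    Returns (merged_sentences, merged_links).
    """
    canonical = {}  # (lang, text) -> id that survives the merge
    remap = {}      # every original id -> surviving id
    for sid in sorted(sentences):
        key = sentences[sid]
        canonical.setdefault(key, sid)
        remap[sid] = canonical[key]

    merged_sentences = {sid: sentences[sid] for sid in set(remap.values())}

    merged_links = set()
    for pair in links:
        a, b = tuple(pair)
        a, b = remap[a], remap[b]
        if a != b:  # drop links that collapse onto a single sentence
            merged_links.add(frozenset((a, b)))
    return merged_sentences, merged_links
```

Every link that pointed at a removed duplicate has to be redirected to the surviving sentence, which is the kind of bookkeeping that linking to the existing translation avoids.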

Since linking indiscriminately can be destructive, the privilege to create and destroy links is reserved for advanced contributors, corpus maintainers, and admins. Once a contributor is promoted to advanced contributor, they will begin seeing icons for linking sentences (a chain) and unlinking sentences (a pair of scissors). Clicking on the chain icon links a pair of sentences, and clicking on the scissors icon unlinks them.
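
The permission rule can be pictured as a simple role check. The role strings and function names below are hypothetical and chosen only for illustration; they do not reflect how Tatoeba implements its permissions.

```python
# Hypothetical role names for the three groups allowed to edit links.
LINKING_ROLES = {"advanced_contributor", "corpus_maintainer", "admin"}


def can_edit_links(role):
    """Return True if this role may create or remove links."""
    return role in LINKING_ROLES


def link(links, role, a, b):
    """Chain icon: link sentences a and b, if the role allows it."""
    if not can_edit_links(role):
        raise PermissionError(f"role {role!r} may not link sentences")
    links.add(frozenset((a, b)))


def unlink(links, role, a, b):
    """Scissors icon: remove the link between a and b, if the role allows it."""
    if not can_edit_links(role):
        raise PermissionError(f"role {role!r} may not unlink sentences")
    links.discard(frozenset((a, b)))
```

A regular contributor calling `link` in this sketch gets a `PermissionError`, which mirrors the chain and scissors icons not being shown to them.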