Version at: 21/09/2023, 23:41 vs. version at: 21/09/2023, 23:43
11[Go back to Advanced Search at tatoeba.org](https://tatoeba.org/eng/sentences/advanced_search)
22
33#How to Search for Text
44
55
66
77## Introduction
88
99Tatoeba provides two ways to search for sentences:
1010
1111* the regular search bar at the top of every page
1212* [advanced search](https://en.wiki.tatoeba.org/articles/show/advanced-search#), which you can reach from the **Advanced search** link above the regular search bar
1313
1414### Regular search
1515
1616For regular search, there are three fields:
1717
1818* the main field, which selects the word or words that you're looking for
1919* the **From** field, which selects the language you're looking for matches in
2020* the **To** field, which limits the search to sentences that have been directly or indirectly translated into the language you choose
2121
2222#### Main search field
2323
2424If you leave the main search field empty, it will find all sentences that match the **From** and **To** values that you've chosen. Otherwise, it will search for sentences containing the word or words that you type in.
2525
2626The search engine that Tatoeba uses ([Manticore](https://manticoresearch.com/)) is a little different from other search engines that you may have used, such as Google's. Please note the following:
2727
2828(1) Punctuation marks like _?_ and _!_ have special purposes in our search engine (Manticore, previously Sphinx). If you don't want to use those special functions, you should leave them out.
2929
3030(2) In Arabic, Basque, Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Irish, Italian, Lithuanian, Nepali, Norwegian (Bokmål), Portuguese, Romanian, Russian, Spanish, Swedish, Tamil and Turkish (though not in related languages such as Ancient Greek), a search for a word such as _live_ will also find similar words, such as _lived_ and _living_. If you want to indicate that a word should be matched exactly, you must put an equals sign before it: _=live_
3131
3232(3) If you are searching for sentences in a language (such as Japanese or Chinese) that does not put spaces between words, be sure to see the section [Languages without word boundaries](https://en.wiki.tatoeba.org/articles/show/text-search#languages-without-word-boundaries) below.
3333
3434(4) You can use quotation marks to group words into phrases. For instance, _met him_ will find matches where the words _met_ and _him_ will occur anywhere in the sentence, but _"met him"_ will only find matches where the words occur in that order.
3535
3636(5) For more information, read the section [Examples in English](https://en.wiki.tatoeba.org/articles/show/text-search#examples-in-english).
3737
3838#### From
3939
4040The "From" field can be set to "Any language", in which case the search will find words in any language. Otherwise, the search will only find words in the language you choose.
4141
4242#### To
4343
4444The "To" field can be set to "Any language", in which case it will be ignored. Otherwise, the search will only find sentences that are linked to sentences in the language you choose. They can either be directly linked, in which case they will be shown in black, or indirectly linked, in which case they will be shown in gray. Two sentences are indirectly linked when there is a chain of translations between them but no one has put a link between those two sentences themselves. This means you cannot be sure that the sentences are translations of each other.
4545
4646## Examples in English
4747
4848* To find English sentences with "live", "lives", "living" or "lived", search for the word "live". (This will also find sentences with "Live", "Living", etc., since capitalization is ignored.)
4949
5050 * [live](http://tatoeba.org/eng/sentences/search?query=live+&from=eng&to=und)
5151
5252* To match a word exactly (ignoring capitalization), put an equals sign (=) before it.
5353
5454 * [=live](http://tatoeba.org/eng/sentences/search?query=%3Dlive+&from=eng&to=und)
5555
5656* Leave punctuation out of your search string. Most punctuation will be ignored, but a final exclamation mark (!) or question mark (?) will actually interfere with the search. (See [Sentences with punctuation marks](https://en.wiki.tatoeba.org/articles/show/text-search#sentences-with-punctuation-marks) for an example.) These symbols have other purposes, as described later on this page.
5757
5858* Put a $ after a word to find sentences ending with that word. The example finds English sentences ending with "Tom".
5959
6060 * [Tom$](http://tatoeba.org/eng/sentences/search?query=Tom%24&from=eng&to=und)
6161
6262* Most punctuation symbols cannot be found via a search. However, $ and _ are special. You can search for sentences containing either of these characters by putting a backslash before the symbol.
6363
6464 * [\$](https://tatoeba.org/eng/sentences/search?query=%5C%24&from=und&to=und)
6565 * [\\_](https://tatoeba.org/eng/sentences/search?query=%5C_&from=und&to=und)
6666 * [\$(1|2|3|4|5|6|7|8|9)](https://tatoeba.org/eng/sentences/search?query=%5C%24%281%7C2%7C3%7C4%7C5%7C6%7C7%7C8%7C9%29&from=eng&to=und)
6767 finds sentences with a $ followed by a number.
6868
6969* Put a ^ before a word to find sentences beginning with that word. The example finds English sentences beginning with "Tom".
7070
7171 * [^Tom](http://tatoeba.org/eng/sentences/search?query=%5ETom&from=eng&to=und)
7272
7373* This example finds English sentences beginning with "Tom" and ending with "Mary".
7474
7575 * [^Tom Mary$](http://tatoeba.org/eng/sentences/search?query=%5ETom+Mary%24&from=eng&to=und)
7676
7777* This example finds English sentences beginning with either "Tom" or "He".
7878
7979 * [(^Tom|^He)](http://tatoeba.org/eng/sentences/search?query=%28%5ETom%7C%5EHe%29&from=eng&to=und)
8080
8181* This example finds English sentences including any of the following words: "fasting", "fasted", "fasts". Using the equals sign means you'll get exact matches, so you will not find the forms "faster" and "fastest".
8282
8383 * [(=fasting|=fasted|=fasts)](https://tatoeba.org/eng/sentences/search?query=%28%3Dfasting%7C%3Dfasted%7C%3Dfasts%29&from=eng&to=und)
8484
8585* To search for a phrase, put quotes (") around it. Put an equals sign in front of each word that you want to be matched exactly. Or put an equals sign directly before the quotes to match every word in the quotes.
8686 * If you want to see phrases like "live in Boston", "living in Boston", or "lives in Boston", use the following search:
8787
8888 * ["live in boston"](http://tatoeba.org/eng/sentences/search?query=%22live+in+boston%22&from=eng&to=und)
8989
9090 * The following searches will only find sentences with the exact phrase "live in Boston".
9191
9292 * ["=live =in =boston"](http://tatoeba.org/eng/sentences/search?query=%22%3Dlive+%3Din+%3Dboston%22&from=eng&to=und)
9393 * [="live in boston"](http://tatoeba.org/eng/sentences/search?query=%3D%22live+in+boston%22&from=eng&to=und)
9494
9595 * This search will only find sentences consisting of the exact words "I live in Boston", without any additional words.
9696
9797 * ["^I =live =in =Boston$"](http://tatoeba.org/eng/sentences/search?query=%22%5EI+%3Dlive+%3Din+%3DBoston%24%22&from=eng&to=und)
9898
9999* This example finds English sentences that have "Tom", but don't begin with "Tom."
100100
101101 * [-^Tom Tom](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom&from=eng&to=und)
102102
103103* This example finds English sentences that have "Tom", but don't begin or end with "Tom."
104104
105105 * [-^Tom Tom -Tom$](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom+-Tom%24&from=eng&to=und)
106106
107107* The question mark (?) as part of a word is a one-letter wildcard.
108108
109109 * The following will find sentences with either "whenever" and "wherever."
110110
111111 * [whe?ever](https://tatoeba.org/eng/sentences/search?query=whe%3Fever&from=und&to=und)
112112
113113 * The following will find sentences with with 6-letter words that have 2 letters, and then "eve" and then one more letter, such as "clever" "eleven", "peeves", and "uneven".
114114
115115 * [??eve?](https://tatoeba.org/eng/sentences/search?query=%3F%3Feve%3F&from=eng&to=und)
116116
117117* This example finds English sentences that have "Tom", then two words, then "Mary", then one word, and then "John."
118118
119119 * ["Tom * * Mary * John"](https://tatoeba.org/eng/sentences/search?query=%22Tom+*+*+Mary+*+John%22&from=eng&to=und)
120120
121121* This example finds English sentences that start with "Tom", then have three words, then end with "Mary".
122122
123123 * ["^Tom * * * Mary$"](https://tatoeba.org/eng/sentences/search?query=%22%5ETom+*+*+*+Mary%24%22&from=und&to=und)
124124
125125* This example finds English sentences that have words beginning with "red", including the word "red".
126126
127127 * [red*](https://tatoeba.org/eng/sentences/search?query=red*&from=eng&to=und)
128128
129129* This example finds English sentences that have words ending with "red", including the word "red".
130130
131131 * [*red](https://tatoeba.org/eng/sentences/search?query=*red&from=eng&to=und)
132132
133133* This example finds English sentences that have words containing the word "red", including the word "red".
134134
135135 * [\*red\*](https://tatoeba.org/eng/sentences/search?query=*red*&from=eng&to=und)
136136
137137* This example finds English sentences with the letter sequence "red", but not the word "red."
138138
139 * [red\*|\*red|\*red\* -red](https://tatoeba.org/en/sentences/search?from=eng
140&query=red*%7C*red%7C*red*+-red&to=und)
139 * [red\*|\*red|\*red\* -red](https://tatoeba.org/en/sentences/search?from=eng&query=red*%7C*red%7C*red*+-red&sort=random)
141140
142141
143142* This example finds English sentences that have the word "French", but don't have the word "Tom".
144143
145144 * [French -Tom](https://tatoeba.org/eng/sentences/search?query=French+-Tom&from=eng&to=und)
146145
147146* This example will find sentences with "cheek" (in any form: cheeks, etc.) that don't include any of the words preceded by a minus sign (-).
148147
149148 * [cheek -tear -slap -burn -red -hollow](http://tatoeba.org/eng/sentences/search?query=cheek+-tear+-slap+-burn+-red+-hollow&from=eng&to=und)
150149
151150* This example finds sentences in which the word "cat" comes before the word "dog."
152151
153152 * [cat << dog](https://tatoeba.org/eng/sentences/search?query=cat+%3C%3C+dog&from=eng&to=und)
154153
155154* This example finds sentences that contain at least two of the words "cat", "dog", and "fish" (a "quorum search").
156155
157156 * ["cat dog fish"/2](https://tatoeba.org/eng/sentences/search?query=%22cat+dog+fish%22/2&from=eng&to=und)
158157
159158
160159### How to limit sentences to "I can" without getting "I can't".
161160
162161* This shows just sentences beginning with "I can't."
163162
164163 * ["^I =can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%27t%22)
165164
166165* However, this search shows both the "I can" and "I can't" sentences.
167166
168167 * ["^I =can"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22)
169168
170169* To just get "I can" sentences, without the "I can't" sentences, use this search. (Note that the quotes are necessary.)
171170
172171 * ["^I =can" -"can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22+-"can%27t")
173172
174173### How to find English sentences without "the", "a" or "an"
175174
176175* This search will get an error message, since you must specify at least one word that you want to include, not only words that you want to exclude:
177176
178177> -the -a -an
179178
180179* If you are determined to get as many results as possible, you can search for words that start with any letter of the alphabet, after putting a minus before each word that you do not want (though this query will take a long time):
181180
182181> [-the -a -an a\*|b\*|c\*|d\*|e\*|f\*|g\*|h\*|i\*|j\*|k\*|l\*|m\*|n\*|o\*|p\*|q\*|r\*|s\*|t\*|u\*|v\*|w\*|x\*|y\*|z\*](https://tatoeba.org/eng/sentences/search?query=-the+-a+-an+a*%7Cb*%7Cc*%7Cd*%7Ce*%7Cf*%7Cg*%7Ch*%7Ci*%7Cj*%7Ck*%7Cl*%7Cm*%7Cn*%7Co*%7Cp*%7Cq*%7Cr*%7Cs*%7Ct*%7Cu*%7Cv*%7Cw*%7Cx*%7Cy*%7Cz*&from=eng)
183182
184183
185184### How to find sentences with "of" followed by words ending in "ing" without any intervening words
186185
187186 * [of NEAR/1 \*ing -"\*ing of"](https://tatoeba.org/eng/sentences/search?query=of+NEAR%2F1+*ing+-%22*ing+of%22&from=eng&to=none&user=&orphans=no&unapproved=no&has_audio=&tags=&list=&native=&trans_filter=limit&trans_to=und&trans_link=&trans_user=&trans_orphan=&trans_unapproved=&trans_has_audio=&sort=words&sort_reverse=)
188187
189188 * Notes
190189 * The -"ing of" part is necessary to avoid getting results where the -ing word comes before "of."
191190 * The search results will favor sentences that contain multiple occurrences of *ing. If you don't want this, change the search order.
192191
193192
194193### How to find sentences with the "I myself" phrase (without any intervening words)
195194
196195 * [=I NEAR/1 =myself -(myself << I)](https://tatoeba.org/en/sentences/search?from=eng&query=%3DI++NEAR%2F1++%3Dmyself+-%28myself+%3C%3C+I%29)
197196
198197 * Note
199198 * The 2nd part of the query with the minus sign (-) is to ignore sentences that have the sequence "myself I". There are a number of such sentences.
200199
201200### How to find sentences with the "do that", "does that", "did that", "doing that" and "done that".
202201
203202 * [(=do|=does|=did|=doing|=done) NEAR/1 that -^Does -^Did -"that << does" -"that << did"](https://tatoeba.org/en/sentences/search?from=eng&query= %28%3Ddo%7C%3Ddoes%7C%3Ddid%7C%3Ddoing%7C%3Ddone%29+NEAR%2F1+that+-%5EDoes+-%5EDid+-%22that+%3C%3C+does%22+-%22that+%3C%3C+did%22&sort=random)
204203
205204
206205 * Note
207206 * In order to limit this to getting sentences with the verb phrase "do that," the last part of the query is included to avoid sentences starting with "Does that" or "Did that" and to avoid the sequences "that does" and "that did."
208207
209208
210209## Languages without word boundaries
211210
212211For languages that don't use space characters to separate words, like Japanese, Chinese etc. the search engine interprets each character as a single word. For instance, searching for 逆に will return the same results as 逆 に, which actually matches sentences that only *include* these characters, but not necessarily in that particular order, or not contiguously. So you should surround keywords with quotes, as in this example:
213212
214213["逆に"](http://tatoeba.org/jpn/sentences/search?query=%22%E9%80%86%E3%81%AB%22&from=jpn).
215214
216215
217216## More details
218217
219218The search ignores capitalization and punctuation (unless the punctuation happens to match one of the special characters described elsewhere on the page).
220219
221220If you want to find an exact match for a word in a language (like English) that supports stemming, you must precede it with an equals sign, as in *=live*. This may come as a surprise to users who are accustomed to Google Search, where wrapping a word or phrase in double quotes forces an exact match. In Manticore, double quotes have a different function, which only affects multiword (phrase) searches: wrapping a phrase in double quotes requires matching sentences to contain words in the specified continuous sequence. Simply placing a phrase in quotes does not suppress stemming of its individual words. To do that, you will need to place an equals sign before each word in the phrase for which you want to suppress stemming, or directly before the first quotation mark to suppress stemming for each word. If you want to put both an equals sign and a caret before the same word, the equals sign should precede the caret. For instance, to find sentences that begin with the exact word *Noise*, search for *=^noise*, not *^=noise*.
222221
223222As an example, take the search *like thing*. This will find *like things*, *likely things*, and even *things like*. Adding quotes, as in *"like thing"*, will prevent a match against *things like* (where the words appear in the wrong order), but it will continue to match *like things*, *likely things*, and so on. By contrast, *"=like =thing"* will only match *like thing* (which does not occur in the Tatoeba corpus). Removing the double quotes, *=like =thing*, will match *What made you do a silly thing like that?* Removing one of the equals signs, as in *like =thing*, will find *Such a strange thing is not likely to happen.*
224223
225224Note that a star (*) can be placed at the beginning, middle, or end of a string representing a word, but a string beginning with a star must be at least three characters long, or it will be ignored.
226225
227226## Limit matches to transcriptions or alternative scripts or sentence text
228227
229228Some languages can be written in different scripts (such as traditional/simplified Chinese, or Latin/Cyrillic Uzbek). Others also have transcriptions (such as Pinyin Chinese or Japanese furigana).
230229
231230By default, keywords will be searched everywhere: sentence text, alternative script and transcription. This means a sentence might come up in the results just because the transcription is matching.
232231
233232You can control exactly what is searched by using the @text and @transcription prefixes, respectively targeting the sentence text and what’s under the sentence text.
234233
235234* To search for Japanese sentences containing かな in the furigana.
236235 * [@transcription かな](https://tatoeba.org/sentences/search?query=%40text+%E3%81%8B%E3%81%AA&from=jpn&to=und)
237236
238237* To search for Japanese sentences containing 国 in sentence text and くに in the furigana.
239238 * [@text 国 @transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)
240239
241240* To search for Japanese sentences containing 国 in sentence text but NOT くに in the furigana.
242241 * [@text 国 -@transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+-%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)
243242
244243* To search for Chinese sentences containing 著 in sentence text and zháo in Pinyin.
245244 * [@text 著 @transcription zhao3](https://tatoeba.org/sentences/search?query=%40text+%E8%91%97+%40transcription+zhao3&from=cmn&to=und)
246245
247246## Other search operators
248247
249248* A **vertical bar** (**|**, representing "or") finds examples where either of the words appears:
250249 * *hate | detest* will match sentences with either *hate* or *detest* (or both).
251250
252251* If you want to combine an or-expression with other terms, you need to put the or-expression in **parentheses**:
253252 * *(red|blue) house* will match sentences in which the word "house" appears together with either "red" or "blue" (or both)
254253
255254* A **dash** (**-**) before a word prevents matches with sentences where the word appears: *like -thing* (or *like !thing*) will match *I like ice cream* but not *I like that red thing*. An exclamation mark (!) works the same way.
256255
257256* Putting a **caret** (**^**) before a word will match only sentences that begin with that word: *^great* will match *Great people are not always wise.* but not *You are the great love of my life.*
258257
259258* Putting a **dollar sign** (**$**) after a word will match only sentences that end with that word: *life$* will match *This is the best day of my life.* but not *Life means nothing without friends.*
260259
261260* If you want to search for sentences that contain nothing other than the specified words, use **double quotes, a caret, and a dollar sign in combination**: *"^i love you$"* will find *I love you.* and *I love you!* but not *I love you more than you love me.* (However, it will find *I loved you.* To prevent this match, use *"^i =love you$"*.)
262261
263262* The **strict order operator** (**<<**) between two words will find sentences where the first word occurs before the second but not where the second word comes before the first. Thus _dog << cat_ will find examples where _dog_ precedes _cat_, but not vice versa. This operator is also useful for searching for multiple instances of a string. For instance, *these << these << these* will find sentences with three instances of the word "these".
264263
265264* The **proximity operator**(**~_N_**, where _N_ is a positive number) following a phrase will limit the number of words that can separate the specified words to fewer than _N_. Thus _"you are *ble"~1_ will find *You are irresistible.* but not *You are partially responsible.*
266265
267266* The **quorum operator** (**/_N_**, where _N_ is a positive number) following a phrase will match sentences that have up to _N_ matching words. Thus _"I wish you would stop"/4_ will find both *I wish you would stop.* and *I wish you would reconsider.*
268267
269268* The **MAYBE** keyword will make the part of the sentence to the right optional. However, sentences that do contain the optional words will be listed first in a default search (that is, one sorted by "Relevance"). For instance, _"perhaps i can MAYBE help"_ will find both *Perhaps I can help.* and *Perhaps I can answer that.*, but *Perhaps I can help.* will be listed first. By contrast, _"perhaps i can MAYBE answer"_ will find the same sentences, but listed in the reverse order. **MAYBE** must be written in all capitals; _"maybe"_ will match the word *maybe*.
270269
271270See the [Manticore documentation for operators](https://manual.manticoresearch.com/Searching/Full_text_matching/Operators) for more detail.
272271
273272
274273---
275274### Related Articles
276275
277276[Advanced Search](advanced-search)
278277
279278---
280279
281280<a href="http://tatoeba.org/">Go back to tatoeba.org</a>
diff view generated by jsdifflib

Version at: 21/09/2023, 23:41

[Go back to Advanced Search at tatoeba.org](https://tatoeba.org/eng/sentences/advanced_search)

#How to Search for Text



## Introduction

Tatoeba provides two ways to search for sentences: 

* the regular search bar at the top of every page
* [advanced search](https://en.wiki.tatoeba.org/articles/show/advanced-search#), which you can reach from the **Advanced search** link above the regular search bar 

### Regular search

For regular search, there are three fields:

* the main field, which selects the word or words that you're looking for
* the **From** field, which selects the language you're looking for matches in
* the **To** field, which limits the search to sentences that have been directly or indirectly translated into the language you choose

#### Main search field

If you leave the main search field empty, it will find all sentences that match the **From** and **To** values that you've chosen. Otherwise, it will search for sentences containing the word or words that you type in. 

The search engine that Tatoeba uses ([Manticore](https://manticoresearch.com/)) is a little different from other search engines that you may have used, such as Google's. Please note the following:
 
(1) Punctuation marks like _?_ and _!_ have special purposes in our search engine (Manticore, previously Sphinx). If you don't want to use those special functions, you should leave them out.

(2) In Arabic, Basque, Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Irish, Italian, Lithuanian, Nepali, Norwegian (Bokmål), Portuguese, Romanian, Russian, Spanish, Swedish, Tamil and Turkish (though not in related languages such as Ancient Greek), a search for a word such as _live_ will also find similar words, such as _lived_ and _living_. If you want to indicate that a word should be matched exactly, you must put an equals sign before it: _=live_

(3) If you are searching for sentences in a language (such as Japanese or Chinese) that does not put spaces between words, be sure to see the section [Languages without word boundaries](https://en.wiki.tatoeba.org/articles/show/text-search#languages-without-word-boundaries) below.

(4) You can use quotation marks to group words into phrases. For instance, _met him_ will find matches where the words _met_ and _him_ will occur anywhere in the sentence, but _"met him"_ will only find matches where the words occur in that order.

(5) For more information, read the section [Examples in English](https://en.wiki.tatoeba.org/articles/show/text-search#examples-in-english).
 
#### From

The "From" field can be set to "Any language", in which case the search will find words in any language. Otherwise, the search will only find words in the language you choose.

#### To

The "To" field can be set to "Any language", in which case it will be ignored. Otherwise, the search will only find sentences that are linked to sentences in the language you choose. They can either be directly linked, in which case they will be shown in black, or indirectly linked, in which case they will be shown in gray. Two sentences are indirectly linked when there is a chain of translations between them but no one has put a link between those two sentences themselves. This means you cannot be sure that the sentences are translations of each other.

## Examples in English

* To find English sentences with "live", "lives", "living" or "lived", search for the word "live". (This will also find sentences with "Live", "Living", etc., since capitalization is ignored.)

  * [live](http://tatoeba.org/eng/sentences/search?query=live+&from=eng&to=und)

* To match a word exactly (ignoring capitalization), put an equals sign (=) before it. 

  * [=live](http://tatoeba.org/eng/sentences/search?query=%3Dlive+&from=eng&to=und)

* Leave punctuation out of your search string. Most punctuation will be ignored, but a final exclamation mark (!) or question mark (?) will actually interfere with the search. (See [Sentences with punctuation marks](https://en.wiki.tatoeba.org/articles/show/text-search#sentences-with-punctuation-marks) for an example.) These symbols have other purposes, as described later on this page.

* Put a $ after a word to find sentences ending with that word. The example finds English sentences ending with "Tom".

  * [Tom$](http://tatoeba.org/eng/sentences/search?query=Tom%24&from=eng&to=und)

* Most punctuation symbols cannot be found via a search. However, $ and _ are special. You can search for sentences containing either of these characters by putting a backslash before the symbol.

  * [\$](https://tatoeba.org/eng/sentences/search?query=%5C%24&from=und&to=und)
  * [\\_](https://tatoeba.org/eng/sentences/search?query=%5C_&from=und&to=und)
  * [\$(1|2|3|4|5|6|7|8|9)](https://tatoeba.org/eng/sentences/search?query=%5C%24%281%7C2%7C3%7C4%7C5%7C6%7C7%7C8%7C9%29&from=eng&to=und)
 finds sentences with a $ followed by a number.

* Put a ^ before a word to find sentences beginning with that word. The example finds English sentences beginning with "Tom".

  * [^Tom](http://tatoeba.org/eng/sentences/search?query=%5ETom&from=eng&to=und)

* This example finds English sentences beginning with "Tom" and ending with "Mary".

  * [^Tom Mary$](http://tatoeba.org/eng/sentences/search?query=%5ETom+Mary%24&from=eng&to=und)

* This example finds English sentences beginning with either "Tom" or "He".

  * [(^Tom|^He)](http://tatoeba.org/eng/sentences/search?query=%28%5ETom%7C%5EHe%29&from=eng&to=und)

* This example finds English sentences including any of the following words: "fasting", "fasted", "fasts".  Using the equals sign means you'll get exact matches, so you will not find the forms "faster" and "fastest".

  * [(=fasting|=fasted|=fasts)](https://tatoeba.org/eng/sentences/search?query=%28%3Dfasting%7C%3Dfasted%7C%3Dfasts%29&from=eng&to=und)

* To search for a phrase, put quotes (") around it. Put an equals sign in front of each word that you want to be matched exactly. Or put an equals sign directly before the quotes to match every word in the quotes.
  * If you want to see phrases like "live in Boston", "living in Boston", or "lives in Boston", use the following search:

      * ["live in boston"](http://tatoeba.org/eng/sentences/search?query=%22live+in+boston%22&from=eng&to=und)

  * The following searches will only find sentences with the exact phrase "live in Boston".

      * ["=live =in =boston"](http://tatoeba.org/eng/sentences/search?query=%22%3Dlive+%3Din+%3Dboston%22&from=eng&to=und)
      * [="live in boston"](http://tatoeba.org/eng/sentences/search?query=%3D%22live+in+boston%22&from=eng&to=und)

  * This search will only find sentences consisting of the exact words "I live in Boston", without any additional words.

      * ["^I =live =in =Boston$"](http://tatoeba.org/eng/sentences/search?query=%22%5EI+%3Dlive+%3Din+%3DBoston%24%22&from=eng&to=und)

* This example finds English sentences that have "Tom", but don't begin with "Tom."

  * [-^Tom Tom](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom&from=eng&to=und)

* This example finds English sentences that have "Tom", but don't begin or end with "Tom."

  * [-^Tom Tom -Tom$](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom+-Tom%24&from=eng&to=und)

* The question mark (?) as part of a word is a one-letter wildcard.

    * The following will find sentences with either "whenever" and "wherever."

        * [whe?ever](https://tatoeba.org/eng/sentences/search?query=whe%3Fever&from=und&to=und)

    * The following will find sentences with with 6-letter words that have 2 letters, and then "eve" and then one more letter,  such as "clever" "eleven", "peeves", and "uneven".

        * [??eve?](https://tatoeba.org/eng/sentences/search?query=%3F%3Feve%3F&from=eng&to=und)

* This example finds English sentences that have "Tom", then two words, then "Mary", then one word, and then "John."

  * ["Tom * * Mary * John"](https://tatoeba.org/eng/sentences/search?query=%22Tom+*+*+Mary+*+John%22&from=eng&to=und)

* This example finds English sentences that start with "Tom", then have three words, then end with "Mary".

  * ["^Tom * * * Mary$"](https://tatoeba.org/eng/sentences/search?query=%22%5ETom+*+*+*+Mary%24%22&from=und&to=und)

* This example finds English sentences that have words beginning with "red", including the word "red".

  * [red*](https://tatoeba.org/eng/sentences/search?query=red*&from=eng&to=und)

* This example finds English sentences that have words ending with "red", including the word "red".

  * [*red](https://tatoeba.org/eng/sentences/search?query=*red&from=eng&to=und)

* This example finds English sentences that have words containing the word "red", including the word "red".

  * [\*red\*](https://tatoeba.org/eng/sentences/search?query=*red*&from=eng&to=und)

* This example finds English sentences with the letter sequence "red", but not the word "red."

 * [red\*|\*red|\*red\* -red](https://tatoeba.org/en/sentences/search?from=eng
&query=red*%7C*red%7C*red*+-red&to=und)


* This example finds English sentences that have the word "French", but don't have the word "Tom".

  * [French -Tom](https://tatoeba.org/eng/sentences/search?query=French+-Tom&from=eng&to=und)

* This example will find sentences with "cheek" (in any form: cheeks, etc.) that don't include any of the words preceded by a minus sign (-).

  * [cheek -tear -slap -burn -red -hollow](http://tatoeba.org/eng/sentences/search?query=cheek+-tear+-slap+-burn+-red+-hollow&from=eng&to=und)

* This example finds sentences in which the word "cat" comes before the word "dog."

  * [cat << dog](https://tatoeba.org/eng/sentences/search?query=cat+%3C%3C+dog&from=eng&to=und)

* This example finds sentences that contain at least two of the words "cat", "dog", and "fish" (a "quorum search").

  * ["cat dog fish"/2](https://tatoeba.org/eng/sentences/search?query=%22cat+dog+fish%22/2&from=eng&to=und)


### How to limit sentences to "I can" without getting "I can't".

* This shows just sentences beginning with "I can't."

  * ["^I =can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%27t%22)

* However, this search shows both the "I can" and "I can't" sentences.

  * ["^I =can"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22)

* To just get "I can" sentences, without the "I can't" sentences, use this search. (Note that the quotes are necessary.)

  * ["^I =can" -"can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22+-"can%27t")

### How to find English sentences without "the", "a" or "an"

* This search will get an error message, since you must specify at least one word that you want to include, not only words that you want to exclude: 

> -the -a -an

* If you are determined to get as many results as possible, you can search for words that start with any letter of the alphabet, after putting a minus before each word that you do not want (though this query will take a long time):

> [-the -a -an a\*|b\*|c\*|d\*|e\*|f\*|g\*|h\*|i\*|j\*|k\*|l\*|m\*|n\*|o\*|p\*|q\*|r\*|s\*|t\*|u\*|v\*|w\*|x\*|y\*|z\*](https://tatoeba.org/eng/sentences/search?query=-the+-a+-an+a*%7Cb*%7Cc*%7Cd*%7Ce*%7Cf*%7Cg*%7Ch*%7Ci*%7Cj*%7Ck*%7Cl*%7Cm*%7Cn*%7Co*%7Cp*%7Cq*%7Cr*%7Cs*%7Ct*%7Cu*%7Cv*%7Cw*%7Cx*%7Cy*%7Cz*&from=eng)


### How to find sentences with "of" followed by words ending in "ing" without any intervening words

 * [of NEAR/1 \*ing -"\*ing of"](https://tatoeba.org/eng/sentences/search?query=of+NEAR%2F1+*ing+-%22*ing+of%22&from=eng&to=none&user=&orphans=no&unapproved=no&has_audio=&tags=&list=&native=&trans_filter=limit&trans_to=und&trans_link=&trans_user=&trans_orphan=&trans_unapproved=&trans_has_audio=&sort=words&sort_reverse=)

 * Notes
   * The -"ing of" part is necessary to avoid getting results where the -ing word comes before "of."
   * The search results will favor sentences that contain multiple occurrences of *ing. If you don't want this, change the search order.


### How to find sentences with the "I myself" phrase (without any intervening words)

 * [=I NEAR/1 =myself -(myself << I)](https://tatoeba.org/en/sentences/search?from=eng&query=%3DI++NEAR%2F1++%3Dmyself+-%28myself+%3C%3C+I%29)

 * Note
   * The 2nd part of the query with the minus sign (-) is to ignore sentences that have the sequence "myself I". There are a number of such sentences.

### How to find sentences with the "do that", "does that", "did that", "doing that" and "done that".

 * [(=do|=does|=did|=doing|=done) NEAR/1 that -^Does -^Did -"that << does" -"that << did"](https://tatoeba.org/en/sentences/search?from=eng&query= %28%3Ddo%7C%3Ddoes%7C%3Ddid%7C%3Ddoing%7C%3Ddone%29+NEAR%2F1+that+-%5EDoes+-%5EDid+-%22that+%3C%3C+does%22+-%22that+%3C%3C+did%22&sort=random)
 

 * Note
   * In order to limit this to getting sentences with the verb phrase "do that," the last part of the query is included to avoid sentences starting with "Does that" or "Did that" and to avoid the sequences "that does" and "that did."


## Languages without word boundaries

For languages that don't use space characters to separate words, like Japanese, Chinese etc. the search engine interprets each character as a single word. For instance, searching for 逆に will return the same results as 逆 に, which actually matches sentences that only *include* these characters, but not necessarily in that particular order, or not contiguously. So you should surround keywords with quotes, as in this example: 

["逆に"](http://tatoeba.org/jpn/sentences/search?query=%22%E9%80%86%E3%81%AB%22&from=jpn).


## More details

The search ignores capitalization and punctuation (unless the punctuation happens to match one of the special characters described elsewhere on the page). 

If you want to find an exact match for a word in a language (like English) that supports stemming, you must precede it with an equals sign, as in *=live*. This may come as a surprise to users who are accustomed to Google Search, where wrapping a word or phrase in double quotes forces an exact match. In Manticore, double quotes have a different function, which only affects multiword (phrase) searches: wrapping a phrase in double quotes requires matching sentences to contain words in the specified continuous sequence. Simply placing a phrase in quotes does not suppress stemming of its individual words. To do that, you will need to place an equals sign before each word in the phrase for which you want to suppress stemming, or directly before the first quotation mark to suppress stemming for each word. If you want to put both an equals sign and a caret before the same word, the equals sign should precede the caret. For instance, to find sentences that begin with the exact word *Noise*, search for *=^noise*, not *^=noise*.

As an example, take the search *like thing*. This will find *like things*, *likely things*, and even *things like*. Adding quotes, as in *"like thing"*, will prevent a match against *things like* (where the words appear in the wrong order), but it will continue to match *like things*, *likely things*, and so on. By contrast, *"=like =thing"* will only match *like thing* (which does not occur in the Tatoeba corpus). Removing the double quotes, *=like =thing*, will match *What made you do a silly thing like that?* Removing one of the equals signs, as in *like =thing*, will find *Such a strange thing is not likely to happen.* 

Note that a star (*) can be placed at the beginning, middle, or end of a string representing a word, but a string beginning with a star must be at least three characters long, or it will be ignored.

## Limit matches to transcriptions or alternative scripts or sentence text

Some languages can be written in different scripts (such as traditional/simplified Chinese, or Latin/Cyrillic Uzbek). Others also have transcriptions (such as Pinyin Chinese or Japanese furigana).

By default, keywords will be searched everywhere: sentence text, alternative script and transcription. This means a sentence might come up in the results just because the transcription is matching.

You can control exactly what is searched by using the @text and @transcription prefixes, respectively targeting the sentence text and what’s under the sentence text.

* To search for Japanese sentences containing かな in the furigana.
  * [@transcription かな](https://tatoeba.org/sentences/search?query=%40text+%E3%81%8B%E3%81%AA&from=jpn&to=und)

* To search for Japanese sentences containing 国 in sentence text and くに in the furigana.
  * [@text 国 @transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)

* To search for Japanese sentences containing 国 in sentence text but NOT くに in the furigana.
  * [@text 国 -@transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+-%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)

* To search for Chinese sentences containing 著 in sentence text and zháo in Pinyin.
  * [@text 著 @transcription zhao3](https://tatoeba.org/sentences/search?query=%40text+%E8%91%97+%40transcription+zhao3&from=cmn&to=und)

## Other search operators

* A **vertical bar** (**|**, representing "or") finds examples where either of the words appears:
  *    *hate | detest* will match sentences with either *hate* or *detest* (or both). 

* If you want to combine an or-expression with other terms, you need to put the or-expression in **parentheses**: 
  *    *(red|blue) house* will match sentences in which the word "house" appears together with either "red" or "blue" (or both) 

* A **dash** (**-**) before a word prevents matches with sentences where the word appears: *like -thing* (or *like !thing*) will match *I like ice cream* but not *I like that red thing*. An exclamation mark (!) works the same way.

* Putting a **caret** (**^**) before a word will match only sentences that begin with that word: *^great* will match *Great people are not always wise.* but not *You are the great love of my life.*

* Putting a **dollar sign** (**$**) after a word will match only sentences that end with that word: *life$* will match *This is the best day of my life.* but not *Life means nothing without friends.*

* If you want to search for sentences that contain nothing other than the specified words, use **double quotes, a caret, and a dollar sign in combination**: *"^i love you$"* will find *I love you.* and *I love you!* but not *I love you more than you love me.* (However, it will find *I loved you.* To prevent this match, use *"^i =love you$"*.)

* The **strict order operator** (**<<**) between two words will find sentences where the first word occurs before the second but not where the second word comes before the first. Thus _dog << cat_ will find examples where _dog_ precedes _cat_, but not vice versa. This operator is also useful for searching for multiple instances of a string. For instance, *these << these << these* will find sentences with three instances of the word "these".

* The **proximity operator**(**~_N_**, where _N_ is a positive number) following a phrase will limit the number of words that can separate the specified words to fewer than _N_. Thus _"you are *ble"~1_ will find *You are irresistible.* but not *You are partially responsible.*

* The **quorum operator** (**/_N_**, where _N_ is a positive number) following a phrase will match sentences that have up to _N_ matching words. Thus _"I wish you would stop"/4_ will find both *I wish you would stop.* and *I wish you would reconsider.*
 
* The **MAYBE** keyword will make the part of the sentence to the right optional. However, sentences that do contain the optional words will be listed first in a default search (that is, one sorted by "Relevance"). For instance, _"perhaps i can MAYBE help"_ will find both *Perhaps I can help.* and *Perhaps I can answer that.*, but *Perhaps I can help.* will be listed first. By contrast, _"perhaps i can MAYBE answer"_ will find the same sentences, but listed in the reverse order. **MAYBE** must be written in all capitals; _"maybe"_ will match the word *maybe*.

See the [Manticore documentation for operators](https://manual.manticoresearch.com/Searching/Full_text_matching/Operators) for more detail.


---
### Related Articles

[Advanced Search](advanced-search)

---

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

version at: 21/09/2023, 23:43

[Go back to Advanced Search at tatoeba.org](https://tatoeba.org/eng/sentences/advanced_search)

#How to Search for Text



## Introduction

Tatoeba provides two ways to search for sentences: 

* the regular search bar at the top of every page
* [advanced search](https://en.wiki.tatoeba.org/articles/show/advanced-search#), which you can reach from the **Advanced search** link above the regular search bar 

### Regular search

For regular search, there are three fields:

* the main field, which selects the word or words that you're looking for
* the **From** field, which selects the language you're looking for matches in
* the **To** field, which limits the search to sentences that have been directly or indirectly translated into the language you choose

#### Main search field

If you leave the main search field empty, it will find all sentences that match the **From** and **To** values that you've chosen. Otherwise, it will search for sentences containing the word or words that you type in. 

The search engine that Tatoeba uses ([Manticore](https://manticoresearch.com/)) is a little different from other search engines that you may have used, such as Google's. Please note the following:
 
(1) Punctuation marks like _?_ and _!_ have special purposes in our search engine (Manticore, previously Sphinx). If you don't want to use those special functions, you should leave them out.

(2) In Arabic, Basque, Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Irish, Italian, Lithuanian, Nepali, Norwegian (Bokmål), Portuguese, Romanian, Russian, Spanish, Swedish, Tamil and Turkish (though not in related languages such as Ancient Greek), a search for a word such as _live_ will also find similar words, such as _lived_ and _living_. If you want to indicate that a word should be matched exactly, you must put an equals sign before it: _=live_

(3) If you are searching for sentences in a language (such as Japanese or Chinese) that does not put spaces between words, be sure to see the section [Languages without word boundaries](https://en.wiki.tatoeba.org/articles/show/text-search#languages-without-word-boundaries) below.

(4) You can use quotation marks to group words into phrases. For instance, _met him_ will find matches where the words _met_ and _him_ will occur anywhere in the sentence, but _"met him"_ will only find matches where the words occur in that order.

(5) For more information, read the section [Examples in English](https://en.wiki.tatoeba.org/articles/show/text-search#examples-in-english).
 
#### From

The "From" field can be set to "Any language", in which case the search will find words in any language. Otherwise, the search will only find words in the language you choose.

#### To

The "To" field can be set to "Any language", in which case it will be ignored. Otherwise, the search will only find sentences that are linked to sentences in the language you choose. They can either be directly linked, in which case they will be shown in black, or indirectly linked, in which case they will be shown in gray. Two sentences are indirectly linked when there is a chain of translations between them but no one has put a link between those two sentences themselves. This means you cannot be sure that the sentences are translations of each other.

## Examples in English

* To find English sentences with "live", "lives", "living" or "lived", search for the word "live". (This will also find sentences with "Live", "Living", etc., since capitalization is ignored.)

  * [live](http://tatoeba.org/eng/sentences/search?query=live+&from=eng&to=und)

* To match a word exactly (ignoring capitalization), put an equals sign (=) before it. 

  * [=live](http://tatoeba.org/eng/sentences/search?query=%3Dlive+&from=eng&to=und)

* Leave punctuation out of your search string. Most punctuation will be ignored, but a final exclamation mark (!) or question mark (?) will actually interfere with the search. (See [Sentences with punctuation marks](https://en.wiki.tatoeba.org/articles/show/text-search#sentences-with-punctuation-marks) for an example.) These symbols have other purposes, as described later on this page.

* Put a $ after a word to find sentences ending with that word. The example finds English sentences ending with "Tom".

  * [Tom$](http://tatoeba.org/eng/sentences/search?query=Tom%24&from=eng&to=und)

* Most punctuation symbols cannot be found via a search. However, $ and _ are special. You can search for sentences containing either of these characters by putting a backslash before the symbol.

  * [\$](https://tatoeba.org/eng/sentences/search?query=%5C%24&from=und&to=und)
  * [\\_](https://tatoeba.org/eng/sentences/search?query=%5C_&from=und&to=und)
  * [\$(1|2|3|4|5|6|7|8|9)](https://tatoeba.org/eng/sentences/search?query=%5C%24%281%7C2%7C3%7C4%7C5%7C6%7C7%7C8%7C9%29&from=eng&to=und)
 finds sentences with a $ followed by a number.

* Put a ^ before a word to find sentences beginning with that word. The example finds English sentences beginning with "Tom".

  * [^Tom](http://tatoeba.org/eng/sentences/search?query=%5ETom&from=eng&to=und)

* This example finds English sentences beginning with "Tom" and ending with "Mary".

  * [^Tom Mary$](http://tatoeba.org/eng/sentences/search?query=%5ETom+Mary%24&from=eng&to=und)

* This example finds English sentences beginning with either "Tom" or "He".

  * [(^Tom|^He)](http://tatoeba.org/eng/sentences/search?query=%28%5ETom%7C%5EHe%29&from=eng&to=und)

* This example finds English sentences including any of the following words: "fasting", "fasted", "fasts".  Using the equals sign means you'll get exact matches, so you will not find the forms "faster" and "fastest".

  * [(=fasting|=fasted|=fasts)](https://tatoeba.org/eng/sentences/search?query=%28%3Dfasting%7C%3Dfasted%7C%3Dfasts%29&from=eng&to=und)

* To search for a phrase, put quotes (") around it. Put an equals sign in front of each word that you want to be matched exactly. Or put an equals sign directly before the quotes to match every word in the quotes.
  * If you want to see phrases like "live in Boston", "living in Boston", or "lives in Boston", use the following search:

      * ["live in boston"](http://tatoeba.org/eng/sentences/search?query=%22live+in+boston%22&from=eng&to=und)

  * The following searches will only find sentences with the exact phrase "live in Boston".

      * ["=live =in =boston"](http://tatoeba.org/eng/sentences/search?query=%22%3Dlive+%3Din+%3Dboston%22&from=eng&to=und)
      * [="live in boston"](http://tatoeba.org/eng/sentences/search?query=%3D%22live+in+boston%22&from=eng&to=und)

  * This search will only find sentences consisting of the exact words "I live in Boston", without any additional words.

      * ["^I =live =in =Boston$"](http://tatoeba.org/eng/sentences/search?query=%22%5EI+%3Dlive+%3Din+%3DBoston%24%22&from=eng&to=und)

* This example finds English sentences that have "Tom", but don't begin with "Tom."

  * [-^Tom Tom](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom&from=eng&to=und)

* This example finds English sentences that have "Tom", but don't begin or end with "Tom."

  * [-^Tom Tom -Tom$](https://tatoeba.org/eng/sentences/search?query=-%5ETom+Tom+-Tom%24&from=eng&to=und)

* The question mark (?) as part of a word is a one-letter wildcard.

    * The following will find sentences with either "whenever" and "wherever."

        * [whe?ever](https://tatoeba.org/eng/sentences/search?query=whe%3Fever&from=und&to=und)

    * The following will find sentences with with 6-letter words that have 2 letters, and then "eve" and then one more letter,  such as "clever" "eleven", "peeves", and "uneven".

        * [??eve?](https://tatoeba.org/eng/sentences/search?query=%3F%3Feve%3F&from=eng&to=und)

* This example finds English sentences that have "Tom", then two words, then "Mary", then one word, and then "John."

  * ["Tom * * Mary * John"](https://tatoeba.org/eng/sentences/search?query=%22Tom+*+*+Mary+*+John%22&from=eng&to=und)

* This example finds English sentences that start with "Tom", then have three words, then end with "Mary".

  * ["^Tom * * * Mary$"](https://tatoeba.org/eng/sentences/search?query=%22%5ETom+*+*+*+Mary%24%22&from=und&to=und)

* This example finds English sentences that have words beginning with "red", including the word "red".

  * [red*](https://tatoeba.org/eng/sentences/search?query=red*&from=eng&to=und)

* This example finds English sentences that have words ending with "red", including the word "red".

  * [*red](https://tatoeba.org/eng/sentences/search?query=*red&from=eng&to=und)

* This example finds English sentences that have words containing the word "red", including the word "red".

  * [\*red\*](https://tatoeba.org/eng/sentences/search?query=*red*&from=eng&to=und)

* This example finds English sentences with the letter sequence "red", but not the word "red."

 * [red\*|\*red|\*red\* -red](https://tatoeba.org/en/sentences/search?from=eng&query=red*%7C*red%7C*red*+-red&sort=random)


* This example finds English sentences that have the word "French", but don't have the word "Tom".

  * [French -Tom](https://tatoeba.org/eng/sentences/search?query=French+-Tom&from=eng&to=und)

* This example will find sentences with "cheek" (in any form: cheeks, etc.) that don't include any of the words preceded by a minus sign (-).

  * [cheek -tear -slap -burn -red -hollow](http://tatoeba.org/eng/sentences/search?query=cheek+-tear+-slap+-burn+-red+-hollow&from=eng&to=und)

* This example finds sentences in which the word "cat" comes before the word "dog."

  * [cat << dog](https://tatoeba.org/eng/sentences/search?query=cat+%3C%3C+dog&from=eng&to=und)

* This example finds sentences that contain at least two of the words "cat", "dog", and "fish" (a "quorum search").

  * ["cat dog fish"/2](https://tatoeba.org/eng/sentences/search?query=%22cat+dog+fish%22/2&from=eng&to=und)


### How to limit sentences to "I can" without getting "I can't".

* This shows just sentences beginning with "I can't."

  * ["^I =can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%27t%22)

* However, this search shows both the "I can" and "I can't" sentences.

  * ["^I =can"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22)

* To just get "I can" sentences, without the "I can't" sentences, use this search. (Note that the quotes are necessary.)

  * ["^I =can" -"can't"](https://tatoeba.org/eng/sentences/search?from=eng&to=und&has_audio=yes&sort=created&query=%22%5EI+%3Dcan%22+-"can%27t")

### How to find English sentences without "the", "a" or "an"

* This search will get an error message, since you must specify at least one word that you want to include, not only words that you want to exclude: 

> -the -a -an

* If you are determined to get as many results as possible, you can search for words that start with any letter of the alphabet, after putting a minus before each word that you do not want (though this query will take a long time):

> [-the -a -an a\*|b\*|c\*|d\*|e\*|f\*|g\*|h\*|i\*|j\*|k\*|l\*|m\*|n\*|o\*|p\*|q\*|r\*|s\*|t\*|u\*|v\*|w\*|x\*|y\*|z\*](https://tatoeba.org/eng/sentences/search?query=-the+-a+-an+a*%7Cb*%7Cc*%7Cd*%7Ce*%7Cf*%7Cg*%7Ch*%7Ci*%7Cj*%7Ck*%7Cl*%7Cm*%7Cn*%7Co*%7Cp*%7Cq*%7Cr*%7Cs*%7Ct*%7Cu*%7Cv*%7Cw*%7Cx*%7Cy*%7Cz*&from=eng)


### How to find sentences with "of" followed by words ending in "ing" without any intervening words

 * [of NEAR/1 \*ing -"\*ing of"](https://tatoeba.org/eng/sentences/search?query=of+NEAR%2F1+*ing+-%22*ing+of%22&from=eng&to=none&user=&orphans=no&unapproved=no&has_audio=&tags=&list=&native=&trans_filter=limit&trans_to=und&trans_link=&trans_user=&trans_orphan=&trans_unapproved=&trans_has_audio=&sort=words&sort_reverse=)

 * Notes
   * The -"ing of" part is necessary to avoid getting results where the -ing word comes before "of."
   * The search results will favor sentences that contain multiple occurrences of *ing. If you don't want this, change the search order.


### How to find sentences with the "I myself" phrase (without any intervening words)

 * [=I NEAR/1 =myself -(myself << I)](https://tatoeba.org/en/sentences/search?from=eng&query=%3DI++NEAR%2F1++%3Dmyself+-%28myself+%3C%3C+I%29)

 * Note
   * The 2nd part of the query with the minus sign (-) is to ignore sentences that have the sequence "myself I". There are a number of such sentences.

### How to find sentences with the "do that", "does that", "did that", "doing that" and "done that".

 * [(=do|=does|=did|=doing|=done) NEAR/1 that -^Does -^Did -"that << does" -"that << did"](https://tatoeba.org/en/sentences/search?from=eng&query= %28%3Ddo%7C%3Ddoes%7C%3Ddid%7C%3Ddoing%7C%3Ddone%29+NEAR%2F1+that+-%5EDoes+-%5EDid+-%22that+%3C%3C+does%22+-%22that+%3C%3C+did%22&sort=random)
 

 * Note
   * In order to limit this to getting sentences with the verb phrase "do that," the last part of the query is included to avoid sentences starting with "Does that" or "Did that" and to avoid the sequences "that does" and "that did."


## Languages without word boundaries

For languages that don't use space characters to separate words, like Japanese, Chinese etc. the search engine interprets each character as a single word. For instance, searching for 逆に will return the same results as 逆 に, which actually matches sentences that only *include* these characters, but not necessarily in that particular order, or not contiguously. So you should surround keywords with quotes, as in this example: 

["逆に"](http://tatoeba.org/jpn/sentences/search?query=%22%E9%80%86%E3%81%AB%22&from=jpn).


## More details

The search ignores capitalization and punctuation (unless the punctuation happens to match one of the special characters described elsewhere on the page). 

If you want to find an exact match for a word in a language (like English) that supports stemming, you must precede it with an equals sign, as in *=live*. This may come as a surprise to users who are accustomed to Google Search, where wrapping a word or phrase in double quotes forces an exact match. In Manticore, double quotes have a different function, which only affects multiword (phrase) searches: wrapping a phrase in double quotes requires matching sentences to contain words in the specified continuous sequence. Simply placing a phrase in quotes does not suppress stemming of its individual words. To do that, you will need to place an equals sign before each word in the phrase for which you want to suppress stemming, or directly before the first quotation mark to suppress stemming for each word. If you want to put both an equals sign and a caret before the same word, the equals sign should precede the caret. For instance, to find sentences that begin with the exact word *Noise*, search for *=^noise*, not *^=noise*.

As an example, take the search *like thing*. This will find *like things*, *likely things*, and even *things like*. Adding quotes, as in *"like thing"*, will prevent a match against *things like* (where the words appear in the wrong order), but it will continue to match *like things*, *likely things*, and so on. By contrast, *"=like =thing"* will only match *like thing* (which does not occur in the Tatoeba corpus). Removing the double quotes, *=like =thing*, will match *What made you do a silly thing like that?* Removing one of the equals signs, as in *like =thing*, will find *Such a strange thing is not likely to happen.* 

Note that a star (*) can be placed at the beginning, middle, or end of a string representing a word, but a string beginning with a star must be at least three characters long, or it will be ignored.

## Limit matches to transcriptions or alternative scripts or sentence text

Some languages can be written in different scripts (such as traditional/simplified Chinese, or Latin/Cyrillic Uzbek). Others also have transcriptions (such as Pinyin Chinese or Japanese furigana).

By default, keywords will be searched everywhere: sentence text, alternative script and transcription. This means a sentence might come up in the results just because the transcription is matching.

You can control exactly what is searched by using the @text and @transcription prefixes, respectively targeting the sentence text and what’s under the sentence text.

* To search for Japanese sentences containing かな in the furigana.
  * [@transcription かな](https://tatoeba.org/sentences/search?query=%40text+%E3%81%8B%E3%81%AA&from=jpn&to=und)

* To search for Japanese sentences containing 国 in sentence text and くに in the furigana.
  * [@text 国 @transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)

* To search for Japanese sentences containing 国 in sentence text but NOT くに in the furigana.
  * [@text 国 -@transcription くに](https://tatoeba.org/sentences/search?query=%40text+%E5%9B%BD+-%40transcription+%E3%81%8F%E3%81%AB&from=jpn&to=und)

* To search for Chinese sentences containing 著 in sentence text and zháo in Pinyin.
  * [@text 著 @transcription zhao3](https://tatoeba.org/sentences/search?query=%40text+%E8%91%97+%40transcription+zhao3&from=cmn&to=und)

## Other search operators

* A **vertical bar** (**|**, representing "or") finds examples where either of the words appears:
  *    *hate | detest* will match sentences with either *hate* or *detest* (or both). 

* If you want to combine an or-expression with other terms, you need to put the or-expression in **parentheses**: 
  *    *(red|blue) house* will match sentences in which the word "house" appears together with either "red" or "blue" (or both) 

* A **dash** (**-**) before a word prevents matches with sentences where the word appears: *like -thing* (or *like !thing*) will match *I like ice cream* but not *I like that red thing*. An exclamation mark (!) works the same way.

* Putting a **caret** (**^**) before a word will match only sentences that begin with that word: *^great* will match *Great people are not always wise.* but not *You are the great love of my life.*

* Putting a **dollar sign** (**$**) after a word will match only sentences that end with that word: *life$* will match *This is the best day of my life.* but not *Life means nothing without friends.*

* If you want to search for sentences that contain nothing other than the specified words, use **double quotes, a caret, and a dollar sign in combination**: *"^i love you$"* will find *I love you.* and *I love you!* but not *I love you more than you love me.* (However, it will find *I loved you.* To prevent this match, use *"^i =love you$"*.)

* The **strict order operator** (**<<**) between two words will find sentences where the first word occurs before the second but not where the second word comes before the first. Thus _dog << cat_ will find examples where _dog_ precedes _cat_, but not vice versa. This operator is also useful for searching for multiple instances of a string. For instance, *these << these << these* will find sentences with three instances of the word "these".

* The **proximity operator**(**~_N_**, where _N_ is a positive number) following a phrase will limit the number of words that can separate the specified words to fewer than _N_. Thus _"you are *ble"~1_ will find *You are irresistible.* but not *You are partially responsible.*

* The **quorum operator** (**/_N_**, where _N_ is a positive number) following a phrase will match sentences that have up to _N_ matching words. Thus _"I wish you would stop"/4_ will find both *I wish you would stop.* and *I wish you would reconsider.*
 
* The **MAYBE** keyword will make the part of the sentence to the right optional. However, sentences that do contain the optional words will be listed first in a default search (that is, one sorted by "Relevance"). For instance, _"perhaps i can MAYBE help"_ will find both *Perhaps I can help.* and *Perhaps I can answer that.*, but *Perhaps I can help.* will be listed first. By contrast, _"perhaps i can MAYBE answer"_ will find the same sentences, but listed in the reverse order. **MAYBE** must be written in all capitals; _"maybe"_ will match the word *maybe*.

See the [Manticore documentation for operators](https://manual.manticoresearch.com/Searching/Full_text_matching/Operators) for more detail.


---
### Related Articles

[Advanced Search](advanced-search)

---

<a href="http://tatoeba.org/">Go back to tatoeba.org</a>

Note

The lines in green are the lines that have been added in the new version. The lines in red are those that have been removed.