Full-Text Stopwords
941
+----+-------------------+------------------------------------------+
| id | title | body |
+----+-------------------+------------------------------------------+
| 5 | MySQL vs. YourSQL | In the following database comparison ... |
| 1 | MySQL Tutorial | DBMS stands for DataBase ... |
+----+-------------------+------------------------------------------+
2 rows in set (0.00 sec)
mysql>
SELECT * FROM articles
->
WHERE MATCH (title,body)
->
AGAINST ('database' WITH QUERY EXPANSION);
+----+-------------------+------------------------------------------+
| id | title | body |
+----+-------------------+------------------------------------------+
| 1 | MySQL Tutorial | DBMS stands for DataBase ... |
| 5 | MySQL vs. YourSQL | In the following database comparison ... |
| 3 | Optimizing MySQL | In this tutorial we will show ... |
+----+-------------------+------------------------------------------+
3 rows in set (0.00 sec)
Another example could be searching for books by Georges Simenon about Maigret, when a user is not
sure how to spell “Maigret”. A search for “Megre and the reluctant witnesses” finds only “Maigret and
the Reluctant Witnesses” without query expansion. A search with query expansion finds all books with
the word “Maigret” on the second pass.
Note
Because blind query expansion tends to increase noise significantly by returning
nonrelevant documents, it is meaningful to use only when a search phrase is
rather short.
12.9.4. Full-Text Stopwords
The stopword list is loaded and searched for full-text queries using the server character set and
collation (the values of the
character_set_server
and
collation_server
system variables).
False hits or misses may occur for stopword lookups if the stopword file or columns used for full-text
indexing or searches have a character set or collation different from
character_set_server
or
collation_server
.
Case sensitivity of stopword lookups depends on the server collation. For example, lookups are case
insensitive if the collation is
latin1_swedish_ci
, whereas lookups are case sensitive if the collation
is
latin1_general_cs
or
latin1_bin
.
The following table shows the default list of full-text stopwords. In a MySQL source distribution, you can
find this list in the
myisam/ft_static.c
file.
a's
able
about
above
according
accordingly
across
actually
after
afterwards
again
against
ain't
all
allow
allows
almost
alone
along
already
also
although
always
am
among
amongst
an
and
another
any
anybody
anyhow
anyone
anything
anyway
anyways
anywhere
apart
appear
appreciate
appropriate
are
aren't
around
as
aside
ask
asking
associated
at
available
away
awfully
be
became
because
become
becomes
becoming
been
before
beforehand
behind
being
believe
Summary of Contents for 5.0
Page 1: ...MySQL 5 0 Reference Manual ...
Page 18: ...xviii ...
Page 60: ...40 ...
Page 396: ...376 ...
Page 578: ...558 ...
Page 636: ...616 ...
Page 844: ...824 ...
Page 1234: ...1214 ...
Page 1427: ...MySQL Proxy Scripting 1407 ...
Page 1734: ...1714 ...
Page 1752: ...1732 ...
Page 1783: ...Configuring Connector ODBC 1763 ...
Page 1793: ...Connector ODBC Examples 1773 ...
Page 1839: ...Connector Net Installation 1819 2 You must choose the type of installation to perform ...
Page 2850: ...2830 ...
Page 2854: ...2834 ...
Page 2928: ...2908 ...
Page 3000: ...2980 ...
Page 3122: ...3102 ...
Page 3126: ...3106 ...
Page 3174: ...3154 ...
Page 3232: ...3212 ...