174
Chapter 8 Verity Spider
You cannot use the question mark ( ? ) wildcard, and the -regexp option does not
allow you to use regular expressions.
Use -indmimeexclude to allow the Verity Spider to follow documents, without
indexing them, to gain access to other desirable document types.
-mimeinclude
Syntax: -mimeinclude mime_1 [mime_n] ...
Specifies MIME types to be included.
On Windows NT, you should include double quotes around the argument to protect
the special characters such as (*). On UNIX, you should use single quotes. Note that
this is only required when you run the indexing job from a command line. Quotes are
not necessary within a command file (-cmdfile).
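As a rough illustration of the UNIX quoting rule above, the following sketch builds a command line in which any argument containing special characters is wrapped in single quotes. The executable name vspider is an assumption made for the example; only the -mimeinclude option and the text/* pattern come from this section.

```python
import shlex

# Assumption: "vspider" stands in for the Verity Spider executable name.
SPIDER_EXE = "vspider"

# The option and its argument as they should reach Verity Spider, unquoted.
args = [SPIDER_EXE, "-mimeinclude", "text/*"]

# shlex.quote wraps an argument in single quotes when it contains characters
# that a UNIX shell would otherwise interpret, such as the asterisk.
unix_command = " ".join(shlex.quote(arg) for arg in args)

print(unix_command)
```

Only the pattern is quoted, because it is the only argument containing a character the shell would expand.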
The default is to include all MIME types. For the mime variable, you can include the
asterisk ( * ) wildcard for text strings. For example:
'text/*'
You cannot use the question mark ( ? ) wildcard, and the -regexp option does not
allow you to use regular expressions.
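The wildcard rule for MIME types can be modeled with a short sketch: the asterisk matches any text string, while every other character, including the question mark, is taken literally. The function name mime_matches is hypothetical, and case-sensitive matching is an assumption; this illustrates the rule as described here, not how Verity Spider implements it.

```python
import re


def mime_matches(pattern: str, mime_type: str) -> bool:
    """Return True if mime_type matches pattern, where only '*' is a wildcard."""
    # Escape every character so that '?' and '.' are literal, then turn the
    # escaped asterisk back into "any text string".
    regex = re.escape(pattern).replace(r"\*", ".*")
    # Assumption: the whole MIME type must match, and matching is case-sensitive.
    return re.fullmatch(regex, mime_type) is not None


print(mime_matches("text/*", "text/html"))       # asterisk covers "html"
print(mime_matches("text/*", "image/gif"))       # different top-level type
print(mime_matches("text/?lain", "text/plain"))  # '?' is not a wildcard here
```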
-mindocsize
Syntax: -mindocsize integer
Specifies the minimum size, in kilobytes, for documents to be indexed. Any
document smaller than the value specified by -mindocsize is ignored.
The default is to index documents of any size.
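The size rule can be sketched as a simple comparison. The function name passes_min_size is hypothetical, and the conversion of 1 kilobyte to 1024 bytes is an assumption, since the text does not define the unit further.

```python
from typing import Optional

# Assumption: one kilobyte is 1024 bytes; the text does not say.
BYTES_PER_KB = 1024


def passes_min_size(size_bytes: int, min_kb: Optional[int]) -> bool:
    """Return True if a document of size_bytes would be indexed."""
    # With no -mindocsize value, documents of any size are indexed.
    if min_kb is None:
        return True
    # Documents smaller than the minimum are ignored.
    return size_bytes >= min_kb * BYTES_PER_KB


print(passes_min_size(500, None))  # no minimum set
print(passes_min_size(500, 1))     # smaller than 1 KB
print(passes_min_size(2048, 2))    # exactly 2 KB
```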
-skip
Syntax: -skip HTML_tag "exp"
Type: Web crawling only
Specifies that Verity Spider should not index any HTML document that contains the
text of exp within the given HTML_tag. For multiple HTML_tag and exp combinations,
use multiple instances of the -skip option.
You can use wildcard expressions, where the asterisk ( * ) is for text strings and the
question mark ( ? ) is for single characters. For example:
'/my_doc*/year199?'
On Windows NT, you should include double quotes around the argument to protect
the special characters such as (*). On UNIX, you should use single quotes. Note that
this is only required when you run the indexing job from a command line. Quotes are
not necessary within a command file (-cmdfile).
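The -skip rule can be approximated as follows: collect the text inside the named tag, then test whether it contains the expression, with the asterisk standing for any text string and the question mark for a single character. The names wildcard_to_regex, TagTextCollector, and should_skip are hypothetical, and passing the tag as a bare name such as title is an assumption. This is a minimal sketch of the described behavior, not Verity Spider's own logic.

```python
import re
from html.parser import HTMLParser


def wildcard_to_regex(exp: str) -> "re.Pattern[str]":
    """Translate an expression using '*' and '?' into a compiled regex."""
    parts = []
    for ch in exp:
        if ch == "*":
            parts.append(".*")   # any text string
        elif ch == "?":
            parts.append(".")    # exactly one character
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)


class TagTextCollector(HTMLParser):
    """Collect the text found inside every occurrence of one tag."""

    def __init__(self, tag: str) -> None:
        super().__init__()
        self.tag = tag.lower()   # HTMLParser reports tag names in lowercase
        self.depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1
            self.chunks.append("")

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks[-1] += data


def should_skip(html: str, tag: str, exp: str) -> bool:
    """Return True if the text inside tag contains a match for exp."""
    collector = TagTextCollector(tag)
    collector.feed(html)
    collector.close()
    pattern = wildcard_to_regex(exp)
    return any(pattern.search(text) for text in collector.chunks)


page = "<html><head><title>Internal draft 1998</title></head><body><p>Hello</p></body></html>"
print(should_skip(page, "title", "draft 199?"))
print(should_skip(page, "title", "Hello"))
```

Several tag and expression pairs, like multiple instances of -skip, could be modeled by calling should_skip once per pair and skipping the document if any call returns True.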