Indexing Your Documents
256
Netscape Enterprise Server Administrator’s Guide • November 2001
For example, a document could have these lines of HTML code:
<META NAME="Writer" CONTENT="R. Hunter">
<META NAME="Song" CONTENT="Stella Blue">
If this document was indexed with its META tags extracted, you could search it for
specific values in the writer or product fields. For example, you could enter this
query:
Writer <contains> Hunter
or
Song <contains> Blue
.
Creating a New Collection
You can only have twelve collections on your server. To use a thirteenth collection,
you must first remove an existing collections using Search /Maintain Collection.
You can only have entries for a maximum of 16 million documents in your
collections. A document that is indexed in multiple collections counts as multiple
documents. It is best to create new collections of over 10,000 documents at
low-traffic times, or the indexing operation may affect your system’s performance.
You can create a collection that indexes the content of all or some of the files in a
directory. You can define collections that contain only one kind of file, or you can
create a collection of documents in various formats that are automatically
converted to HTML during indexing. When you define a multiple format collection
with the auto-convert option the indexer first converts the documents into HTML,
and then indexes their contents. The converted HTML documents are put into the
html_doc
directory in the server’s search collections folder.
The file format you choose defines which default attributes are used in the
collection, and whether automatic HTML conversion of the content is needed
during indexing. For information about the attributes for each format, see , and
“About Collection Attributes” on page 254.
Regardless of the file type chosen, the content of the file is always indexed. If you
choose HTML as the file type, the server creates the collection with the HTML
default attributes, and does not attempt to convert any non-HTML files you try to
index. If you index HTML files into an ASCII collection, even the HTML markup
tags are indexed as part of the file’s contents, and the contents are displayed as raw
text.
NOTE
Attribute values in META-tagged fields are text strings only, which means
that all numeric values, such as date and time, are sorted as text. Any
illegal HTML characters in a META-tagged attribute are replaced with a
hyphen.
Summary of Contents for NETSCAPE ENTREPRISE SERVER 6.0 - ADMINISTRATOR
Page 1: ...Administrator s Guide Netscape Enterprise Server Version6 0 November 2001...
Page 18: ...18 Netscape Enterprise Server Administrator s Guide November 2001...
Page 26: ...26 Netscape Enterprise Server Administrator s Guide November 2001...
Page 48: ...Migrating a Server 48 Netscape Enterprise Server Administrator s Guide November 2001...
Page 50: ...50 Netscape Enterprise Server Administrator s Guide November 2001...
Page 146: ...146 Netscape Enterprise Server Administrator s Guide November 2001...
Page 242: ...242 Netscape Enterprise Server Administrator s Guide November 2001...
Page 294: ...294 Netscape Enterprise Server Administrator s Guide November 2001...
Page 332: ...Deleting a Virtual Server 332 Netscape Enterprise Server Administrator s Guide November 2001...
Page 378: ...378 Netscape Enterprise Server Administrator s Guide November 2001...
Page 396: ...Responses 396 Netscape Enterprise Server Administrator s Guide November 2001...
Page 414: ...Posting to JSPs 414 Netscape Enterprise Server Administrator s Guide November 2001...
Page 432: ...Further Information 432 Netscape Enterprise Server Administrator s Guide November 2001...
Page 444: ...444 Netscape Enterprise Server Administrator s Guide November 2001...