|
|
 |
Crawling:
-
Runs
as a
Windows
Service;
Controlled
through
your
Web
browser
-
Supports
an
unlimited
number
of
collections
-
Collections
can
be
crawled
and
indexed
concurrently
-
Automated
indexing/updating
using
built-in
scheduler
-
Supports
IFilters
providing
support
for
DOC,
PDF,
XLS,
RTF,
and
more
-
Supports
cookies,
sessions,
and
redirect
pages
-
Proxy
server
supported
-
Optionally
caches
documents
similar
to
Google
-
Finds
new
documents
to
index
-
New
documents
are
indexed
and
searchable
immediately*
-
Follows
and
indexes
dynamic
Web
pages
-
Keyword
rules
allow
documents
to
be
collected
based
on
content
-
Domain
rules
give
you
100%
control
over
where
the
spider
crawls
-
Definable
maximum
directory
depth
to
control
how
deep
into
a
site
the
spider
crawls
-
Definable
maximum
document
size
-
Definable
file
extensions
(ie.,
.HTM,
.PDF,
etc.)
to
allow
-
Definable
domain
extensions
(ie.,
.COM,
.ORG,
etc.)
to
allow
-
Logs
crawler
activity
per
collection
-
Email
notifications
when
unexpected
error
occurs
(ie.,
SQL
Server
connection
lost)
-
User
defined
tags
using
RegEx
to
extract
and
search
specific
content
within
documents
-
Optionally,
indexes
ALT
and
Comment
tags
-
Configurable
via
your
Web
browser
from
any
location*
-
Can
be
set
to
continously
crawl
and
update
indexed
documents
-
Can
be
set
to
sleep
when
it
has
no
more
URLs
to
visit
instead
of
stopping.
When
a
new
URLs
is
added,
it
will
automatically
resume.
-
Fully
supports
Robots
Exclusion
*When
Change
Tracking
w/
Update
in
Background
is
enabled.
**When
Web
server
is
reachable
from the
Internet
Searching:
-
Fast,
relevant
search
results
-
Search
results
can
be
sorted
by
Relevance,
URL,
or
Date
-
Search
supports
advanced
search
syntax
(AND,
OR,
NOT,
NEAR,
and
Phrases)
-
Complete
control
over
the
look
and
feel
of
search
results
using
HTML,
.NET,
and
SQL
-
Detailed
search
logging,
including
search
time,
results
found,
and
more.
-
Caches
search
results
to
provide
instant
results
for
repeat
searches
|
|