ES.NET 2004, a major upgrade to Enterprise Search v1.0, adds many new features and changes. Some of the most significant new features include:
- Windows Service: ES.NET runs as a Windows Service, which allows the software to crawl and index without requiring a user to be logged into the machine. All configuration is done through your Web browser.
- True Collection Support: ES.NET provides full support for collections. Collections allow you to create searchable indexes of groups of documents. Unlike in ES v1.0, a document may appear in multiple collections.
- Multi-Threading & Load Balancing: Threading in ES.NET is greatly improved over ES v1.0. ES.NET automatically determines the maximum safe number of threads your system can handle and distributes them evenly among active collections. This allows you to run collections concurrently rather than one collection at a time.
- Improved Logging: Each collection maintains its own log file, allowing you to easily track the crawling history for each collection. Log files are formatted for easy import into applications such as Access or Excel.
- Email Notifications: You can now be notified via email when a severe error occurs. At this time, email notifications are limited to database server connection issues.
- Robots.txt Caching: ES.NET obeys the robots.txt standard. Rather than having each thread re-request a site's robots.txt, ES.NET caches it. The cached copy is kept for 24 hours and shared among all collections.
- Abuse Prevention: ES.NET adds an optional feature that prevents the spider from requesting documents from the same domain more than once every 20 seconds. This avoids hammering Web sites, which can cause your IP address to be banned.
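The robots.txt caching and abuse prevention behavior described above can be illustrated with a short sketch. This is not ES.NET's code, which is not shown here; it is a minimal Python illustration that assumes a 24-hour cache lifetime and a 20-second per-domain interval as stated in the feature list. The names `RobotsCache`, `DomainThrottle`, and the `fetch` callable are hypothetical.

```python
import threading
import time
from urllib.robotparser import RobotFileParser

ROBOTS_TTL_SECONDS = 24 * 60 * 60  # cached robots.txt lifetime, per the feature list
MIN_DOMAIN_INTERVAL = 20.0         # minimum seconds between requests to one domain


class RobotsCache:
    """Thread-safe robots.txt cache shared by every crawler thread (hypothetical)."""

    def __init__(self, fetch, clock=time.monotonic):
        self._fetch = fetch      # assumed callable: domain -> robots.txt text
        self._clock = clock      # injectable clock, so expiry is easy to test
        self._lock = threading.Lock()
        self._entries = {}       # domain -> (expires_at, RobotFileParser)

    def allowed(self, domain, url, agent="*"):
        # For simplicity the lock is held while fetching; a real crawler
        # would likely fetch outside the lock to avoid stalling other threads.
        with self._lock:
            now = self._clock()
            entry = self._entries.get(domain)
            if entry is None or entry[0] <= now:
                parser = RobotFileParser()
                parser.parse(self._fetch(domain).splitlines())
                entry = (now + ROBOTS_TTL_SECONDS, parser)
                self._entries[domain] = entry
            return entry[1].can_fetch(agent, url)


class DomainThrottle:
    """Permits at most one request per domain every MIN_DOMAIN_INTERVAL seconds."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._lock = threading.Lock()
        self._next_allowed = {}  # domain -> earliest time of the next request

    def try_acquire(self, domain):
        with self._lock:
            now = self._clock()
            if now < self._next_allowed.get(domain, float("-inf")):
                return False     # too soon; the caller should requeue the URL
            self._next_allowed[domain] = now + MIN_DOMAIN_INTERVAL
            return True
```

A crawler thread would call `allowed(...)` before queuing a URL and `try_acquire(...)` just before requesting it. Because both objects are shared and guarded by a lock, every thread in every collection sees the same cached rules and the same per-domain timing.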