From: <Saved by Microsoft Internet Explorer 5>
Subject: Webalizer Quick Help
Date: Fri, 13 Apr 2007 01:19:03 -0500
MIME-Version: 1.0
Content-Type: text/html;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Content-Location: http://www.mrunix.net/webalizer/webalizer_help.html
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3028

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE>Webalizer Quick Help</TITLE>
<META http-equiv=3DContent-Type content=3D"text/html; =
charset=3Diso-8859-1">
<META content=3D"MSHTML 6.00.2900.3059" name=3DGENERATOR></HEAD>
<BODY text=3Dblack vLink=3Dblue link=3Dblue bgColor=3Dwhite><FONT =
size=3D+2><B>Webalizer=20
Quick Help</B></FONT><BR>
<UL>
  <LI><B>Main Headings</B>
  <P><A name=3DHITS><FONT color=3Dgreen><B>Hits</B></FONT> represent the =
total=20
  number of requests made to the server during the given time period =
(month,=20
  day, hour etc..).
  <P><A name=3DFILES><FONT color=3Dblue><B>Files</B></FONT> represent =
the total=20
  number of hits (requests) that actually resulted in something being =
sent back=20
  to the user. Not all hits will send data, such as 404-Not Found =
requests and=20
  requests for pages that are already in the browsers cache.
  <P><B>Tip:</B> By looking at the difference between hits and files, =
you can=20
  get a rough indication of repeat visitors, as the greater the =
difference=20
  between the two, the more people are requesting pages they already =
have cached=20
  (have viewed already).
  <P><A name=3DSITES><FONT color=3Dorange><B>Sites</B></FONT> is the =
number of=20
  unique IP addresses/hostnames that made requests to the server. Care =
should be=20
  taken when using this metric for anything other than that. Many users =
can=20
  appear to come from a single site, and they can also appear to come =
from many=20
  ip addresses so it should be used simply as a rough guage as to the =
number of=20
  visitors to your server.
  <P><A name=3DVISITS><FONT color=3D#e04000><B>Visits</B></FONT> occur =
when some=20
  remote site makes a request for a <I>page</I> on your server for the =
first=20
  time. As long as the same site keeps making requests within a given =
timeout=20
  period, they will all be considered part of the same <B>Visit</B>. If =
the site=20
  makes a request to your server, and the length of time since the last =
request=20
  is greater than the specified timeout period (<I>default is 30 =
minutes</I>), a=20
  new <B>Visit</B> is started and counted, and the sequence repeats. =
Since only=20
  <I>pages</I> will trigger a visit, remotes sites that link to graphic =
and=20
  other non- page URLs will not be counted in the visit totals, reducing =
the=20
  number of <I>false</I> visits.
  <P><A name=3DPAGES><FONT color=3Dcyan><B>Pages</B></FONT> are those =
URLs that=20
  would be considered the actual page being requested, and not all of =
the=20
  individual items that make it up (such as graphics and audio clips). =
Some=20
  people call this metric <I>page views</I> or <I>page impressions</I>, =
and=20
  defaults to any URL that has an extension of <B>.htm</B>, <B>.html</B> =
or=20
  <B>.cgi</B>.
  <P><A name=3DKBYTES>A <FONT color=3Dred><B>KByte</B></FONT> (KB) is =
1024 bytes (1=20
  Kilobyte). Used to show the amount of data that was transfered between =
the=20
  server and the remote machine, based on the data found in the server =
log.
  <P><BR></P>
  <LI><B>Common Definitions</B>
  <P><A name=3DSITE>A <B>Site</B> is a remote machine that makes =
requests to your=20
  server, and is based on the remote machines IP Address/Hostname.
  <P><A name=3DURL><B>URL</B> - Uniform Resource Locator. All requests =
made to a=20
  web server need to request <I>something</I>. A URL is that =
<I>something</I>,=20
  and represents an object somewhere on your server, that is accessable =
to the=20
  remote user, or results in an error (ie: 404 - Not found). URLs can be =
of any=20
  type (HTML, Audio, Graphics, etc...).
  <P><A name=3DREFERRER><B>Referrers</B> are those URLs that lead a user =
to your=20
  site or caused the browser to request something from your server. The =
vast=20
  majority of requests are made from your own URLs, since most HTML =
pages=20
  contain links to other objects such as graphics files. If one of your =
HTML=20
  pages contains links to 10 graphic images, then each request for the =
HTML page=20
  will produce 10 more hits with the referrer specified as the URL of =
your own=20
  HTML page.
  <P><A name=3DSSTRING><B>Search Strings</B> are obtained from examining =
the=20
  referrer string and looking for known patterns from various search =
engines.=20
  The search engines and the patterns to look for can be specified by =
the user=20
  within a configuration file. The default will catch most of the major =
ones.
  <P><B>Note:</B> Only available if that information is contained in the =
server=20
  logs.
  <P><A name=3DUAGENT><B>User Agents</B> are a fancy name for =
<I>browsers</I>.=20
  Netscape, Opera, Konqueror, etc.. are all <B>User Agents</B>, and each =
reports=20
  itself in a unique way to your server. Keep in mind however, that many =

  <I>browsers</I> allow the user to change it's reported name, so you =
might see=20
  some obvious fake names in the listing.
  <P><B>Note:</B> Only available if that information is contained in the =
server=20
  logs.
  <P><A name=3DENTRYEXIT><B>Entry/Exit</B> pages are those pages that =
were the=20
  first requested in a visit (<B>Entry</B>), and the last requested=20
  (<B>Exit</B>). These pages are calculated using the <B>Visits</B> =
logic above.=20
  When a visit is first triggered, the requested page is counted as an=20
  <B>Entry</B> page, and whatever the last requested URL was, is counted =
as an=20
  <B>Exit</B> page.
  <P><A name=3DCOUNTRIES><B>Countries</B> are determined based on the =
<I>top level=20
  domain</I> of the requesting site. This is somewhat questionable =
however, as=20
  there is no longer strong enforcement of domains as there was in the =
past. A=20
  .COM domain may reside in the US, or somewhere else. An .IL domain may =

  actually be in Isreal, however it may also be located in the US or =
elsewhere.=20
  The most common domains seen are .COM (US Commercial), .NET (Network), =
.ORG=20
  (Non-profit Organization) and .EDU (Educational). A large percentage =
may also=20
  be shown as <I>Unresolved/Unknown</I>, as a fairly large percentage of =
dialup=20
  and other customer access points do not resolve to a name and are left =
as an=20
  IP address.
  <P><A name=3DRCODES><B>Response Codes</B> are defined as part of the =
HTTP/1.1=20
  protocol (<A =
href=3D"http://andrew2.andrew.cmu.edu/rfc/rfc2068.html">RFC=20
  2068</A>; See Chapter 10). These codes are generated by the web server =
and=20
  indicate the completion status of each request made to it.
  <P></P></LI></UL></BODY></HTML>

