[Doc/Faq] Webalizer 설치 및 분석
Date : 2002-03-14 21:05 | Hit : 20,004
개요 : What is your Web Server doing today ?
The Webalizer 는 무료로 제공되는 웹 분석 프로그램이다,
그것은 상당히 자세하고도 표준 웹 브라우저를 통해 Html 형태로 쉽게 레포팅 해준다.
밑 부분은 저자의 말이네요.... 히힛.~~
It was written to solve several problems that I had with currently available analysis packages.
A vast majority of them were written in Perl or some other scripting language, and took forever to run. Some were not free.
Some even produced wrong results, or results that were not in a format I found very useful.
In order to get the stats I wanted, in a format that I liked, I wrote The Webalizer, and have made it available here, to anyone who wants it, for any purpose.
Starting as a simple Perl script with limited capabilities, it has grown into a full featured, robust and fast analysis tool, being used by thousands of systems around the globe.
News: Version 2.01 now available! (V2.01-06 released 17-Oct-2000).
This is a major upgrade that adds many of the most requested features I've received since the release of 1.30, including support for PNG images, Squid proxy logs, Gzip compressed logs, reverse DNS lookups, display of all objects on a seperate HTML pages, user defined search engines for search string analysis, the ability to export the data to other programs and a lot more. See the CHANGES file for a complete list.
이제 함 설치편을 보겠습니다.
* 설치 편
1. GD graphics
[[email protected] gd]# vi gd.c
[[email protected] gd]# vi gd_jpeg.c
[[email protected] gd]# vi gd_png.c
[[email protected] gd]# vi Makefile
INCLUDEDIRS=-I. -I/usr/local/include/freetype2 -I/usr/local/include
[[email protected] gd]# make && make install
2. zlib GD graphics library 구하기
[[email protected] zlib] ./configure -s && make ----------> libz.a 를 만든다.
[[email protected] zlib] ./configure && make test && make install
[[email protected] zlib] cp libz.so* /usr/local/lib
[[email protected] zlib] cd /usr/local/lib; rm -rf libz.so; rm -rf libz.so.1
[[email protected] zlib] ln -s libz.so.1.1.4 libz.so; ln -s libz.so.1.1.4 libz.so.1
3. libpng graphics
[[email protected] libpng]# make -f scripts/makefile.std CC=gcc ZLIBLIB=../zlib ZLIBINC=../zlib
4. Webalizer 구하기
Version 1.7.3 을 다운 받습니다../configure --help 하면 configure의 자세한 사항을 볼 수 있습니다.
[[email protected] webalizer] rm -f webalizer_lang.h
[[email protected] webalizer] ln -s lang/webalizer_lang.korean webalizer_lang.h
[[email protected] webalizer] ./configure --with-language=korean --with-gdlib=../gd --with-gd=../gd --with-zlib=../zlib --with-png=../libpng
[[email protected] webalizer] make && make install
위와 같은 곳으로 install되시는 것을 보실 수 있을 것입니다.
5. Webalizer 실행하기
[[email protected] webalizer] mkdir conf
[[email protected] webalizer] webalizer -c rootman.co.kr
webalizer -h 로 자세한 정보 참고하세요~
6. 환경 설정하기
webalizer -c mysite.conf 라는 명령어를 주고 실행하여 보십시오.c의 옵션은 아래와 같습니다.
-c file This option specifies a configuration file to use.
Configuration files allow greater control over how The Webalizer behaves, and there are several ways to use them.
As of version 0.98, The Webalizer searches for a default configuration file in the current directory named
"webalizer.conf", and if not found, will search in the /etc/ directory for a file of the same name.
In addition, you may specify a configuration file to use with this command line option.
환경 설정 파일에서 바꾸어야 할 부분들.
Usage Statistics for
이런 식으로 사용하시면 됩니다.
잠시 레포팅 되는 명칭에 대해 살펴 보겠습니다.
Any request made to the server which is logged, is considered a 'hit'.
The requests can be for anything... html pages, graphic images, audio files, CGI scripts, etc...
Each valid line in the server log is counted as a hit.
This number represents the total number of requests that were made to the server during the specified report period.
찬호역) HIt 수란. 말그대로 홈페이지에 접속하고 부터의 모든 엑세스를 뜻한다고 하군요, html 페이지, 그래픽이미지, 오디오 파일, cgi 스크립트 등등입니다.
Some requests made to the server, require that the server then send something back to the requesting client, such as a html page or graphic image.
When this happens, it is considered a 'file' and the files total is incremented.
The relationship between 'hits' and 'files' can be thought of as 'incoming requests' and 'outgoing responses'.
Pages are, well, pages! Generally, any HTML document, or anything that generates an HTML document, would be considered a page.
This does not include the other stuff that goes into a document, such as graphic images, audio clips, etc...
This number represents the number of 'pages' requested only, and does not include the other 'stuff' that is in the page.
What actually constitutes a 'page' can vary from server to server.
The default action is to treat anything with the extension '.htm','.html' or '.cgi' as a page.
A lot of sites willprobably define other extensions, such as '.phtml', '.php3' and '.pl' as pages as well.
Some people consider this number as the number of 'pure' hits...
I'm not sure if I totally agree with that viewpoint.
Some other programs (and people :) refer to this as 'Pageviews'.
찬호역) 참 좋은 말 같네염.하핫. 순수한 웹 페이지를 뜻한다고 나오네욤..
htm, html, cgi, php, php3, php4,phtml 등등 의 화면 View 수라합니다.
Each request made to the server comes from a unique 'site', which can be referenced by a name or ultimately, an IP address.
The 'sites'number shows how many unique IP addresses made requests to the server during the reporting time period.
This DOES NOT mean the number of unique individual users (real people) that visited, which is impossible to determine using just logs and the HTTP protocol
(however, this number might be about as close as you will get).
5. 설정예 샘플(rootman.co.kr)
# Your own site should be hidden
# Your own site gives most referrals
# This one hides non-referrers ("-" Direct requests)
#HideReferrer Direct Request
# Usually you want to hide these
# Hiding agents is kind of futile
# You can also hide based on authenticated username
# Grouping options
#GroupURL /cgi-bin/* CGI scripts
#GroupURL /images/* Images
#GroupReferrer yahoo.com/ Yahoo!
#GroupReferrer excite.com/ Excite
#GroupReferrer infoseek.com/ InfoSeek
#GroupReferrer webcrawler.com/ WebCrawler
#GroupUser root Admin users
#GroupUser admin Admin users
#GroupUser wheel Admin users
# The following is a great way to get an overall total
# for browsers, and not display all the detail records.
# (You should use MangleAgent to refine further...)
#GroupAgent MSIE Micro$oft Internet Exploder
#GroupAgent Mozilla Netscape
#GroupAgent Lynx* Lynx
# Example: Only show stats on Joe User's pages...
# Or based on an authenticated username
SearchEngine yahoo.com p=
SearchEngine altavista.com q=
SearchEngine google.com q=
SearchEngine eureka.com q=
SearchEngine lycos.com query=
SearchEngine hotbot.com MT=
SearchEngine msn.com MT=
SearchEngine infoseek.com qt=
SearchEngine webcrawler searchText=
SearchEngine excite search=
SearchEngine netscape.com search=
SearchEngine mamma.com query=
SearchEngine alltheweb.com query=
SearchEngine northernlight.com qr=
# The Dump* keywords allow the dumping of Sites, URL's, Referrers
# User Agents, Usernames and Search strings to seperate tab delimited
# text files, suitable for import into most database or spreadsheet
# DumpPath specifies the path to dump the files. If not specified,
# it will default to the current output directory. Do not use a
# trailing slash ('/').
# These control the dumping of each individual table. The value
# can be either 'yes' or 'no'.. the default is 'no'.
# End of configuration file... Have a nice day!";