Monthly Archives: December 2005

Analyzing Web Server Logs

Log files are not fun to look at. They are ugly, contain too much information, and often lead to massive headaches. Fortunately, these beasts can be tamed for more capacities than just debugging; they can used to generate wonderful reports that make sense. A number of programs are out there to analyze Web server logs, and this article will cast the spotlight in their direction.
 
Before evaluating the software packages, determine the type of data you wish to see. Although most of the software we looked at supports more than just Web server logs, this article discusses only Web server output. Log analysis programs can show everything from a list of IP addresses connected to the Web server to a pie chart detailing which files were accessed most often. The majority of popular Web log analysis tools try to make sense of every piece of data in the logs, but few succeed in making the data readable.
 
Some log file analysis packages cannot distinguish pertinent information from the raw log file itself. Displaying statistics in an aesthetically pleasing manner is a very important attribute. Every once in a while, user interface designers create a new paradigm, setting a standard that other designers attempt to emulate. Arguably, Apple has done this with its OS X desktop environment, and some Web log analysis programs do this better than others.
 
Webalizer is one popular log analysis tool. Many people prefer it because it is written in C and runs quite fast. The graphics, however, are not optimal. The gd graphics library supplies some readable charts, but they are not as aesthetically attractive as they could be. The reports themselves are sufficient for providing a quick glimpse of a few important data points; namely "what pages are accessed" and "how many hits are we getting." A wealth of information can be extracted from Web logs. When done properly, the information is not so overwhelming. Webalizer is adequate, but its mediocre graphics and lack of statistics, earn it a mere three stars in our five-star ad hoc award system.
 
Analog, favored by a small group of die-hard fans, is another worthy contender. Analog attempts to present everything, but it is an example of how to include too much information for normal human consumption. By default, everything is displayed on the same Web page. A navigation bar at the top allows users to click on a specific report, which drills down to another section of the page. Analog’s saving grace is the navigation bar at the top of each section, which simplifies the navigation — somewhat. Analog’s more interesting reports include listings of: how many hits come from each country (TLD, actually), search engine queries that brought users to the Web site, and which browsers and operating systems visitors used. The software is capable of presenting just about everything else derivable from Web server logs. The graphics are a slight improvement over Webalizer gd-based graphics, but the pie and bar charts still leave much to be desired. Because Analog includes much useful information, and the navigation isn’t completely unusable, we feel it deserves an apprehensive four out of five stars.
 
Summary is a commercial log analysis tool for which a 30-day trial is available. This package includes all possible information and lists options in a text Web page for users to click on. When you follow a link, for example, "Bandwidth Peak," you are brought to fairly decent Web page that lists bandwidth usage by time. A small bar graph accompanies each entry, but the graphics in Summary are quite minimal. Here, minimal is not a defect. Quite the contrary; Summary is really decent looking. However, the overall GUI is cumbersome, and it took us a good bit of time to browse to each report we wished to see. The cost of Summary is not prohibitive, and the reports are decent, albeit not awe-inspiring. We rate it four out of five.
 
No discussion of Web log analysis software would be complete without at least a nod to WebTrends. The sheer scope of WebTrends Web Log Analyzer (another commercial offering) earns it an honorable mention here. Its Web site makes the auspicious claim of increasing return on investment, and even asserts "This is Complete Web Analysis." Not surprisingly, WebTrends is not for organizations with skinny wallets. The online demos reflect how great GUI design should look, and it does indeed look great. The company’s claims of usability appear founded, and it has even included a way to access all of the information available from Web server logs. WebTrends has been around for more than a decade and plays nicely with IIS. We are giving it four out of five stars, based solely on what we learned in the product’s impressive Web-based demo.
 
The grail of log analysis, AWStats, is by far the best looking of all of the Free Web log analysis tools we’ve seen. AWStats is also the only Perl-based application on the list. Its graphics are superb, and its information is presented in an excellent manner. At a glance, users can view all available reports and navigate seamlessly between them. Many users will be amazed at the amount of detail the program can extract from the log files. Small browser icons and flags for various countries add to the already-pleasing GUI. AWStats includes all of the features mentioned above for other programs, and is in a readable format, to boot. We give it the full five stars.
 
Of course, there are countless other log analysis programs, but these are the more commonly deployed ones.
 
Compatibility, which is normally a key issue, is not a great concern when it comes to log analysis tools. The Apache Web server produces logs in a standardized format, called NCSA combined log files. IIS W3C conformant format is also supported by most of the analysis programs listed here.
 
In a later article, we will explore the other types of log files most of these programs can work on, including mail and FTP.

web log分析

我需要一个web分析软件。应该具有如下的功能:
1、具有普通log分析的功能
2、可以记录每个页面访问时提交的信息,比如asp?后面的内容
3、可以具体分析出访问某个具体页面的visitors。或者可以查看某个visitor访问过的页面
 
第一条似乎100%的log分析软件实现了。
第二条似乎大部分分析软件都去掉了:(
第三条支持的也不多,我只见过一个
 
 
今天测试了n个log分析软件。发现还是一个叫做web log explorer的符合要求,而且是他的pro版本。免费试用30,我没有找到破解。。
 
主要功能如下图。
 
 
 
2005-12-27_15-28-07.jpg 2005-12-27_15-28-16.jpg 2005-12-27_15-28-22.jpg 2005-12-27_15-28-43.jpg

给blog增加了一段mybloglog的代码

今天受到dba notes 的提示,给站点增加了一段mybloglog的代码。
效果就是可以看到每个连接的点击次数以及从什么站点链接过来的。可惜的是,注册后属于MyBlogLog Pro类型,只能使用3天,3天之后不交费就变成了free类型,也就无法看到详细的log了。但也足够了:)
唯一不爽的地方,修改时区只能是pro帐户修改,虽然我现在是试用pro帐户,也应改可以吧,可惜,不行。。。于是我现在属于了USA – New York City – New York (GMT -0500)。。
 
 
2005-12-27_09-15-03.jpg

Life Is Beautiful

2005-12-26_22-07-59.jpg
 
   

《美丽人生》是一部意大利影片,一部非常优秀的电影作品。
 
  影片的时间背景二战时期,主人翁是一个幽默而乐观的年轻的犹太人,如千百万的犹太人一样,老实努力勤奋地工作着。
  影片在舒缓的音乐,优美的田园风光中拉开了序幕。主人翁与他的朋友开着一辆老式的跑车开始了“惊心动魄”的旅程。车子终于停下来了,一个美丽的姑娘忽然从天而降坠入了他的怀中,姓丘的那个小家伙的箭就在那一瞬间击中他了。“哦,我的公主,”年轻的犹太人的绅士般的优雅风度把姑娘逗笑了。
  在城里的又一次偶然的邂逅,小伙子开始为姑娘神魂颠倒和辗转反侧了,于是,他使出浑身解数,虽然他只是一个小小的酒吧生,虽然姑娘有一个做高官的未婚夫,可他还是勇往直前,终于在姑娘的订婚宴席上,作为应侍生的他骑着涂得五颜六色的高头大马把姑娘接到了他的摆满鲜花的小房子里。
  下面是蒙太奇的手法,从花房里出来了一个五、六岁的孩子,他们的孩子。主人翁也在市中心开了家小书店,小日子过得和和美美。
  那个时候,因为战争,形势有些紧张了。虽然他们所在的城市虽然并没有受到战争的袭击,但已经被纳粹党统领了,街上的商店很多都写着“犹太人与狗不得入内”。孩子十分不解,问父亲:为什么?
  怎么回答孩子呢?战争可以说是战争狂人们玩的游戏,但对于孩子来讲,却是真正的灾难,怎么能告诉他因为他是犹太人就必须遭受不公平的待遇?怎么能让他幼小的心受到如此的打击?
  父亲对儿子讲:“别人不喜欢啊,就象前面的商店不喜欢中国人和马,而再前面的商店不喜欢印度人一样,别人不喜欢你有什么办法。你不喜欢什么?”
  儿子说:“蜘蛛!”
  父亲说:“我不喜欢野蛮人,明天我们在我们的书店写上蜘蛛和野蛮人不得入内!”
  明天就是儿子的生日了,夫妻俩十分高兴,想着要好好地庆祝,久不理睬女儿的外婆也要来了,来祝贺乖孙儿的生日。
  可是,当第二天妈妈接了外婆来到小屋的时候,不幸的事终于发生了,小屋一遍狼藉,儿子和丈夫已经不在了。
  爸爸和儿子被纳粹强行送到了集中营。在挤进如沙丁鱼罐头般拥挤不堪的火车里,爸爸对儿子说:“我说要在你的生日给你一个惊喜,看,这就是了,我带你做一个游戏,一个真正的游戏,如果我们赢了,就可以得一辆坦克,一辆真正的坦克。”
  影片的前面一直是喜剧,我一直在笑,笑得眼泪都出来了,而这时,我的眼眶又湿了,我的鼻子酸酸的,我的心开始痛了。
  母亲走到了纳粹的办公室,对那个纳粹分子说:“让我也去!”
  “女士,你不是犹太人,你可以不去。”
  “让我去,让我和我的丈夫我的儿子在一起。”母亲厉声地说。
  纳粹分子同意了。
  他们被送到了集中营,虽然是在同一个地方,可是男女分开,不能讲话,他们只有在偶尔在做工的时候四目相望。
  儿子与爸爸在一起,为了让儿子不受到战争和死亡的威胁和恐吓,父亲费尽了心思。当纳粹军官找翻译,给大家讲集中营的纪律的时候,不懂德语的父亲急忙充当翻译,他怕别人将真实的情况讲出来,吓着孩子了,军官一边讲着纪律,父亲一边“翻译”着游戏规则。
  每天都是干的十分重的体力活,累得不堪的爸爸回来时还是要做出非常轻松的样子,给儿子讲,“我们今天积了很多分了,我们领先了。”然后将自己省下的干面包给儿子吃。
  儿子却不能忍受这样恶劣的环境和枯燥的生活了,他毕竟是小孩子,他要回家了,不想做游戏了。父亲同意了,他说:“好吧,我们回家吧,虽然我们目前的分数是第一,大大地超过了别人,可是我们退出了,我们不想要那辆真坦克了。”
  儿子心动了:“我们真的是第一吗?”
  “那当然,”父亲说“我们让他们捡个便宜吧,我们不要坦克了,我们不想驾驶它了,我们要回家了。”
  “哦,爸爸,外面下着雨,我怕得感冒呢。”儿子转身向屋子里面跑去。
  父亲苦涩地笑了。
  就这样,儿子在父亲的保护下,奇迹般地身心健康地活了下来。父亲想了很多的办法想逃走,可是都行不通。
  那天晚上,屋外的警报响个不停,纳粹党开始大批大批运送犯人离开,父亲意识到,这是盟军攻打过来了,如果现在不走,一定会死在这里的。
  父亲抱着儿子,开始逃亡,虽然四面都是纳粹军队,他们还是快逃走了。父亲将儿子藏在屋外墙角边的一个废弃的柜子里,千叮呤万嘱咐,无论发生什么一定不能出来,熬过今夜就可以得到真坦克了。然后开始四处地寻找他的妻子,他一遍遍地叫着妻子的名字,一个个人地问妻子的下落。
  不幸发生了,他被纳粹发现了。
  纳粹分子押着他经过儿子藏身的柜子,他故着潇洒地大步向前,还向儿子的方向挥挥手。他被押到了一个角落处,两声枪响。
  第二天,四周静悄悄地,看不见一个人影了,儿子钻出了柜子,只听见一阵轰隆隆的声音,儿子睁大了眼睛,他的嘴巴也合不上了,来了一辆坦克,真正的坦克。
  坦克上的叔叔将孩子抱上坦克,对他说:“孩子,我们胜利了,盟军胜利了。”
  胜利的凯歌开始奏响,儿子戴着头盔,雄纠纠,气昂昂。“啊,妈妈,”孩子在被解救的人群中发现了他的妈妈。妈妈与儿子幸福地站在坦克上。
  画外音:这就是我的父亲送我的生日礼物!
  伟大的父爱!我忍不住潸然泪下。
  字幕,灯亮。
  感谢上帝,让我们生在这和平的年代,生在这太平盛世!让我们拥有这美丽人生!
 

我们做web2.0的口号就是:不求最好,但求最2

一定得选最好的托管中心 
全套Cisco的网络设备加SUN的服务器 
建就建最酷的用户体验 
免费注册帐户 
每个帐户存储空间最少也得两个G 
什么AJAX呀、Tag呀、Rss呀 
能给他整地全部给他整上 
社区附带一个VIP区,有牛人7×24小时蹲点帮你解惑 
Blog上常驻一个叫Keso的家伙 
留小辫儿,特大牛的那种 
只要一打开页面,甭管有事没事都得用Skype跟人家说 
“你丫赶紧给我注册!” 
一口地道的京片子 
倍儿有面子 
网站里还要建一个wiki系统 
全部翻译自维基百科 
每天翻译量起码百兆计算吧 
再建一个站内搜索 
支持所有内容全文检索 
文本呀、RSS呀、Blog呀,你要搜什么我给你找出什么 
就是一个字——快 
全站搜一次才用0.00001秒 
在这里注册用户的不是CEO就是网络精英 
你要是分不清Blog和WebLog 
你都不好意思去TrackBack人家 
你说这样的网站,VC会投多少? 
我觉得怎么着也得两千万美金吧? 
两千万美金?那是成本 
四千万美金起 
你别嫌贵,我还挑东家呢 
你得研究VC的投资心理 
能掏起两千万的主儿 
根本不在乎再掏两千万 
什么叫Blogger,你知道吗? 
Blogger挑服务商都用最2.0的,不用最好的 
所以,我们做web2.0的口号就是: 
不求最好,但求最2