Loading ...
Sorry, an error occurred while loading the content.

Re: [webalizer] Re: spaces in useragents (and regexp in conf files)

Expand Messages
  • waldo kitty
    ... i ve been working on this a bit this evening... here s my counts before i started my tweaking... Top 15 of 709 Total User Agents # Hits
    Message 1 of 10 , Mar 28 6:09 PM
    • 0 Attachment
      enventa2000 wrote:
      > Look here:
      >
      > http://griho.udl.es/webalizer/grouping_browsers_by_version.txt
      >
      > If the list is useful to you, then please thank the original author:
      > http://www.tnl.net/email/
      >
      > Unfortunately this list will match any agent containing certain
      > version numbers. I have avoided this as much as posible in my config
      > file by placing a long list for all every other posible agent on top
      > of that list so they are matched before.

      i've been working on this a bit this evening... here's my counts before i started my tweaking...

      Top 15 of 709 Total User Agents
      # Hits User Agent
      1 6008 44.25% Browser: Internet Explorer 6.0 (Win)
      2 2073 15.27% Browser: Internet Explorer 5.0
      3 958 7.06% Spider: Yahoo! Indexing Spider
      4 745 5.49% Spider: Ask Jeeves/Temoa
      5 732 5.39% Spider: Google Indexing Spider
      6 390 2.87% Browser: America Online
      7 384 2.83% Spider: Zyborg Looksmart.net/WISEnut.com Dead Link Checker
      8 356 2.62% Browser: Internet Explorer 5.5 (Win)
      9 319 2.35% Browser: Internet Explorer 5.01
      10 196 1.44% Spider: FAST-WebCrawler v3.xx Indexing Spider (AlltheWEB)
      11 117 0.86% Browser: Netscape 4.04
      12 111 0.82% Spider: MSN.com
      13 103 0.76% Spider: Archive.org/Alexa/Amazon.com
      14 100 0.74% Browser: Navigator 3.01 (16-bit version)
      15 96 0.71% Coding: Java-based client


      and here's my new counts after my tweaking... i think i still have a few more to move and try to make more unique but i won't know
      that without more analysis...

      Top 15 of 709 Total User Agents
      # Hits User Agent
      1 7075 52.07% Browser: Internet Explorer 6.0 (Win)
      2 958 7.05% Spider: Yahoo! Indexing Spider
      3 752 5.53% Spider: Ask Jeeves/Temoa
      4 732 5.39% Spider: Google Indexing Spider
      5 485 3.57% Browser: Internet Explorer 5.5 (Win)
      6 401 2.95% Browser: Internet Explorer 5.0
      7 390 2.87% Browser: America Online
      8 384 2.83% Spider: Zyborg Looksmart.net/WISEnut.com Dead Link Checker
      9 319 2.35% Browser: Internet Explorer 5.01
      10 196 1.44% Spider: FAST-WebCrawler v3.xx Indexing Spider (AlltheWEB)
      11 160 1.18% Browser: Mozilla 1.4
      12 140 1.03% Browser: Mozilla 1.6
      13 117 0.86% Browser: Netscape 4.04
      14 111 0.82% Spider: MSN.com
      15 103 0.76% Spider: Archive.org/Alexa/Amazon.com


      you can see that the MSIE 6.0, 5.5 and 5.0 entries changed and that the mozilla entries appeared... the biggest thing that i saw
      during my manual analysis was that NT 5.0 was being counted as MSIE 5.0 when, in fact, there were several different browsers running
      on NT 5.0... once i rearranged the agent strings and the numbers to catch the 5.0 stuff after the others, things seem to have come
      more into line with what i actually expect to see... i'm positive that there is a bit more to do to really tweak this to produce
      more accurate numbers...

      of course, this would all be moot if MSIE would id itself as "MSIE\version" rather than "MSIE version" and that would go a long way
      as far as working out the need to be able to put spaces in the UA string matching... even then, though, having webalizer be able to
      take spaces and/or regex would be a huge boon and, IMHO, make webalizer much more valuable than it already is ;)

      eventa, i'll be answering your private mail later, ok?

      --
      _\/
      (@@) Waldo Kitty, Waldo's Place USA
      __ooO_( )_Ooo_____________________ telnet://bbs.wpusa.dynip.com
      _|_____|_____|_____|_____|_____|_____ http://www.wpusa.dynip.com
      ____|_____|_____|_____|_____|_____|_____ ftp://ftp.wpusa.dynip.com
      _|_Eat_SPAM_to_email_me!_YUM!__|_____|_____ wkitty42 -at- alltel.net
    Your message has been successfully submitted and would be delivered to recipients shortly.