
Re: failure to log search strings

  • Deathlok_the_Demolisher
    Message 1 of 11, Sep 5, 2002
      Hello everyone. I joined the webalizer group because I was having
      this exact same problem: 20% of the searches going into my site
      weren't showing up. After upgrading from 2.01-06 to 2.01-10, spending
      hours trying to get jpeglib and gdlib to compile, and hammering on
      webalizer until it almost fell apart, I finally found out what was
      causing this behaviour. (Well, okay, it wasn't all that difficult,
      but it was frustrating because I, like most people who have had this
      problem, spent hours and hours messing with the .conf file and the
      referrer section of my combined log when that ISN'T the problem.)

      So, what did I find?

      webalizer only logs search strings for requests that hit 'pages'.
      Pages are defined as htm* and .cgi by default. You can go in and
      define all your file types as 'pages', but that makes your 'pages'
      count less accurate. I decided to just remove that requirement.
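
To make the effect of that rule concrete, here is a minimal sketch in C of an extension-based page test. It is not webalizer's actual ispage(); the name looks_like_page, the page_types table, and the handling of URLs without an extension are all assumptions made for illustration. Only the default types (htm* and cgi) come from the description above.

```c
#include <string.h>

/* Hypothetical stand-in for webalizer's page test; NOT the real ispage().
 * The two prefixes mirror the defaults named in the post (htm* and cgi). */
static const char *page_types[] = { "htm", "cgi" };

/* Returns 1 if the extension of the last path segment starts with one of
 * the page-type prefixes, 0 otherwise. */
int looks_like_page(const char *url)
{
    const char *slash = strrchr(url, '/');
    const char *dot   = strrchr(url, '.');
    size_t i;

    /* Assumption for this sketch: a URL with no extension in its last
     * segment is not a page. The real tool may treat such URLs differently. */
    if (dot == NULL || (slash != NULL && dot < slash))
        return 0;

    dot++;  /* step past the '.' to the extension itself */
    for (i = 0; i < sizeof page_types / sizeof page_types[0]; i++) {
        if (strncmp(dot, page_types[i], strlen(page_types[i])) == 0)
            return 1;
    }
    return 0;
}
```

Under these assumptions, /island/david/index.html counts as a page while /island/david/wireless-rh-install.txt does not. That would match the quoted report below, where the one search that was logged hit an .html file and the missing ones all hit .txt files.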

      In webalizer.c, around line 1163, change:

         /* Pages (pageview) calculation */
         if (ispage(log_rec.url))
         {
            t_page++;
            tm_page[rec_day-1]++;
            th_page[rec_hour]++;

            /* do search string stuff if needed */
            if (ntop_search) srch_string(log_rec.srchstr);
         }

      to:

         /* Pages (pageview) calculation */
         if (ispage(log_rec.url))
         {
            t_page++;
            tm_page[rec_day-1]++;
            th_page[rec_hour]++;
         }
         /* do search string stuff if needed */
         if (ntop_search) srch_string(log_rec.srchstr);

      And that takes care of the whole problem. ^_^


      --- In webalizer@y..., David Koski <david@K...> wrote:
      > Hello,
      >
      > My webalizer search strings list almost never has anything but it
      > did recently find the following:
      >
      > 205.158.231.99 - - [12/Jul/2002:11:20:00 -0700] "GET /island/david/index.html HTTP/1.1" 200 864 "http://groups.google.com/groups?q=Orinoco+Linux+USB&hl=en&lr=&ie=UTF-8&oe=UTF-8&selm=8sk3iuolqp975gaki4195huqa327po1lte%404ax.com&rnum=8" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; Q312461)"
      >
      > However, there are many "hits" that never make it to the table and
      > I would like to know how to fix it. Relevant settings are:
      >
      > SearchEngine yahoo.com p=
      > SearchEngine altavista.com q=
      > SearchEngine google. q=
      > SearchEngine eureka.com q=
      > SearchEngine lycos.com query=
      > SearchEngine hotbot.com MT=
      > SearchEngine msn.com MT=
      > SearchEngine infoseek.com qt=
      > SearchEngine webcrawler searchText=
      > SearchEngine excite search=
      > SearchEngine netscape.com search=
      > SearchEngine mamma.com query=
      > SearchEngine alltheweb.com query=
      > SearchEngine northernlight.com qr=
      >
      > AllSites yes
      > AllURLs yes
      > AllReferrers yes
      > AllAgents yes
      > AllSearchStr yes
      > AllUsers yes
      >
      > DumpSearchStr yes
      >
      > Some example failures (separated by blank lines for clarity) are:
      >
      > 129.69.66.147 - - [16/Jul/2002:10:00:48 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.0" 200 1884 "http://www.google.de/search?hl=en&ie=ISO-8859-1&newwindow=1&q=pci%3Dbiosirq&btnG=Google+Search" "Mozilla/4.78 [en] (X11; U; Linux 2.4.10-4GB i686)"
      >
      > 62.47.201.60 - - [16/Jul/2002:10:52:56 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 200 1884 "http://www.google.at/search?q=linux+redhat+orinoco&ie=ISO-8859-1&hl=de&meta=" "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)"
      >
      > 131.193.48.116 - - [16/Jul/2002:14:07:51 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.0" 200 1884 "http://www.google.com/search?hl=en&ie=ISO-8859-1&q=orinoco+wireless+redhat+linux" "Mozilla/4.79 [en] (X11; U; Linux 2.4.18-3 i686)"
      >
      > 198.129.219.189 - - [17/Jul/2002:00:49:24 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 200 1884 "http://www.google.com/search?hl=en&lr=&ie=ISO-8859-1&q=pci+biosirq+pcmcia" "Mozilla/5.0 (compatible; Konqueror/2.2; Linux)"
      >
      > 212.97.184.25 - - [17/Jul/2002:05:22:38 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 200 1884 "http://www.google.com/search?q=%2Borinoco+%2Bpci+%2Birq&ie=UTF-8&oe=UTF-8&hl=es&lr=" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
      >
      > 193.243.66.2 - - [17/Jul/2002:09:12:47 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.0" 200 1884 "http://www.google.com/search?q=pci%3Dbiosirq&hl=en&lr=&ie=ISO-8859-1" "Mozilla/4.0 (compatible; MSIE 5.0; Windows NT; DigExt)"
      >
      > 212.234.248.186 - - [17/Jul/2002:09:23:39 -0700] "GET /island/david/lverror.txt HTTP/1.1" 200 3626 "http://www.google.fr/search?q=+Sorry._Although_I%27m_listed_as_a_best-preference_MX_or_A_for_that_host%2C%2Fit_isn%27t_in_my_control%2Flocals_file%2C_so_I_don%27t_treat_it_as_local&ie=ISO-8859-1&hl=fr&btnG=Recherche+Google&meta=" "Mozilla/5.0 (compatible; Konqueror/3.0.0; OpenBSD)"
      >
      > 129.79.16.185 - - [17/Jul/2002:09:32:35 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.0" 200 1884 "http://www.google.com/search?hl=en&lr=&ie=ISO-8859-1&q=orinoco+wireless+redhat" "Mozilla/4.77 [en] (Windows NT 5.0; U)"
      >
      > 216.23.59.184 - - [17/Jul/2002:12:07:09 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 200 1884 "http://www.google.com/search?hl=en&ie=ISO-8859-1&q=hermes.conf" "Mozilla/5.0 (compatible; Konqueror/2.2.2; Linux)"
      >
      > 198.129.219.189 - - [17/Jul/2002:15:04:09 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 304 - "http://www.google.com/search?hl=en&ie=ISO-8859-1&q=pci+biosirq+pcmcia&btnG=Google+Search" "Mozilla/5.0 (compatible; Konqueror/2.2; Linux)"
      >
      > 66.127.222.170 - - [17/Jul/2002:16:51:35 -0700] "GET /island/david/wireless-rh-install.txt HTTP/1.1" 200 1884 "http://www.google.com/search?sourceid=navclient&q=pci%3Dbiosirq" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; .NET CLR 1.0.3705)"
      >
      > Stats:
      >
      > Mandrake 8.1
      > Webalizer 2.01.06-4mdk
      >
      > TIA,
      > David Koski
      > david@K...