Loading ...
Sorry, an error occurred while loading the content.

Re: [webalizer] Problem with recognition of php links from CMS

Expand Messages
  • Bradford L. Barrett
    See: http://groups.yahoo.com/group/webalizer/message/3033 -- ... -- Bradford L. Barrett brad@mrunix.net A free electron in a sea of
    Message 1 of 4 , Feb 15, 2005
    • 0 Attachment
      See: http://groups.yahoo.com/group/webalizer/message/3033

      --

      On Tue, 15 Feb 2005, thommie_es wrote:

      >
      >
      > Hi,
      > I have a problem with php links: I want webalizer to analyze links
      > which are generated by a php-based CMS. The real URLs are like
      >
      > "http://www.solargeneration.de/index.php/pages/DisplayPage?id=256"
      >
      > In the access.log (apache 1.3x) they appear as
      >
      > 85.74.164.38 - - [28/Jan/2005:19:43:56 +0100] "GET
      > /index.php/pages/DisplayPage?id=185 HTTP/1.1" 200 7811
      >
      > I want them grouped so that I can see an article's request number
      > (articles are identified by the "id=xxx" which corresponds to its ID
      > in the SQL database) as a unique value. I played with the "PageType",
      > "AllURL" and the "GroupURL" directives. "PageType php" is also set, so
      > that php pages are recognized at all. No matter what I do, the
      > DisplayPage?id= entries in the logs are always added to the entries
      > from the default "/" top-level entries.
      >
      > I assume that the problem comes from the "unusual" composition of the
      > URL-string ("index.php/.../DisplayPage?=" instead of
      > "index.php?something") But how can I make Webalizer understand this
      > syntax? Any ideas?
      >
      > PS. I'm using Webalizer V2.01-10 (Linux 2.6.8-24.11-default) German
      > and apache 1.3.28.
      >
      > Bye & Thanks, Thomas
      >
      --
      Bradford L. Barrett brad@...
      A free electron in a sea of neutrons DoD#1750 KD4NAW

      The only thing Micro$oft has done for society, is make people
      believe that computers are inherently unreliable.
    • thommie_es
      ... Huh, I m not a C++ hacker, only a little php idiot, so need some advice for the correct syntax ;-): You mean line 1734ff in webalizer.c: int
      Message 2 of 4 , Feb 15, 2005
      • 0 Attachment
        --- In webalizer@yahoogroups.com, "Bradford L. Barrett" <brad@m...> wrote:
        >
        > See: http://groups.yahoo.com/group/webalizer/message/3033


        Huh, I'm not a C++ hacker, only a little php idiot, so need some
        advice for the correct syntax ;-): You mean line 1734ff in webalizer.c:

        int isurlchar(unsigned char ch)
        {
        if (isalnum((int)ch)) return 1; /* allow letters,
        numbers... */
        if (ch > 127) return 1; /* allow extended chars...
        */
        return (strchr(":/\\.,' *-+_@~()[]",ch)!=NULL); /* and a few special
        ones */
        }

        so, shall I just add something like this

        if (ch == ('?' && '&' && '=') ) return 1; /* allow more extended
        chars for php/CMS URLs - caution: may be dangerous */

        ??

        Bye, Thomas
      • jd_314159
        You can analyze query strings with this build: http://www.stonesteps.ca/projects/webalizer/faq.asp?qid=q20050201-01&topic=webalizer You can t group them
        Message 3 of 4 , Feb 15, 2005
        • 0 Attachment
          You can analyze query strings with this build:

          http://www.stonesteps.ca/projects/webalizer/faq.asp?qid=q20050201-01&topic=webalizer

          You can't group them though.

          J.D.

          --- In webalizer@yahoogroups.com, "thommie_es" <t.rother@g...> wrote:
          >
          > Hi,
          > I have a problem with php links: I want webalizer to analyze links
          > which are generated by a php-based CMS. The real URLs are like
          >
          > "http://www.solargeneration.de/index.php/pages/DisplayPage?id=256"
          >
          > In the access.log (apache 1.3x) they appear as
          >
          > 85.74.164.38 - - [28/Jan/2005:19:43:56 +0100] "GET
          > /index.php/pages/DisplayPage?id=185 HTTP/1.1" 200 7811
          >
          > I want them grouped so that I can see an article's request number
          > (articles are identified by the "id=xxx" which corresponds to its ID
          > in the SQL database) as a unique value. I played with the "PageType",
          > "AllURL" and the "GroupURL" directives. "PageType php" is also set, so
          > that php pages are recognized at all. No matter what I do, the
          > DisplayPage?id= entries in the logs are always added to the entries
          > from the default "/" top-level entries.
          >
          > I assume that the problem comes from the "unusual" composition of the
          > URL-string ("index.php/.../DisplayPage?=" instead of
          > "index.php?something") But how can I make Webalizer understand this
          > syntax? Any ideas?
          >
          > PS. I'm using Webalizer V2.01-10 (Linux 2.6.8-24.11-default) German
          > and apache 1.3.28.
          >
          > Bye & Thanks, Thomas
        Your message has been successfully submitted and would be delivered to recipients shortly.