Re: Can someone please confirm that this is a xenu bug?

  • frank visser
    Message 1 of 7 , Jan 1, 2005
      hi tilman,

      this is another example of the ampersand/asp issue, raised by Raj:

      can you (or anybody else) explain why on this page:
      http://www.intel.com/cd/business/enterprise/apac/eng/96322.htm

      the "IBM" link in the body area goes to an IBM Thinkpad
      Notebook page:

      http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344&catalogId=-344&categoryId=2056943&storeId=344&dualCurrId=97&ca=product&met=notebooks&re=ibmcomhk&me=ad

      but in Xenu it gives an IBM "Page not Found" page:

      http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344

      the link itself looks like this in Xenu:

      http://appzone.intel.com/util/serve-url.asp?iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344&catalogId=-344&categoryId=2056943&storeId=344&dualCurrId=97&ca=product&met=notebooks&re=ibmcomhk&me=ad&

      so again, after the second "&" Xenu breaks off the URL.

      this URL is quite complex:

      - it is embedded in a javascript function popExt()
      - it runs through an asp file with an id tag and a URL as parameter
      - this parameter URL has again some parameters (langId, categoryId,
      storeId, dualCurrId, ca, met, re, me)

      it seems that Xenu makes it to the first of these parameters
      (langId) and then breaks off the URL.
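
One way to see why the target could end at langId: if the "&" characters inside the url= value reach the server unencoded, a standard query-string parser treats them as separators of the outer serve-url.asp query. A minimal Python sketch, assuming serve-url.asp splits its query string the usual way (its actual code is not shown in this thread):

```python
from urllib.parse import urlsplit, parse_qs

# The link as Xenu displays it, with plain "&" inside the nested URL.
link = (
    "http://appzone.intel.com/util/serve-url.asp?"
    "iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/"
    "stores/servlet/CategoryDisplay?langId=344&catalogId=-344"
    "&categoryId=2056943&storeId=344&dualCurrId=97&ca=product"
    "&met=notebooks&re=ibmcomhk&me=ad&"
)

# Split the outer query the way a typical server-side parser would.
params = parse_qs(urlsplit(link).query)

# The "url" parameter stops at the first "&" that follows it.
redirect_target = params["url"][0]
print(redirect_target)

# Everything after that "&" became separate parameters of serve-url.asp.
leftover = sorted(k for k in params if k not in ("iid", "url"))
print(leftover)
```

Under that assumption the redirect target is the CategoryDisplay URL with only langId=344, and catalogId, categoryId and the rest are handed to serve-url.asp instead of to IBM.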

      is this a dilemma so that either way you solve this, other problems
      will arise, as you said?

      can you elaborate a bit more on this? What problems?

      i'd like to understand what is involved here.

      thanks,

      frank

      --- In xenu-usergroup@yahoogroups.com, Tilman Hausherr <tilman@s...>
      wrote:
      > On Wed, 22 Dec 2004 07:50:46 -0000, frank visser wrote:
      >
      > >
      > >
      > >hi tilman,
      > >
      > >any comment?
      >
      > Yes, I believe I already did to that person in mail a few weeks ago. If
      > I would change the way xenu treats "&", it would bring a ton of new
      > protests because some people want it the way it is currently done :-(
      >
      > Tilman
      >
      > >
      > >i have noticed similar issues before: URLs "breaking off" in Xenu
      > >when they contain multiple "&" parameters after "?"...
      > >
      > >will try to find a good example.
      > >
      > >frank
      > >
      > >--- In xenu-usergroup@yahoogroups.com, "Rajesh Duggal"
      > ><rajeshduggal@h...> wrote:
      > >>
      > >> 1. Create a test.html file containing.
      > >> <a href="test.asp?URL=%2Ftest.html%3Fa%3D1%26b%3D2">Test</a>
      > >>
      > >> 2. Create a test.asp file (or an equivalent in another language)
      > >> <%
      > >> Response.Buffer = true
      > >> dim strURL
      > >> strURL = Request.QueryString("URL")
      > >> 'For now I've commented out the http redirect instruction
      > >> 'and instead have it write out the url it will redirect to.
      > >> 'Response.Redirect(strURL)
      > >> Response.Write(strURL)
      > >> %>
      > >>
      > >> 3. View the test.html file in your browser... and click the test link
      > >> (Using IE6) I end up at..
      > >> test.asp?URL=%2Ftest.html%3Fa%3D1%26b%3D2
      > >> and it correctly contains the content..
      > >> /test.html?a=1&b=2
      > >>
      > >> 4. Point xenu at the test.html file. Notice it goes to
      > >> test.asp?URL=%2Ftest.html%3Fa=1&b=2
      > >> and the content contains..
      > >> /test.html?a=1
      > >>
      > >> ---
      > >> This seems like a bug to me..
      > >>
      > >> How can we prevent xenu from urldecoding the "=" and "&"
      > >> symbols in the urls before it makes the http request.
      > >> (Note: it seems to keep the "?" encoded correctly :)
      > >>
      > >> -------------------------------------
      > >>
      > >> Here's another example demonstrating the same xenu behaviour...
      > >>
      > >> Go to www.google.com and in the search input box enter...
      > >> a=1&b=2 then click search
      > >>
      > >> Notice the url is now..
      > >> http://www.google.com/search?hl=en&q=a%3D1%26b%3D2&meta=
      > >>
      > >> Now if you open a new browser window and type that url
      > >> in you'll see the search input box will still contain..
      > >> a=1&b=2
      > >>
      > >> Now start xenu and ask it to check the same url..
      > >> http://www.google.com/search?hl=en&q=a%3D1%26b%3D2&meta=
      > >>
      > >> Notice it breaks!
      > >> and if you double click the url in xenu.. you will notice
      > >> the url is different
      > >> http://www.google.com/search?hl=en&q=a=1&b=2&meta=
      > >> and the search input box only contains...
      > >> a=1
      > >>
      > >>
      > >> Cheers,
      > >> Raj.
    • Tilman Hausherr
      Message 2 of 7 , Jan 1, 2005
        On Sat, 01 Jan 2005 12:35:56 -0000, frank visser wrote:

        >is this a dilemma so that either way you solve this, other problems
        >will arise, as you said?

        Yes, because I have had this for years... there were always some sort of
        complaints about whether I should convert the "&", "#" etc, and the
        current situation seemed to keep things quiet. My solution was to
        convert as much as possible.

        Here's what's in the link:

        javascript:popExt('http://appzone.intel.com/util/serve-url.asp?iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344%2526catalogId=-344%2526categoryId=2056943%2526storeId=344%2526dualCurrId=97%2526ca=product%2526met=notebooks%2526re=ibmcomhk%2526me=ad&')"

        Here's what IE has:

        javascript:popExt('http://appzone.intel.com/util/serve-url.asp?iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344%2526catalogId=-344%2526categoryId=2056943%2526storeId=344%2526dualCurrId=97%2526ca=product%2526met=notebooks%2526re=ibmcomhk%2526me=ad&')

        here's what Xenu makes of it:


        http://appzone.intel.com/util/serve-url.asp?iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344&catalogId=-344&categoryId=2056943&storeId=344&dualCurrId=97&ca=product&met=notebooks&re=ibmcomhk&me=ad&

        here's what Xenu would make of it without my extra conversions, and
        using only the conversions by Microsoft:


        http://appzone.intel.com/util/serve-url.asp?iid=APACWTB+96322_Content_84&url=http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344%2526catalogId=-344%2526categoryId=2056943%2526storeId=344%2526dualCurrId=97%2526ca=product%2526met=notebooks%2526re=ibmcomhk%2526me=ad&

        Now I tried the URL of Raj:

        http://www.google.com/search?hl=en&q=a%3D1%26b%3D2&meta=
        and entered it in Xenu (with my conversions):
        http://www.google.com/search?hl=en&q=a=1&b=2&meta=
        without:
        http://www.google.com/search?hl=en&q=a%3D1%26b%3D2&meta=
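
The difference between the two forms can be checked with any standard query-string parser. A small Python sketch, assuming the server splits on "&" first and percent-decodes each value afterwards (the usual behaviour; Google's actual code is of course not known here):

```python
from urllib.parse import urlsplit, parse_qs

original = "http://www.google.com/search?hl=en&q=a%3D1%26b%3D2&meta="
converted = "http://www.google.com/search?hl=en&q=a=1&b=2&meta="

def query_params(url):
    # keep_blank_values=True so the empty "meta" parameter is kept too
    return parse_qs(urlsplit(url).query, keep_blank_values=True)

without_conversion = query_params(original)
with_conversion = query_params(converted)

print(without_conversion["q"])   # the whole search term, "&" included
print(with_conversion["q"])      # cut off at the decoded "&"
print(with_conversion.get("b"))  # the rest became a separate parameter
```

With the encoded form, q carries the complete term a=1&b=2. With the converted form, q is only a=1 and b=2 turns into a parameter of its own, which matches the search box content Raj reported.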


        I'll send you the "special" version by separate mail.

        Tilman

      • frank visser
        Message 3 of 7 , Jan 1, 2005
          tilman,

          ok, so shouldn't the URL without your conversions bring me to the
          Thinkpad page? it gives a 404 page for me as well.

          not sure what you mean by "your conversions".

          but great if this can be solved!

          please note i am using the cookie based version of xenu.

          btw. how did you manage to keep the URL clickable over more than one
          line, add backslashes?

          frank

        • Tilman Hausherr
          Message 4 of 7 , Jan 1, 2005
            On Sat, 01 Jan 2005 14:38:57 -0000, frank visser wrote:

            >ok, so shouldn't the URL without your conversions bring me to the
            >Thinkpad page? it gives a 404 page for me as well.

            Yes...

            Here's what IE points to:
            http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344&catalogId=-344&categoryId=2056943&storeId=344&dualCurrId=97&ca=product&met=notebooks&re=ibmcomhk&me=ad
            here's what Xenu points to:
            http://www-605.ibm.com/webapp/wcs/stores/servlet/CategoryDisplay?langId=344%26catalogId=-344%26categoryId=2056943%26storeId=344%26dualCurrId=97%26ca=product%26met=notebooks%26re=ibmcomhk%26me=ad

            That's what I meant: the complaints never stop. Obviously, now you'd
            want me to convert the %26. But probably not in other URLs...

            I'm also not sure whether this %26 conversion isn't possibly already
            done in the javascript code?
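
To make the encoding layers concrete: "%2526" is a doubly encoded "&", since "%25" is itself the encoding of "%". A minimal Python sketch of what each decoding pass yields, using a shortened stand-in for the IBM URL (the number of passes is an illustration only, not a statement about what serve-url.asp or popExt() actually do):

```python
from urllib.parse import unquote

# Shortened, hypothetical stand-in for the nested URL in the link.
as_in_link = "CategoryDisplay?langId=344%2526catalogId=-344"

once = unquote(as_in_link)   # "%25" -> "%", which leaves "%26"
twice = unquote(once)        # "%26" -> "&"

print(once)
print(twice)
```

One pass gives the "%26" form shown above as what Xenu points to, and a second pass gives the plain "&" form that IE points to. Which component performs that second pass is not established in this thread.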

            >not sure what you mean by "your conversions".

            Converting everything with % and &

            >but great if this can be solved!
            >
            >please note i am using the cookie based version of xenu.

            Uh, the one I sent you has no cookies.

            >btw. how did you manage to keep the URL clickable over more than one
            >line, add backslashes?

            No, my newsreader does this. Your newsreader is one that breaks long
            lines.

            Tilman
