Loading ...
Sorry, an error occurred while loading the content.

Re: [xenu-usergroup] checking links in javascript

Expand Messages
  • Tilman Hausherr
    ... I tried to use the regexp class at http://www.codeguru.com/Cpp/Cpp/string/regex/article.php/c2779/ After a lot of trying, I got it to work: Regexp reXenu3=
    Message 1 of 35 , Sep 30, 2004
    • 0 Attachment
      On Thu, 23 Sep 2004 10:23:43 -0000, frank visser wrote:

      >javascript:openJump
      >('/business/enterprise/emea/fra/pdf/TBOwp012601_sum.pdf');
      >
      >javascript:openSite('http://mobilizedsoftware.com/enterprise/');
      >javascript:openSpecial
      >('http://www.intel.com/ebusiness/notebook/intel_offwar.swf','685','514
      >','auto','no','no','yes','5','5','win');
      >
      >javascript:openWin
      >('http://www.intel.com/personal/computing/emea/fra/glossary/index.htm?
      >enh_int_spe','540','420','no','no','no','no','200','200','glossary');
      >
      >javascript:popup('http://www.altiris.com');
      >
      >javascript:popup
      >('ftp://ftp.webseminarslive.com/pub/webseminarslive/12-05-02.pdf');
      >
      >
      >so basically:
      >
      >javascript:function('URL');
      >
      >with some URLs having ?parameters and some having popup size
      >parameters. i include the first, but ignore the second.
      >
      >even then, these are the steps i have to take:
      >
      >- run xenu on the site
      >- export the list of URLs to txt
      >- import in Excel
      >- sort by status
      >- select skip type URLs
      >- copy to txt
      >- run regular expression*
      >- fine tune list of URLs
      >- run xenu on this list
      >- identify broken links
      >- locate the source pages of these broken links in the 1st report
      >
      >*i use a regular expression now to catch the URL out of this code:
      >javascript:*\('(f|h|/)*'* (using Funduc's Search and Replace)
      >
      >Xenu now "skips" these javascript links, but apparently is able to
      >locate them and give them a "skip type" status in the report.
      >
      >What is needed to let Xenu run this regex and check the URL directly?

      I tried to use the regexp class at
      http://www.codeguru.com/Cpp/Cpp/string/regex/article.php/c2779/

      After a lot of trying, I got it to work:

      Regexp reXenu3=
      "javascript:(.*)\\('(.*(html?|gif|jpe?g|png|jsp|cfm|zip|exe|aspx?|pl|pdf|xml|ra|asx|ram|swf))'(.*)";

      CString csJava =
      "javascript:popup('http://www.gartner.com/reprints/intel/110673.html');";

      BOOL brc = reXenu3.Match( csJava );
      printf ("b = %d, 1:%s\t2:%s\t3:%s\n", brc, reXenu3[1], reXenu3[2],
      reXenu3[3]);


      I'll come back on this in a few days. Send me a HTML file that has some
      JS links.

      Tilman
    • Tilman Hausherr
      Did anyone try to run Xenu with Sandboxie? Does it work? I m thinking about mentioning it on my web page so that paranoid people can use Xenu too :) Tilman
      Message 35 of 35 , Sep 18, 2010
      • 0 Attachment
        Did anyone try to run Xenu with Sandboxie? Does it work? I'm thinking
        about mentioning it on my web page so that paranoid people can use Xenu
        too :)

        Tilman
      Your message has been successfully submitted and would be delivered to recipients shortly.