Loading ...
Sorry, an error occurred while loading the content.
 

Re: [xenu-usergroup] checking links in javascript

Expand Messages
  • Tilman Hausherr
    ... I tried to use the regexp class at http://www.codeguru.com/Cpp/Cpp/string/regex/article.php/c2779/ After a lot of trying, I got it to work: Regexp reXenu3=
    Message 1 of 35 , Sep 30, 2004
      On Thu, 23 Sep 2004 10:23:43 -0000, frank visser wrote:

      >javascript:openJump
      >('/business/enterprise/emea/fra/pdf/TBOwp012601_sum.pdf');
      >
      >javascript:openSite('http://mobilizedsoftware.com/enterprise/');
      >javascript:openSpecial
      >('http://www.intel.com/ebusiness/notebook/intel_offwar.swf','685','514
      >','auto','no','no','yes','5','5','win');
      >
      >javascript:openWin
      >('http://www.intel.com/personal/computing/emea/fra/glossary/index.htm?
      >enh_int_spe','540','420','no','no','no','no','200','200','glossary');
      >
      >javascript:popup('http://www.altiris.com');
      >
      >javascript:popup
      >('ftp://ftp.webseminarslive.com/pub/webseminarslive/12-05-02.pdf');
      >
      >
      >so basically:
      >
      >javascript:function('URL');
      >
      >with some URLs having ?parameters and some having popup size
      >parameters. i include the first, but ignore the second.
      >
      >even then, these are the steps i have to take:
      >
      >- run xenu on the site
      >- export the list of URLs to txt
      >- import in Excel
      >- sort by status
      >- select skip type URLs
      >- copy to txt
      >- run regular expression*
      >- fine tune list of URLs
      >- run xenu on this list
      >- identify broken links
      >- locate the source pages of these broken links in the 1st report
      >
      >*i use a regular expression now to catch the URL out of this code:
      >javascript:*\('(f|h|/)*'* (using Funduc's Search and Replace)
      >
      >Xenu now "skips" these javascript links, but apparently is able to
      >locate them and give them a "skip type" status in the report.
      >
      >What is needed to let Xenu run this regex and check the URL directly?

      I tried to use the regexp class at
      http://www.codeguru.com/Cpp/Cpp/string/regex/article.php/c2779/

      After a lot of trying, I got it to work:

      Regexp reXenu3=
      "javascript:(.*)\\('(.*(html?|gif|jpe?g|png|jsp|cfm|zip|exe|aspx?|pl|pdf|xml|ra|asx|ram|swf))'(.*)";

      CString csJava =
      "javascript:popup('http://www.gartner.com/reprints/intel/110673.html');";

      BOOL brc = reXenu3.Match( csJava );
      printf ("b = %d, 1:%s\t2:%s\t3:%s\n", brc, reXenu3[1], reXenu3[2],
      reXenu3[3]);


      I'll come back on this in a few days. Send me a HTML file that has some
      JS links.

      Tilman
    • Tilman Hausherr
      Did anyone try to run Xenu with Sandboxie? Does it work? I m thinking about mentioning it on my web page so that paranoid people can use Xenu too :) Tilman
      Message 35 of 35 , Sep 18, 2010
        Did anyone try to run Xenu with Sandboxie? Does it work? I'm thinking
        about mentioning it on my web page so that paranoid people can use Xenu
        too :)

        Tilman
      Your message has been successfully submitted and would be delivered to recipients shortly.