When I deleted the part between <...>, it did work:
probably < and > are not recognized by Xenu?
- wasn't there a / missing after the second https:/ ?
- what did [^' between the <...> section do?
thanks again for your help,
--- In email@example.com, "Sattler,Eugeny,SAMARA,B&C"
> 23.11.04 2:07, Frank Visser <f.visser3 (a)chello.nl> wrote
> FV> Would you know a way to rewrite the regex i am currently using:
> FV> xenu now parses it incorrectly, which leads to many "broken"
> FV> So I want the regex to match only URL like strings, starting
> FV> for http, ftp and /relative_links.
> Hi Frank!
> Still haven't read "Regular Expressions syntax" part of PowerGREP
> The task you mentioned is so-o-o easy!
> I suggest this:
> matches either "/" or "ftp://" or "http://" or "https://"
> BTW, that already was a part of my initial (not simplified)
> but, as you remember, we decided to get rid of this because we
> our regular expression to be as simple as possible so as to be
> regex processing library (not entirely perl compatible) of
> understands it.
> But I would go further - I suggest trying to match URL chunks
> starting with "?param_name=param_value" like here:
> So I suggest this
> Explanation: due to presence of "\?[_a-zA-Z]+=" in the regex we
> consisting of letters and/or digits and/or underscores, followed by
> an equals sign to be passed to URL checker.
> "?param_name=param_value" URL chunk will be concatenated with
> href and then the whole thing will be checked.
> Best regards,
> Eugeny mailto:accmailer%20AT%20yandex.ru
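
For readers following along: the regexes from the messages above did not survive in this copy, so the following is only a minimal sketch of the two ideas described in the quoted reply, written with Python's `re` module rather than Xenu's own regex library (which, as noted, is not entirely Perl compatible). The names `URL_PREFIX`, `QUERY_CHUNK` and `looks_like_url` are made up for illustration. The fragment `\?[_a-zA-Z]+=` is taken as quoted; as written it accepts letters and underscores only, so a parameter name containing a digit would not match.

```python
import re

# Hypothetical sketch, NOT the regex from the thread (which is not preserved here).
# Alternation of the four prefixes named in the reply:
# "/", "ftp://", "http://" and "https://".
URL_PREFIX = re.compile(r"(?:/|ftp://|https?://)")

# The fragment quoted in the explanation: a literal "?", then one or more
# letters or underscores, then an equals sign.
QUERY_CHUNK = re.compile(r"\?[_a-zA-Z]+=")


def looks_like_url(candidate: str) -> bool:
    """Return True if candidate starts with a URL prefix or a ?name= chunk."""
    # re.match only succeeds at the start of the string, so a prefix in the
    # middle of the text does not count.
    return bool(URL_PREFIX.match(candidate) or QUERY_CHUNK.match(candidate))


if __name__ == "__main__":
    for s in ["/relative_links", "https://example.org",
              "?param_name=param_value", "javascript:void(0)"]:
        print(s, looks_like_url(s))
```

On the earlier question about `[^'`: in most regex dialects a character class that opens with `^` is negated, so `[^']` matches any single character except an apostrophe. Whether Xenu's library handles every such construct the same way is not something this sketch can confirm.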