nice code. BUt what about if you write some comments on your code.
Now its like full of a-z alphabets with for loops. :)
Or else if you can change the variable name with some specific names.
That will be good, and code will be easily understandable.
~ ankur ~
On Sat, Jan 24, 2009 at 11:35 AM, thisistrinath
> Hello friends,
> I made this URL extractor a while back and posted a message here
> but nobody replied. So I am posting again. It might be useful to you
> people. I have made a URL extractor(to get all the links in a web
> page) and put it at http://yerra.trinadh.googlepages.com/Linkfinder1.html
> Please test that, just by changing File_get_contents parameters
> at the top. Its features are: it looks for links in A & FRAME tags,
> rejects links with no or one character decription, rejects URL's with
> length greater than 200 chars, rejects URL's which are links to files
> like PDF, MPEG etc(26 such extensions) and finally I have also made it
> to reject image links because in my view most of image links are only
> advertisements and are thus useless.
> Use it and give me your comments and feedback, my next step is
> to convert relative URL's into absolute URL's and then to use streams
> to start looking for links as soon as I am getting the webpage instead
> of waiting for the whole download to finish, this will significantly
> reduce the time.
> PLease test it!
> Trinadh Yerra
Fred Allen - "Television is a medium because anything well done is rare."
[Non-text portions of this message have been removed]