Skip to content

Correct url structure to scrape for success/verify

medwaymedway
edited September 2012 in Need Help
I've been scraping guestbooks on the post page but then saw for instance in Basti that it seems to modify the url like:

modify url=%targethost%%targetpath%?new_message=1

and verify here

verify url=%targethost%%targetpath%


But if I'm scraping the sites to already have the new_message=1 in the url is this ok?  Also what exactly defines targetpath (I assume targethost is the raw domain name?)

Thanks.

Comments

  • OzzOzz
    edited September 2012
    You don't need to scrape for the message site but it won't hurt also.

    www.example.com/guestbook?new_message=!

    %targethost%/%targetpath%?new_message=!

  • Ok thanks,  What I'm seeing though is the target path is different for same platform as there are variations.  I do see a few variations in the engines but can it really account for all types it might find?
Sign In or Register to comment.