Dead Page Checker
Hi,
in GSA SER I'd really love to have a function that removes all dead pages from site lists.
Over time the list gets gigantic, and from watching the log and checking with Scrapebox, more than 50% of the pages are dead now, which means 50% of the pages that SER tries to process are for nothing.
Of course I can do it with Scrapebox, but that's a lot of work because you have to load each engine separately, and I think an alive check is a simple function (says the non-programmer).
Regards
Comments
or
remove all PR N/A and 0 from list
in my experience LOW PR sites (N/A and 0) are typically on FREE hosting = limited bandwidth (usually 4-6GB/month), and many/most of such free-hosted sites have exceeded their monthly quota by MID-month = resulting in a 404 or any 5xx server response, depending on the HOST admin configuration
if you INCREASE the minimum PR requirements for your T and projects to 1 or higher,
you encounter fewer dead sites
for the current LIVE check
maybe a feature request to SB could help
or
a manual check of all, folder by folder
the FASTEST UN-scientific method to test page existence is a single PING (which strictly only shows that the HOST answers, not that the PAGE itself exists)
rather than a regular LIVE check, which asks the server for the headers of a page and reads the server response:
200 = OK, anything else = failed
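for anyone who wants to script such a LIVE check, here is a minimal sketch of the "200 = OK, anything else = failed" idea, using only the Python standard library. It is NOT how SER or SB do it internally; the names is_alive and split_alive, the 10 second timeout and the choice of a HEAD request are my own assumptions for illustration.

```python
import http.client
import urllib.error
import urllib.request


def is_alive(url, timeout=10.0, opener=urllib.request.urlopen):
    """Return True only if a HEAD request for the URL ends in status 200.

    `opener` is a hypothetical hook so the check can be tried offline;
    by default the real urllib.request.urlopen is used.
    """
    try:
        # HEAD asks for the headers only, so no page body is downloaded.
        request = urllib.request.Request(url, method="HEAD")
        with opener(request, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException,
            OSError, ValueError):
        # 404, 5xx, DNS failure, timeout, malformed URL: all count as dead.
        return False


def split_alive(urls, check=is_alive):
    """Split URLs into (alive, dead) lists, keeping the original order."""
    alive, dead = [], []
    for url in urls:
        (alive if check(url) else dead).append(url)
    return alive, dead
```

usage would be something like alive, dead = split_alive(list_of_urls), one site list file at a time. Be aware that some servers refuse HEAD requests, so a site counted as dead by this sketch may still answer a normal GET.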
in my experience the second option is more frequent and I usually prefer to start at PR1 for T and PR 3 for projects
a temporary PR limit reduction I use ONLY if I need fast extra BLs
there are LOTS of high PR sites out there, just be creative in finding them: use SB with country-specific searches, or without search at all using creative SB methods
of course
it depends on whether you need hundreds of thousands or millions of links
or just a few tens of thousands
free hosted sites come and go
higher PR sites = ppl have invested effort + time + $ and are more likely to stay for years or longer
= resulting in lasting BLs vs volatile BLs
and
scrapebox of course has a LIVE check
if EVER you have a little learning time left
do a thousand or more account verifications MANUALLY - including low PR sites (I did some 20k some 3 yrs ago - just for learning / studying the article-site environment on the web)
you find among such low PR sites numerous errors such as:
a page's PR is of little importance for the functionality of a site
but high PR sites have simply invested more effort to get the site running, and they are faster than cheaply hosted sites
quality hosting has a price
success in life has a price as well
and raising quality requirements and investing in quality often results in MUCH greater success = MORE site visitors = more fun to work for = more rewards in life