How to filter a 20M link list for GSA in less time
OK, so I just got a 20M link list from a friend, went over to GSA, and chose Import links from text. GSA spent a whole day identifying the list and ended up with only 800k identified and 300k unidentified, meaning it processed only about 1M of the 20M that day. Is there any way to filter the links down to what works for GSA in a short time?
Comments
You could run the list through "Sort and identify" and that will put the links SER thinks it can post to in your identified folder in sitelist format.
Then set your projects to post from the identified folder. They will keep trying to post to all of the links, so you will end up with as many as possible from them.
We have tested this quite extensively, and you get more links using "Sort and identify", as much as 10% more with the same list. You could get the same results by importing the list more than 5 times, but it still takes a very long time to process all of those duplicate URLs.
So, it's a choice between more links or more speed IMO.
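Since duplicate URLs are part of what eats the processing time, one thing that may help is stripping duplicates and obviously unusable lines from the raw list before SER ever sees it. Below is a rough Python sketch of that idea, not a SER feature. It assumes the list is a plain text file with one URL per line; the function names and the file names in the usage note are made up for illustration. It treats two URLs as duplicates if they share the same host, path (ignoring a trailing slash), and query string, which may or may not be the rule you want.

```python
from urllib.parse import urlsplit


def normalize(url):
    """Return a comparison key for a URL, or None if the line is unusable."""
    url = url.strip()
    if not url:
        return None
    try:
        parts = urlsplit(url)
    except ValueError:
        # Malformed input such as a broken IPv6 host.
        return None
    if parts.scheme not in ("http", "https") or not parts.netloc:
        return None
    # Assumption: http/https variants and trailing slashes count as the same target.
    return (parts.netloc.lower(), parts.path.rstrip("/"), parts.query)


def dedupe(lines):
    """Yield each usable URL once, keeping the first spelling seen."""
    seen = set()
    for line in lines:
        key = normalize(line)
        if key is None or key in seen:
            continue
        seen.add(key)
        yield line.strip()


def dedupe_file(src, dst):
    """Stream src to dst without duplicates; return the number of URLs kept."""
    kept = 0
    with open(src, encoding="utf-8", errors="ignore") as fin, \
            open(dst, "w", encoding="utf-8") as fout:
        for url in dedupe(fin):
            fout.write(url + "\n")
            kept += 1
    return kept
```

Something like `dedupe_file("raw_list.txt", "clean_list.txt")` would then produce the file to import. Keep in mind the set of seen keys lives in memory, so a 20M line list can need a few GB of RAM.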
The main reason I use it is so that I don't have to import URLs across numerous servers every day.
So if I am too busy to take care of it, I use identified lists.
If I had a way to auto-import when projects were out of targets, I probably wouldn't bother with "Sort and identify".
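For anyone who does import by hand across several servers, splitting the big list into smaller files first might make each import more manageable. Here is a minimal Python sketch of that, again assuming one URL per line. The helper names, the output file prefix, and the default chunk size are arbitrary choices for illustration, and the script does nothing on the SER side.

```python
from itertools import islice


def chunks(lines, size):
    """Yield consecutive lists of at most `size` items from `lines`."""
    it = iter(lines)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def split_file(src, prefix, size=1_000_000):
    """Write src out as prefix_001.txt, prefix_002.txt, ... and return the paths."""
    paths = []
    with open(src, encoding="utf-8", errors="ignore") as fin:
        for i, batch in enumerate(chunks(fin, size), start=1):
            path = f"{prefix}_{i:03d}.txt"
            with open(path, "w", encoding="utf-8") as fout:
                fout.writelines(batch)  # lines keep their own newlines
            paths.append(path)
    return paths
```

Calling `split_file("clean_list.txt", "part")` would give one file per million URLs, which can then be handed to different servers or imported one at a time.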