Skip to content

What is the expectant life span of a GSA SER list

2»

Comments

  • If you have a huge source of proxies then scraping your own targets is better.

    Getting paid list saves you from the cost of scraping software (one time cost), proxies (monthly recurring cost) and VPS (optional) cost plus add the time (from couple hours to a 1-2 days) to scrape and convert to verified lists by feeding to GSA SER.

    Only you can decide which is the best option for you.
  • Can someone answer me here some things. I am newbie in scraping my own list, so when you scrape let's say using scrapebox, you import all footprints from all platforms articles,blog comments,wikis etc. And add some random list of keywords let's say 5k keywords and merge them. When scraping finish what do you do then? You remove all duplicate domains/urls right? And what is left you import in GSA, and then gsa sort all those links to specific platform right?
  • Thank you @sabahat and @spammasta

    Those verified lists are so abused. Even if they are sold to just "50" people, each person prob has 100+ campaigns they run the list through daily.

    Instead of buying a 1 off list ( which is what creates the abuse ), you need a system to continuous feed yourself URLs.

    One of those "systems" is scraping for yourself.
  • 2Take22Take2 UK
    edited March 2015
    What people don't seem to understand is that scraping for yourself or even feeding SER with 'identified' urls is only half the battle. In fact, it's not even half the battle, as anyone who knows anything about building lists will tell you that the 'hard' part isn't scraping the urls, but PROCESSING them into verified targets.

    To build anything like a decent list you're going to need multiple copies of SER running 24/7 just to process everything, otherwise your scrapes will be probably be dead before you even get around to using them. That is, assuming you want to build a decent volume of links on a nice amount of unique domains.

    People seem to think that there's some infinite unlimited pool of contextual sites that SER can post to out of the box, and that if you scrape yourself then you'll somehow be ahead of the pack?

    With all the list sellers scraping on an industrial scale now, you'll be lucky to scrape 10 sites out of every 100k that haven't already been spammed to death due to being on someones list somewhere, and once you build your list to a certain size, it gets progressively harder to increase the amount of unique domains.

    I guess it depends on how you value your time, but generally speaking IMHO you're much better off buying a ready verified list or subscribing to a verified list service, hammering the targets, and moving on.

    That said, it is worth scraping your own targets if you're modding the engine files and using custom footprints, or trying to scrape specific or niche relevant things like blogs etc. but scraping for the sake of scraping is largely a waste of time when you can just pay someone else to do it for you.

    Just my 2p, and I'm sure others will disagree though.


    Edit: To answer the OP's question there's no real set time limit on how long they last, although it is true that they will degrade over time (as will a list that you scrape yourself).

  • edited March 2015
    Thats a good view @2Take2.

    Here is my perspective though....

    Why buy a verified list that 50 ( or so ) other people have bought, and that you might be number 49 of, who will share it among friends or create footprints on it that someone who scraps will find later at no costs? More than likely it's not a fresh list ( its a list of sites other people have dropped several links on already, if not the owners too ) since it was scraped to begin with to find it.

    More than likely if you sold a truly fresh list to someone, I can find it maybe in a few days later for free by scraping. Im sure one of those buyers is in some big niches I can hunt down with footprints and keywords like bitcoin, payday loans, weight loss, etc... once you do this for a while, you start noticing certain URLs and domains and can figure out certain people must buy lists or scrape a ton and then it's just modifying your footprints to find just them and traveling over to Ahrefs.com to get more links on that domain/URL and the ones like it on the same spammed page.

    Will this tactic get you everything... no, but its 1 "system" you can use.

    Also, let's not mention that even just 1 person with the purchased verified list who has 100+ campaigns hitting it will cause it to be worthless by themselves in a few days. 50+ people makes it worthless the day it hits the market. 9 times out of 10, the creators of the list abused it already before they sold it too.

    This is why I think verified lists are a waste of money. Scraping your own lists is not a waste of time though if you value your time. If you are successful with spamming why not hire a VA for $5 an hour to babysit your VPS and software? Your time is not wasted there now. This is just 1 way to "not waste" your time.

    I disagree that finding more and more contextual every day is getting harder. I also disagree that scraping gets you only spammed to death URLs or dead URLs by the time you process them.

    Numbers differ from around the web, but:

    Today Alone - 156,459 domains were registered new ( and that's just for 6 tlds ONLY, not including all the country code ones ). That's just 156k+ domains registered new today for only 6 tlds!

    When I see how many were transferred for the top 6 tlds, that reaches 246k+ today alone for just transferred domains. Surely a good percentage of them are getting sold to new owners or moved to new servers for new projects ( like hosting wordpress maybe ).

    If I work off just the new registrations, thats 4.6M new domains a month.

    Also, Wordpress powers about one in every 6 websites online.. with over 100,000 more popping up every day ( from forbes.com ) If I take 4.6M and divide by 6, I get 766k possible wordpress installs on new domains every month from just the top 6 tlds.

    These numbers above.. they are only for top 6 tlds and only for wordpress and working off rough estimates.  These numbers alone are staggering. Imagine all the other country code TLDs and all the other platforms like drupal, directory sites, etc.

    So I have to disagree somewhat. Also, many of the scraped targets I hit Im one of the first links on the page even.. so I don't think they are spammed to death.

    I've been using SER since it first came out and I will AGREE that the way you guys scrape today is limiting. However, there comes a point in time when you think of new ideas and fresh ways of doing things that creates new avenues to finding and scraping targets that aren't spammed to death and aren't drying up every day.

    Go out and innovate. Doing what has been done the last 2-3 years will not get you very far.

    The problem with buying verified lists is you end up basically being a slave to the list seller. You get a list of URLs based on settings in their SER they dreamed up and when they are ready to sell it. Why should people buy a list of URLs not to their SER settings, plug it into their SER and get a crap ton of indexer links and spammed URLs and then have to wait 2-4 weeks for another set?

    Also, processing is not an issue. I personally scrape about 172k URLs an hour on 1 low end VPS and then feed the results to my copy of SER with sits on my desktop home computer with a crappy 16Mbps DSL line and 10 proxies. I dont process in real time with SER, I collect those URLs and use them the next day in my personal setup and Im able to have SER finish those URLs in a few hours.

    I mean, im scraping over 4M urls a day and once Im done scraping and use it in SER I am getting finished in SER within hours ( maybe 4-5 tops ). That means I have processing power left over actually for more since I am not even running SER 24 hours a day. Also, look at my setup.. if I stuck SER on a high end VPS with more proxies I would still have even more processing power left over. Scraping 4M urls might be low to some, but then again I am not even using the best equipment and I am using pretty restrictive settings too on my software.

    Like I've said before:
    Go out and innovate. Doing what has been done the last 2-3 years will not get you very far.

    I will agree that a list you scrape yourself will degrade over time. However, I'd rather be one of the first links on that degraded domain then number #4378 which is not even prob getting counted as much as a link by the time you drop it and I would rather not have to "depend" on someone else to sell me their list that is created when they wanted to create it and sell it when they felt like it and based off what they deemed correct settings in their scrapebox, ser, or gscraper install ( or custom for that matter ).

    I mean, if I only wanted PR2+ contextual domains.. buying a verified list isn't going to help me much. This problem is compounded even more if I want just ANY URL, but it was less than 100 outbounds or had to contain the word "diet" in it.

    Scraping just works out better. Sure I can buy a verified list and set those settings up ( PR2+ only, or less than 100 outbound, or only sites that contain the word "diet" ), but all the URLs that don't match those settings in a verified list.. im paying to throw away.
  • edited March 2015
    Also, let's not mention that even just 1 person with the purchased verified list who has 100+ campaigns hitting it will cause it to be worthless by themselves in a few days.
    Person who buy identified list will do same thing + will get way lower vpm posting to identified list.

    Scraping your own lists is not a waste of time though if you value your time.
    If you have custom engines, perfect footprints etc... scraping is ok. But mostly people who harvest will find same targets as lists sellers. If you can find some targets i others can find it also.

    The problem with buying verified lists is you end up basically being a slave to the list seller.
    Same with identified list + you need tons time to process list + you will get lower LPM. Here are stats of my identified list from one server, how long it would take normal ser user to process this list (3 milion wordpress alone)? One time processing is not enough to find most of possible targets. Normal user with 1 SER license wont be able to process such list in resonable time.
    image

    Also, processing is not an issue. I personally scrape about 172k URLs an hour
    I mean, im scraping over 4M urls a day and once Im done scraping and use it in SER I am getting finished in SER within hours.
    Thats how much i scrape:

    image

    And i feel its still not enough to get tons of targets. If you harvest 4m urls per day, and some people here harvest 4m urls per X-XX minutes how many of your targets can be unique and not found by someone else? Not many.

    Time to harvest tons of links with uber harvester:
    1-2 days
    Time to sort and identify:
    ~ 1-2 days
    Time needed to verify identified list (posting X times to identified list):
    Depending on number of SER instances but it would take high X or 1X days.

    Creating verified list from identified is difficult process - you need tons of private proxies, tons of cpu power, and SER licenses and time. Harvesting + sort and identify is easiest part of the game.

    @OP
    Its hard to say. Depends how many custom ways to find targets seller use. Its not possible to say that list will be working for X or Y days.
  • 2Take22Take2 UK
    edited March 2015
    @IdentifiedURLs  - With respect, I think we're going to have to agree to disagree on this one.

    Anyway, I'll let you get on with your 'innovating' now. lol

    @satyr85 - Agree totally - glad to see someone 'gets' it.
  • @satyr85

    What captcha solving are you using?
  • leafcrazy 
    CB + captchatronix, now im testing eve.cm and it looks very nice.
    I nearly dont use paid services like deathbycaptcha etc.
  • edited March 2015
    @satyr85 - I love how in every thread I am in, someone is quick to mention "buying" identified list even though I never mentioned it in any thread I am in except my own BST. It's especially true of those that are not even my customer who have never paid for my service. All I want to say is:

    image

    I dont mention my service here either, but you and others are quick to always bring it up in all threads I am in. I guess I am a threat to many of you for so much hate. I must be doing something right for you all to quickly bring it up and try to shoot it down even though I never mention it? You guys are promoting for me for free, so thank you.

    I simply promoted the user scraping his own list. The people who do things like they always have done seem to have a problem with that though...

    @2Take2, I agree to disagree. I wouldn't expect a verified list seller to understand what I was saying, especially a list seller that does what other verified list sellers have been doing for a few years now... sell verified lists as 1 off products. Not much innovation really... lol


  • @satyr85

    I'm also using Eve, agreed. I like it.

    Was curious because I dont seem to be getting that many verifications with some of those article engines :D
  • I dont mention my service here either, but you and others are quick to always bring it up.
    I dont mention your service anywhere. 

    I guess I am a threat to many of you for so much hate.
    I dont hate anyone on this forum. I love everyone because everyone can bring something to this community. I know your opinion about harvesting, identified lists, verified list and in my opinion is different than yours. Thats nothing personal.

    Stats from my post are identified, not verified;)



  • 2Take22Take2 UK
    edited March 2015
    IdentifiedURLs

    The only person mentioning your service in this thread was you, and although I fail to see what's so innovative about it, I do, however, wish you the best of luck with sales. The same goes to all of the other vendors on the forum.

    All I said was that processing a scrape or list of identified urls that will amount to a decent size verified list takes up [lots of] resources. That's not slur on your's or anyone else's business, just a simple fact.

    Also, you seem to have taken my first post out of context. I wasn't saying that people shouldn't scrape their own lists. I'm all for that. I was just trying to point out some misconceptions that people might have about whether it's actually worth it. Next time I won't bother trying to be helpful.

    Each to their own though, and as I said, there ARE obviously situations where it might be a worth while thing to do.

    Anyway, this is the last thing I've got to say on the matter, and this will be the last comment I make on this thread.
  • Trevor_BanduraTrevor_Bandura 267,647 NEW GSA SER Verified List
    What was the original topic here again? LOL guys.
  • davbeldavbel UK
    edited March 2015
    To answer the op question, you need to buy my new WSO -

    was
    $97
    now only

    $7


    :))
  • If you guys are going to have a dick measuring contest can you at least keep it in one thread? This has spanned days over multiple threads.
  • Hi guys, what about those lists that update new ones every week or month or day?
    Are those better?
    1linklist was good and they i think update their lists daily, but emailed and skyped them but no one replied so wanna try others , thanks
Sign In or Register to comment.