Skip to content

GSA can't Scrape. Google blocks proxies.

rduquerduque
edited August 2014 in GSA Search Engine Ranker
Hi,

I think GSA scraper don't work with Google. I have:

Custom time between search queries: 80 sec.
Fresh proxies: 20 private
Threads: 200. 
html timeout: 120 sec.

With this settings, google always ban my proxies. 

I checked ONLY Google US in my project settings, and only with this single search engine selected, Google eventually ban all my proxies (15-20 min after the project is running). Notice that I only have Google US selected.

This means, with this proxy, GSA is only searching on Google.com every 80 secs (which is reasonable) and this wouldn't have to be consider "spam". But Google ban my proxies.

"Proxy (ip) blocked on Google.com" and GSA can't find new targets. My project eventually stopped.

This means, in some way, Google has "blacklisted" the GSA footprints, or is detecting patterns which identify GSA is searching, and automatically ban the proxies. 

I think @Sven should read this and give his oppinion. Because for me, it is impossible to scrape with GSA on Google.

Maybe I'm wrong, but it doesn't make sense. The query is each 80 sec, only on Google.com. The proxy should not be banned.

Do you guys can scrape with GSA SER on Google without proxy ban?


Comments

  • elliotps932elliotps932
    200 threads per proxy? or 10 threads per proxy?
  • rduquerduque
    @elliotps932 where can I see it? I mean 200 threads in option --> submission tab

    Can you scrape on Google without proxy ban?
  • elliotps932elliotps932
    Try severely reducing threads
  • rduquerduque
    @elliotps932 I tried with 50 threads. After 10 minutes I had 10 proxies blocked on Google...

    Is this happening to you too? 
  • VijayarajVijayaraj India
    edited August 2014
    Nope.. it's still working fine for us.. who is your proxy provider?
  • rduquerduque
    @Vijayaraj I use buyproxies.org

    Do you use only private proxies for scraping or public? I use only private
  • If they are shared, it's probably someone else hammering the proxies.
  • svabb2000svabb2000
    I would suggest maxin the time between searches. That always helps ....a little slower indeed but you can actually scrape.
  • rduquerduque
    @svabb2000 how much time between search queries do you have?
  • svabb2000svabb2000
    @rduque i have 120 sec timeout but im using backconnect proxies and in gscraper i use 30 sec
Sign In or Register to comment.