Skip to content

first attempt at GSA SER submission - I have many questions!

Hi guys,

Today I finally put in my first project in GSA SER after spending several weeks trying to ramp up on this and several related tools (Scrapebox, GSA CB, WAC).

I have to say, it didn't work out as I expected. Hopefully the other forum members can give me some feedback.

Over the course of several hours, I have kept tweaking certain settings, so I can't even say anymore with certainty what is causing what, but here are the things I've been experiencing.

DOWNLOAD FAILED
I've been getting lot of 'download failed'. checking this forum, of course, it seems that may have to do with connection speed and quality of proxies. I have 10 private proxies (because everyone recommends them so highly), and I am adding to that public proxies scraped through GSA SER (even though forum users advise against them for being slow and undependable), and at one point I had more than a thousand public proxies used.

I am happy to be very conservative at the moment in terms of my link speed. I just want to get comfortable that I am setting up these submissions correctly, and once I see I can create links consistently, I will probably invest in more hardware. For now I would be happy to just create 100s of links in day. I am just running 10 threads, 1 GB ram, on a VPS.  So  I think 10 private proxies should be more than enough.


At some point I realized I should take a more careful look at the speed of the proxies. Indeed I found a wide range in proxy speeds, from less than 1 to over 10 (and I have no idea what this is measuring - can someone tell me?) 

So, I deleted all the public proxies slower than 10. Will this help me?

Another thing I found out was that my fully private proxies (from proxyblaze) are really slow, with speeds under 0.3. (so I guess, the opposite of "blaze"). So, on the one hand it's great, because they're private, but are they likely to be causing me a lot of trouble because they are so slow?

so what's better, private and slow, or public and fast?

NO ENGINE MATCHES
So, I am not using a pre-scraped and verified list of sites. I am having GSA scrape according to footprint and keywords. I guess this message means that a site that was returned in the search simply does not a fit a profile of a platform recognized by the program. is this principally because the list is not pre-verified?

EMAILS
I think one major problem I am having is with my emails. I have put in 50 hotmail accounts. Or actually, it is 50 aliases based on 5 accounts. I recently bought them. Yes, they all have pop3 enabled. And they have the same rule in there recommended on this forum to prevent incoming messages from going to the spam box. However, it seems that that feature (routing rules) is disabled until the email account is verified via SMS. So, that anti-spam rule must not be operating. However, I don't seen any emails in any spam folder.

However, what I do see is that there are plenty of new/unread emails in my inbox. I also see plenty of mails in my deleted box (so presumably GSA SER read that information and then deleted the mail) but the vast majority of those emails are also unread.

So I can' t tell, is one problem I'm having simply that GSA SER is simply not opening my emails accounts to get that verification information?

I am seeing a lot of email login errors (via "Important messages") having to do with 'connection reset by peer' and 'connection timed out'.  is this because of using slow proxies?

And, for all those unopened mails in both my inbox and my deleted boxes, is there any way to force GSA SER to read and recognize that information?

I've got that setting to tell it to not do a pop3 login sooner than  every 900 seconds. Does this actually tell it to not login to hotmail at all for at least 15 minutes, or is that per account?

PROXIES FOR EMAIL VERIFICATION
Another major question I have is whether to use proxies for the email verification. I've seen it mentioned on this forum that it is not necessary. That confuses me. I would think that a provider like hotmail would monitor and find it very fishy that a single IP would be logging into many different email accounts (although, who knows, maybe that IP is a public library). Alternatively, it would be fishy if one email account is being logged into from IPs all over the world. So, is the final answer really to not use proxies for email verification?

LINK VERIFICATION
I have it set to verify links automatically (instead of after a fixed period of time). I understand this verifies the link at random intervals after submission. But is this minutes, hours, or days? The row with the project summary tells me 169 submitted but only 5 verified. And even when I try to force the verification, that number does not increase. Why not?

if any could help me understand some of these issues I would be grateful.

thanks!

Comments

  • No Answer for it. :(
  • Oh my... you are asking a lot of very basic questions that has been answered often before. I will try helping you out, but I am going to be brief.

    Download failed is expected on some sites as they will be down. Frequently getting this means bad proxies.

    your public proxies are likely causing trouble if you do not have a good source. They die all the time. Yes, faster proxies are better and a response time of 0,3 is fine to have. I would say private and slow unless you got a quality public proxy source.

    General rule is to use 10 threads per proxy. 10 threads is nothing, I am using 1000.

    No engine matches, the site does not match any selected engine.

    Emails. Maybe SER does not make mails seem opened after reading through pop3? I also get the errors sometimes but do not care. Proxies are good to use because you will not be able to login if logging in too frequently from the same ip.

    Verifications. On automatic it verifies when the status bar of the project turns blue and it says so in the log. You are not getting more verified urls because you have not made more submissions that has resulted in a backlink.


    Next ime you should try searching the forum first and help yourself.

  • hi @fakenichahl, thanks for your answer despite the apparent frustration. Yes, trust me, I have been through the forum plenty searching for the answer to many topics, but have seldom found definitive answers to specific questions. Threads tend to contain a lot of speculation and discussion of related and probable causes, but still mostly leave me with the same open questions I started with.

    thanks for your recommendation that private and slow is better than public and fast. that helps a lot.

    No engine matches. I understand what it means. If I am using GSA SER to do the scraping, I imagine this will happen  a lot more than if I am using a pre-verified list. Just trying to get a sense for how much is a "normal" amount of these messages that others are experiencing when using GSA to scrape.

    Emails. beyond  doing the pop3 fix, I'm not sure what else I can do. What kind of emails do you use? I took off using proxies for email verification and am no longer getting those email login errors.

    Verifications. what is a normal range of submitted to verified? 2 to 1? 10 to 1? If I know what other are experiencing that would help set the expectations for myself. For me, it's attempted to solve captcha 1300 times, I have 130 submitted, and 10 verified. Are these ratios within a band of normalcy?





Sign In or Register to comment.