BlazingSEO
I tried searching for similar threads, but couldn't really find anything that directly related to my question:

I'm trying to make sure CB REALLY only solves captchas from engines with a success rate over 70%. However, I'm noticing that very often, across many different engines, it blatantly recognizes the wrong engine and therefore still answers the captcha. Example:

http://postimg.org/image/ometvmj2n/  -> my image = Article Beach
Engine Detected: Pligg (blue)   -> which has a 100% success rate, but is not our image above


Engine Detected: Guest book (big captcha)  --> is not our image above (I'm not sure what engine the image above is, but it's definitely not this one)

Are there any suggestions or settings to reduce the number of mismatched engines that CB finds? Even if I were to set the solve rate to 100%, in scenario 1 above it would still solve the captcha wrong every single time :(


  Sven
    can you post a screenshot from the log?
  BlazingSEO
    Here is scenario 1. For the last image I sent, I unchecked the box for Pligg (blue), so it didn't even attempt to check other engines for validity :(


    Here is scenario 2:


    (Just look at the last image in the log.) My mouse is hovering over Guestbook (big captcha) on the engines above, and you can see what those captchas should look like.
  Sven
    try making use of the option to treat unchecked types as not present.
  BlazingSEO
    @Sven - That's not the issue here, though. The issue is that CB is matching a captcha to an engine it doesn't belong to. Even if I went through and unchecked all the engines with less than a 70% solve rate, the first scenario with Pligg (blue) would still cause problems: if I unchecked Pligg (blue), a REAL Pligg (blue) captcha would never be solved when one comes in. I should be able to leave Pligg (blue) checked, and when the Article Beach captcha comes in (which is what is coming in on my screenshot), CB should skip it because Article Beach has a solve rate below 70%.
  Sven
    The problem is that the colours match, the size matches, and so on... it's hard to tell whether that's the same image or a different one. As you can see in your logs, it has 7 matching types here.
  BanditIM
    It's very hard for software/OCR to tell images apart if they are the same size and have similar properties, such as the same kinds of colors, so segmentation and recognition fail for them.

  • @sven, I jumped in on this one too now.. (sorry for double bothering you)

    But you know, this should definitely be your next update to Captcha Breaker.
    I am using CB to do just ONE thing on ONE site with my own script, so I will only use one type of captcha engine (my own one that I created).

    Yet it detects 4 others because of, as you explained, similar color, image size, etc.

    BUT... you know what is really bothering me?

    It's that I have all the other engines unchecked to point CB in the right direction, and it STILL detects the other engines and even logs "skipped by settings" because I disabled those engines for this type of captcha.

    So the next LIFE-SAVING update of CB for me would be: please use the other available / checked engines. You already display like 5 engines, yet take only the first, and if it's unchecked it's skipped by settings. Maybe add that if it's unchecked, CB goes on to the checked engine for this detected captcha!!

    @sven you would save a lot of trouble for me if you could update this :) Thanks!!!

  Sven
    That's not quite right. Even though CB lists about 5 engines that could possibly match, it doesn't simply take the first one it finds; that list is already sorted by best match.

    However, as written in the other thread, you might want to enable the option to treat unchecked captchas as not present.
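
The selection behavior discussed in this thread can be sketched roughly as follows. This is only an illustration in Python, not CB's actual code or API: the `Candidate` type, the `pick_engine` function, the `treat_unchecked_as_absent` flag, the match scores, and the "My custom engine" name are all hypothetical. It assumes that candidates are ranked by match quality, that only the best match is used, and that the "treat unchecked types as not present" option removes unchecked types before the ranking happens.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    name: str           # captcha type / engine name
    match_score: float  # hypothetical 0..1 score for how well the image matches
    checked: bool       # whether the user left this type enabled


def pick_engine(candidates: List[Candidate],
                treat_unchecked_as_absent: bool = False) -> Optional[str]:
    """Return the engine name to use, or None if the captcha is skipped."""
    pool = candidates
    if treat_unchecked_as_absent:
        # Assumed effect of the option: unchecked types never enter the ranking.
        pool = [c for c in pool if c.checked]
    if not pool:
        return None
    # Only the best-matching candidate is considered.
    best = max(pool, key=lambda c: c.match_score)
    if not best.checked:
        # Best match is disabled: "skipped by settings", no fallback to the next one.
        return None
    return best.name


# Made-up scores: the unchecked type happens to match slightly better.
candidates = [
    Candidate("Pligg (blue)", 0.9, checked=False),
    Candidate("My custom engine", 0.8, checked=True),
]
print(pick_engine(candidates))                                  # None
print(pick_engine(candidates, treat_unchecked_as_absent=True))  # My custom engine
```

Under these assumptions, unchecking an engine alone only causes a skip when that engine is the best match, while enabling the option lets the next-best checked engine win instead.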
