Skip to content

Big scrape domain (by extension) reward

KaineKaine thebestindexer.com
edited March 2014 in GSA Search Engine Ranker
Hi everyone. 
I just offer you something interesting and only participants will be rewarded. 
I created lists for bruteforce domaine scraping. It's lists are sorted by extension. 

Each list contains 475,255 rows Google gives 1000 results by search ...
each list are going to do more than 100 million results in minimum (domain dedupe trim to root).

Each person who scrape a list have the right to retrieve the result of others. 

The scrape must be complete and realized on google without footprint, maximum result per page = 1000. The goal is to retrieve a maximum domain. 
If someone is interested in. which manifests and say what he want to deal list. 

Do not drop matches on the forum, the exchange will be made by MP (so that those who want to enjoy, do them).

Here are the lists available at the moment, if you want a particular called me.

image
«13456

Comments

  • KaineKaine thebestindexer.com
    I start .fr list.
  • i didn't understand what's in those files .
  • KaineKaine thebestindexer.com
    I send you pm

  • KaineKaine thebestindexer.com
    If you are interested ask for the list to scrape that interests you.
  • @Kaine I will scrape .co.uk
  • KaineKaine thebestindexer.com
    Ok i pm you :)

  • I'd do the .com :)
  • KaineKaine thebestindexer.com
    Hi fakenickakl :)

    .com is specifique (multi-langage) specify langage used after scraping.

    I PM you :)

    At this time i have + 1 000 000 domain Fr
  • Okay i will take le german.
  • KaineKaine thebestindexer.com
    Thank's PM sent too :)
  • KaineKaine thebestindexer.com
    edited March 2014
    .Net just taken too.

    These are not lists of words, but request. The interest is to recover a maximum of domain extensions. Google is the best database.

    Summary file at this time and still available:

    (Once the work is finished it will not be possible to recover the joint work)


    image

    If you see another interesting extension to add mention it.



  • I will scrape .org
  • KaineKaine thebestindexer.com
    edited March 2014
    I PM you :)

    .Org taken too.

  • goonergooner SERLists.com
    @kaine - I'll do it too, give me without extension you want, i don't mind :)
  • KaineKaine thebestindexer.com
    lol I don't know ... maybe .edu or .gov may be interested people :)
  • goonergooner SERLists.com
    Sure, i'll do .edu
  • KaineKaine thebestindexer.com
    Ok i PM you :)

  • goonergooner SERLists.com
    Thanks :)
  • KaineKaine thebestindexer.com
    edited March 2014
    It is I who thank you. 
    I think we'll have a big packet domain to exchange ;)

    .Edu is taken now.



    History to identify the thing: It's free labor.
     
    Each manages it as he wants, all the participants are free to exchange their list when they want. 

    The only rule is to share only to those who will participate. 
    If someone wants to stop, he should say to liberate space for another member. 

    I think we'll have to sort all lists in one place so that participants can access a time the job ends.


  • Trevor_BanduraTrevor_Bandura 267,647 NEW GSA SER Verified List
    edited March 2014
    What ones are still available?

    I'll do .org I guess if thats still available.
  • KaineKaine thebestindexer.com
    Same at last picture less:

    -.Org
    -.Edu

    :)
  • Trevor_BanduraTrevor_Bandura 267,647 NEW GSA SER Verified List
    .ca available?
  • KaineKaine thebestindexer.com
    edited March 2014
    Yes :) I PM you

    .Ca taken now.


    MAJ


    image
  • KaineKaine thebestindexer.com
    .Ch taken too now.
  • I'll do the .ch ;)
  • KaineKaine thebestindexer.com
    I have Pm you :)
  • KaineKaine thebestindexer.com
    edited March 2014
    For those who have started their list, how it goes? (Normally even in dedoublonnant on the fly, the file must grow very quickly).

    Once completed I need to know if multi-language extension could benefit from complementary scrape. 
    Change Language, test a little and see if the domains are added.

    Do not forget to trim at root and deduplicated, otherwise the files will be huge :)


    I advise those who are not yet working with us to choose a list quickly. Take any Laquel, you will get the work of others. 

    Once finished it will be impossible to retrieve this huge list of domains.

    We will not share this work.


  • I want to participate. Do you have any list left?
  • KaineKaine thebestindexer.com
    edited March 2014
    Yes you have préférence ?
    Lest screen less: .Ch

    I send you by PM: .gov

    .Gov now taken.


    Actual refreshed list to scrap:

    image

    If you do not add new domain to scrap it remains only 9 seats available.



    jjumpm2  is responsible for domain.CO.UK

    fakenickahl  is responsible for domain.COM

    DonCorleone is responsible for domain.DE

    Justin is responsible for domain.NET

    ewandy is responsible for domain.ORG

    gooner is responsible for domain.EDU

    Trevor_Bandura is responsible for domain.CA

    vort3x is responsible for domain.CH

    vifa is responsible for domain.GOV

    @kaine is responsible for domain.FR


  • sorry , I will be out of town till Monday . If everyone is in a hurry , please scrape .de domains . I had scraped 5K domains till now . .de File is on my wall . Otherwise I will resume scraping after monday . sorry again .
Sign In or Register to comment.