
Top 1 million Alexa, cleaned

Kaine thebestindexer.com
edited March 2014 in GSA Search Engine Ranker
A small gift for weekend use.

The top 1,000,000 websites on Alexa, cleaned and left in the original order (first is the best):



Like:  

1,google.com

TO




.TXT 22MB:


.RAR 5MB:



If you have any other database, or another source of PR data, I would look into making a list from it.
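
The source data uses `rank,domain` lines such as `1,google.com`. Below is a minimal Python sketch for turning such lines into plain URLs. It assumes an `http://` prefix is wanted and that malformed lines can simply be skipped. The function name `rank_lines_to_urls` is made up for illustration and is not part of SER.

```python
def rank_lines_to_urls(lines, scheme="http://"):
    """Convert 'rank,domain' lines into URLs, keeping the original order."""
    urls = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first comma only: the left part is the rank.
        rank, sep, domain = line.partition(",")
        if not sep or not rank.isdigit() or not domain:
            continue  # skip anything that is not 'number,domain'
        urls.append(scheme + domain)  # assumption: http:// is the wanted prefix
    return urls


sample = ["1,google.com", "2,facebook.com", "13,sina.com.cn"]
print("\n".join(rank_lines_to_urls(sample)))
```

Splitting on the first comma avoids the problem with a plain "replace all", because digits inside a domain name are left untouched.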




Comments

  • justice
    When I import this list with "identify and sort", only 1 link is recognized. I don't have this problem with my other lists. Is this normal? Probably yes, but I am just asking.
  • Kaine thebestindexer.com
    I don't know... The original source is: http://s3.amazonaws.com/alexa-static/top-1m.csv.zip
  • rodol
    Why are you having this problem, justice? I just copied the whole txt and started "identify and sort", and so far it has imported around 5k new URLs. I then decided to stop the identify and sort and imported the list directly into a spam project; this way is faster.
  • hoolak
    @Justice open it in Notepad++, save it as a different .txt file, and then import it.
  • seo4
    Any idea what we can use this list for?
  • Kaine thebestindexer.com
    edited March 2014
    I have seen many people talk about the decline of PR.
    Many mentioned Alexa rank as a better way to estimate the quality of a website.
    If we are going in this direction, the top 1,000,000 should be interesting.
    I'm looking for a website that lists tested PR (for large-scale database scraping).
  • gooner SERLists.com
    edited March 2014
    LOL did anyone actually open it? Most URLs have the 'h' of http missing.
    So replace 'ttp' with 'http' and the list will work in SER.
    Sometimes it pays to actually open a file and look at it  :))
  • rodol
    Wow, read: I just copy-pasted the txt into SER and all is good... You are doing something wrong; this list is good.
  • Kaine thebestindexer.com
    I think it's good too; it must come from your settings :)

  • jonathanjon
    Not sure how to use this either. I tried to import and sort and got nothing out of it.
  • gooner SERLists.com
    Hmmm, I downloaded this the first day it was live but only opened it yesterday.
    The URLs had the 'h' missing. So either the download has changed or I got a corrupted file or something.
    Anyway, I edited the file and it worked well in SER.
    Nice share @kaine
  • davbel UK
    Try copy & paste.

    I tried to import it first and it only recognised one URL, but then when I c&p it worked
  • Kaine thebestindexer.com
    OK, maybe it's the editor's syntax.

    Open it with WordPad (which retains the syntax), copy-paste it into another .txt, then blast.

    Sorry for the inconvenience, I did not think about it.
  • RayBan
    Which editor do you open it in?
    I can only view it as pasted below, and I'm not sure how to edit it. Some of the site names have numbers in the middle, so I am unable to use the "replace all" function.
    1,google.com
    2,facebook.com
    3,youtube.com
    4,yahoo.com
    5,baidu.com
    6,wikipedia.org
    7,qq.com
    8,linkedin.com
    9,taobao.com
    10,twitter.com
    11,live.com
    12,amazon.com
    13,sina.com.cn
    14,google.co.in
    15,hao123.com
    16,blogspot.com
    17,weibo.com
    18,wordpress.com
    19,yandex.ru
  • jonathanjon
    A friendly question here. Would many of these sites be something we can post to?
  • gooner SERLists.com
    Yes, some will be postable. Not the obvious ones like Facebook and Twitter, of course.

    Funny that SER recognises Clickbank as Wordpress article and Twitter as blog comment.
  • Kaine thebestindexer.com
    edited March 2014
    @RayBan try this one (copy-pasted through WordPad into another .txt):

    http://www3.zippyshare.com/v/92680670/file.html
  • RayBan
    Thanks @kaine - I hope SER will be able to post to at least 3% of those.
  • Kaine thebestindexer.com
    edited March 2014
    I tested it quickly (many other problems to resolve) and SER posted more than I was hoping :)

    Test it the way rodol did: with a non-restrictive project.
  • justice
    Thanks @hoolak and all the others for your answers... Now it works like a charm :-)
  • sweeppicker
    Thanks
  • gsa8mycows forum.gsa-online.de/profile/11343/gsa8mycows
    edited May 2015
    Thanks, Kaine, for your list.
    How do you mine the top million?
    Does that come with a paid sub?
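
For the missing-'h' problem reported earlier in the thread, here is a minimal Python sketch. It assumes the damaged lines start with `ttp://` or `ttps://`. A blanket replace of 'ttp' with 'http' would also turn intact URLs into 'hhttp...', so this only touches lines that lack the leading 'h'. The name `restore_scheme` is hypothetical and chosen for illustration.

```python
def restore_scheme(line):
    """Put back a missing leading 'h' on a URL; leave other lines unchanged."""
    line = line.strip()
    # Only repair lines that start with a truncated scheme.
    if line.startswith(("ttp://", "ttps://")):
        return "h" + line
    return line


damaged = ["ttp://google.com", "http://facebook.com", "ttps://youtube.com"]
for url in damaged:
    print(restore_scheme(url))
```

Running each line of the file through the function and writing the result to a new .txt gives a list in which every URL starts with a full scheme.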