Sanakreon

All the things I like

Archive for the ‘Data Mining’ Category

How internet changed recently

Lately I’ve been wondering how did the internet landscape change in the last 2-3 years. So I wrote a short python script to compile a list of websites that are in top 5000 most popular websites this year, but were not in the top 200,000 most popular websites in 2008. So basically it is a list of popular websites that appeared (or became popular) in the last 2-3 years. Totally there are 821 such websites, and here is a list of them. Popularity data is taken as provided by alexa.com.

Some of these are adult websites(beware!), some of these are not in English language. But many are useful websites which existence you may not have known.

Popularity is measured by the number of connections to the website. These are not necessary when the user visits example.com, but may happen when the website that user visits connects to example.com, as in the case with googleusercontent.com.

First is popularity(alexa) rank(lower is better), then domain name.

The first 10 are:

5 blogspot.com

21 bing.com

32 googleusercontent.com

79 weibo.com

84 renren.com

95 fileserve.com

101 tmall.com

114 hotfile.com

119 imgur.com

127 stackoverflow.com

May 15th, 2011 at 11:15 pm

Posted in Data Mining