wikipedia pageview statistics

The pageview statistics of the Wikipedia projects are accessible through the wikistats projects.

Ways to access the wikipedia pageviews
Webtool A simple page view analysis web tool tools.wmflabs.org
Raw data since 2015 Correced raw data since may 2015 https://dumps.wikimedia.org/other/pageviews/
Raw data 2007 - Dec. 2015 Raw data 2007 - Dec. 2015 https://dumps.wikimedia.org/other/pagecounts-raw/
REST API returns JSON, data since 2016 wikimedia rest api

Raw data definition from: pagecounts README.txt


Further associations with wiki terms/pages

eyeplorer – context of Äthiopien

Wikipedias Category structure vs. UDP (Universal Decimal Classification)


My Wikipageviews project …

… dumps terms of interest (=Wikipedia pages) in a MySQL database so i am able to attach a BI Tool and and compare the data to different sources.

Advantage to the sources given above:

  • 1 hour resolution starting from 2014 (if desired i can go back to 2007)

Disadvantage:

  • Adding a new term takes ca. 4 days

My current data structure looks like this:

The terms currently available are:

de – Addis_Abeba
en – Africa
de – Afrika
de – Amharische_Sprache
de – Äthiopien
de – Entwicklungsland
de – Entwicklungspolitik
de – Entwicklungszusammenarbeit
en – Ethiopia
de – Karlheinz_Böhm

Summery of the Content

About ralf

I studied Geology at University Erlangen and got my PhD (bio-nanotechnology) at TU Dresden. In my spare time i program simulations and tinker around with data prediction methods. Frisbee is my favorite sport and i play guitar when my friends and i meet to make some music.
This entry was posted in data processing, database and tagged , . Bookmark the permalink.

2 Responses to wikipedia pageview statistics

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.