The pageview statistics of the Wikipedia projects are accessible through the wikistats projects.
Webtool | A simple page view analysis web tool | tools.wmflabs.org |
Raw data since 2015 | Correced raw data since may 2015 | https://dumps.wikimedia.org/other/pageviews/ |
Raw data 2007 - Dec. 2015 | Raw data 2007 - Dec. 2015 | https://dumps.wikimedia.org/other/pagecounts-raw/ |
REST API | returns JSON, data since 2016 | wikimedia rest api |
Raw data definition from: pagecounts README.txt
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 |
page_title: For pagecounts files, it holds the title of the page. E.g.: Main_Page Berlin For projectcounts files, it is "-". * count_views the number of times this page has been viewed in the respective hour. * total_response_size the total response size caused by the requests for this page in the respective hour. en Main_Page 42 50043 means 42 requests to "en.wikipedia.org/wiki/Main_Page", which accounted in total for 50043 response bytes. And de.m.voy Berlin 176 314159 would stand for 176 requests to "de.m.wikivoyage.org/wiki/Berlin", which accounted in total for 314159 response bytes. ".wikipedia.org" -> "" ".wikibooks.org" -> ".b" ".wiktionary.org" -> ".d" ".wikimediafoundation.org" -> ".f" ".wikimedia.org" -> ".m" (only for some projects. See below) ".wikinews.org" -> ".n" ".wikiquote.org" -> ".q" ".wikisource.org" -> ".s" ".wikiversity.org" -> ".v" ".wikivoyage.org" -> ".voy" ".mediawiki.org" -> ".w" ".wikidata.org" -> ".wd" |
Further associations with wiki terms/pages
eyeplorer – context of Äthiopien
Wikipedias Category structure vs. UDP (Universal Decimal Classification)
My Wikipageviews project …
… dumps terms of interest (=Wikipedia pages) in a MySQL database so i am able to attach a BI Tool and and compare the data to different sources.
Advantage to the sources given above:
- 1 hour resolution starting from 2014 (if desired i can go back to 2007)
Disadvantage:
- Adding a new term takes ca. 4 days
My current data structure looks like this:
Column | Examlpe | Datatype | Comment |
---|---|---|---|
IDWIKI | 123456 | int | unique |
COUNTRY | en | char | de or en |
ARTICLE | Ethiopia | varchar | Wiki Article |
HITS | 54 | int | pageviews(*) |
NUM | 6547 | int | bytes delivered(*) |
HITSTIMESTAMP | 2014-01-01 04:00:08 | datetime | wiki timestamp |
INSERTED | 2017-04-26 11:07:50 | datetime | inserted it this DB |
SUBWIKI | m | char | m - mobile (*) |
The terms currently available are:
de – Addis_Abeba
en – Africa
de – Afrika
de – Amharische_Sprache
de – Äthiopien
de – Entwicklungsland
de – Entwicklungspolitik
de – Entwicklungszusammenarbeit
en – Ethiopia
de – Karlheinz_Böhm
2 Responses to wikipedia pageview statistics