About the data

The ultimate goal of The Gamma project is to make data-driven articles such as the ones presented on this page fully open and reproducible. This means that they should contain all the code needed to obtain the data from the original source. The current version is not quite there yet - it focuses on letting readers reproduce all the computations that were done when building visualizations and also to create and share custom perspectives on the data.

However, you can still get the raw data in a CSV format from the project GitHub. This was obtained by combining data from The Guardian, which has a fantastic data set of medals until 2008 and adding results from 2012 by scraping data from the BBC. If you are interested, you can find the F# source code here (the file also tries to get data from olympic.org, but ironically, this is not nearly as complete as the Guardian table...).

When you run any visualization on this site, it accesses data live from a simple REST service that exposes the raw data and a more sophisticated REST service that implements the grouping operations. The services follow the protocol described here and can be also called from F# via the REST provider.

If you are interested in the project or you would like to collaborate on visualizing interesting data, drop us an email at [email protected], ping us on Twitter at @thegamma_net. All source code is available under Apache 2.0 license on GitHub.

All Time Olympic Medals Table

Everyone who has been following Olympic Games in London 2012 or Rio 2016 knows that the person with the largest number of medals of all time is Michael Phelps, but do you know who is the second and third? As you can see in the following table, the second person (for summer Olympic games) is Larisa Latynina who won 9 gold medals for Soviet Union between 1956 and 1964 and the third is Finnish runner Paavo Nurmi, also with 9 gold medals from 1920s.

options source

Loading content, please wait...

The table shows the most important facts, but there is a lot more information that you can get from the data if you change options of the visualization or if you change the source code that generates it. Here is a couple of simple things you can try on your own:

  • Find out where each athlete competed — To do this, click on the "options" button. This analyzes the visualization and automatically lets you change some parameters. In the "Group by athlete" table, you can add aggregated attributes for the table. Add "concatenate values of Games" and drop "concatenate values of Teams".

  • Who is the least lucky athlete — Counting gold medals is easy, but who has the largest number of bronze and silver medals? To find out, remove all items from "Sort the data" in "options" and specify your own criteria. Choose "by Bronze descending" to find the person with most bronze medals!

  • Look at medals from London 2012 only — You can find this in an alternative version of the visualization, but to do this on your own, click on "source" and change the second line from olympics.data to olympics.'by game'.'London (2012)'.data. This filters the data to only medals from London 2012. As you type olympics., the editor will let you specify other filters too. You can, for example, look at specific teams rather than specific games.