Lukas Steger
  • About
  • Articles
  • Contact

Articles

April 11, 2020

TidyTuesday Week 15: Internationalisation of the Tour de France

I thought the Beer Production data would be my favorite dataset for TidyTuesday, but this week’s Tour de France data from the excellent {tdf} package by Alastair Rushworth is going straigth to the top spot! The data mainly comes from Wikipedia and has all sort of data on TdF winners and stages. Being quite familiar with the Tour de France and its stats I’m less interested in stage lengths and winning margins, but I would like to analyse the development of internalisation of the Tour de France and cycling as a sport.
March 31, 2020

TidyTuesday Week 14: Beer Production in the US

I have to say as a homebrewer this week’s #tidytuesday got me quite excited. 🍺 This is the first time I use the {ggdark} package to convert regular ggplot2 themes into a dark version. The data was mainly gathered from pdf report using {pdftools} and some {stringr} magic, therefore the data includes total rows and some missing and duplicate values. Before working on the visualization I will spend some time on cleaning the dataset to then answer the question “How did the total beer production per year develop over the last 10 years by brewery size?
March 5, 2020

Who is going to finish the Köln Marathon?

Inspired by this question on Quora I wanted to dig into marathon results and in particular the “DNFs” (participants who started the marathon but didn’t make it to the finish line). I would have liked to do my little analysis on the results from the Berlin Marathon, but unfortunately I couldn’t find any results including the DNFs (if you have a hint where to find them please let me know 😄).
  • ««
  • «
  • 1
  • 2
  • 3
  • 4
  • »
  • »»
© Lukas Steger 2020