Skip to content

Repo for all session materials for the Dataharvest 2025 session on how to automate the data flow from source to chart

Notifications You must be signed in to change notification settings

dw-data/dataharvest25-automate-source-to-chart

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Dataharvest session:
Using free tools to automate your data flows from source to chart

Session by Gianna-Carina Gruen from DW's Data Journalism Team

What you will learn:

How to ...

  • collect data from a url
  • parse the data into the needed format using Python's pandas library
  • use Python's datawrapper library to create a chart
  • set up the script to run automatically on Github Actions

Prerequesites for the session:

Basic coding or Python knowlegde is helpful but not required

  • required: Access to colab.google.com (if you already have a Google account, it's included), so you can open this notebook
  • required: Datawrapper API token (please set it up in ahead of the session)
  • required: Github Account (if you want to set up the workflow to be automated) (please set it up in ahead of the session)
  • optional: Code Text editor (if you want to code along in the session), like Atom or Sublime Text
  • optional: Distill Browser Plugin (if you want to set up the workflow on click)

Code & Data

Code: To follow along, you'll need to make yourself a copy of this jupyter notebook on Google colab.

Data source: For this session, we'll be working with the UNHCR data on arrivals to Europe via land and sea, more specifically: with the URLs provided on the page to the json data

Automation

To automatically run your script on Github Actions, you'll need three things (which are also included in this github repo and can be downloaded at the top.

  • your script as a .py file
  • a requirements.txt file
  • a .yml file wrapped into a folder .github/workflows

You'll be adding all of them into a repository in your own Github account. Then switch to the "Actions" tab to see if your automation run started properly or whether there was an issue that needs debugging.

About

Repo for all session materials for the Dataharvest 2025 session on how to automate the data flow from source to chart

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages