Accessibility Settings

color options

monochrome muted color dark

reading tools

isolation ruler

Resource

Topics

Scraping Data 

Read this article in

Scraping refers to using a tool or writing a program that automatically pulls data from a website. Below are some resources for learning to scrape data from websites, no matter what your comfort level with coding.

This chapter from The Data Journalism Handbook 1 includes tips for scraping and some code examples.

Journocode (2019) offers a great overview of scraping basics by a group of journalists and technologists based in Germany.

Samantha Sunne outlines the basics of scraping in this presentation. It also contains links to more tools for beginners to start scraping websites.

Republish our articles for free, online or in print, under a Creative Commons license.

Republish this article


Material from GIJN’s website is generally available for republication under a Creative Commons Attribution-NonCommercial 4.0 International license. Images usually are published under a different license, so we advise you to use alternatives or contact us regarding permission. Here are our full terms for republication. You must credit the author, link to the original story, and name GIJN as the first publisher. For any queries or to send us a courtesy republication note, write to hello@gijn.org.

Read Next

Data Journalism News & Analysis

From Space to Story in Data Journalism

Over the past 10 years satellite imagery has become an important component of data journalism. In the next 10, it will likely evolve further, from a tool used primarily for illustrating stories to an integral part of research and investigative reporting.