Data Journalism

As our governments and businesses become increasingly flush with information, more and bigger data are becoming available from across the globe. Increasingly, investigative reporters need to know how to obtain, clean, and analyze “structured information” in this digital world. bloks

Here is a list of resources to get you started, but we want to keep updating our community with the best resources available. Do you know of a great data tutorial we haven't listed, perhaps in a language other than English? Help us keep this resource guide comprehensive by sending your favorite resource to: hello@gijn.org. ¿Habla español? For resources in Spanish, click here.


Key Resourcesnicar

The National Institute for Computer-Assisted Reporting, a project of Investigative Reporters and Editors, launched in 1989 to train reporters around the world on how to use data as part of broader investigations. In addition to “boot camps” and in-office training, NICAR offers a data library, practice data sets, and hosts the original annual conference on computer-assisted reporting. IRE also publishes the popular book, Computer-Assisted Reporting: A Practical Guide.

Poynter offers Five Tips for getting started with computer-assisted reporting, and 10 Tools to analyze datasets more efficiently.

The Center for Investigative Journalism published a manual on data journalism “for all journalists who want to master the art of interrogating and questioning numbers competently.” CIJ also provides a slew of additional books, guides and video resources of aspects of data journalism.

Dadatadrivenjournalismta-Driven Journalism offers a collection of resources for computer-assisted reporters.

 

Periodismo de Base de Datos provides tutorials and resources on data journalism for Spanish-speaking reporters.

Arab Reporters for Investigative Journalism offers this brief introduction to data journalism (in Arabic).

The International Consortium of Investigative Journalists provides a selection of video tutorials on basic Excel functions, as well as how to background a person or company, or find federal court documents in the U.S.

The International Journalists’ Network maintains a blog of the latest trainings, tools, and resources for data journalists.

Hacks/Hackers is a global movement bringing together computer programmers and investigative journalists to tell powerful data-driven stories. Trainings offered through regional chapters.header-logo

The Investigative Dashboard lists tools for data mining, visualization and social network analysis. Google search your tool of choice and you’ll surely find tutorials on how to begin.

djh-facing-left3 The Data Journalism Handbook is an international, collaborative effort involving dozens of data journalism experts. The free guide is available for download in English, French, Georgian, Russian, and Spanish.


Data Mining

Code Academy offers a series of free interactive trainings on the basics of HTML, CSS, JavaScript, Python, Ruby, and PHP.

Massachusetts Institute of Technology offers a series of free online courses in computer programming with Python, Java, and C++.

Michael Martl publishes an open-source textbook on how to program with Ruby on Rails.

ProPublica ran this “shopping list” of tools and training guides for scraping data from the web using Ruby.

Online Journalism published an introduction to using ScraperWiki to obtain data from the web.


Data Analysis

inertPyrDDJ

Investigative Reporters and Editors provides a simple tutorial to converting PDFs to Text.

Electronic Data Resource Service at McGill provides a tutorial on how to export a table from PDF to Excel.

School of Data offers a series of tutorials – from finding datasets, to basic Excel skills and using the results to tell a story.

Dan Nguyen put together this tutorial on using Google Refine to clean structured data sets, and also links to other video tutorials on Google Refine.

Github offers a “Gentle Introduction to SQL.”


Visualization & Mapping

Edward Tufte's books and courses are industry standards.

Flowing Data is run by statistician Nathan Yau, author of Data Points: Visualization that Means Something and Visualize This: The FlowingData Guide to Design, Visualization, and Statistics.

Visualisationofdata.com offers a directory of compelling infographics, how-to info, and more.

Esri offers a series of free online courses for those interested in mapping with ArcGIS.

Gustavo Faleiros created JEO, a WordPress theme for launching geodata-based sites. It allows news organizations, bloggers and NGOs to publish news stories as layers of information on digital maps.

Peter Aldhous put together a primer on using Excel’s free social network plugin, NodeXL.

The Data Visualisation Catalogue is an on-going project to "help you find the right data vizualization method for your data".


Statistics

OpenIntro hosts this free textbook on statistics

Knight Digital Media Center provides free, two-day online courses.

Flowing Data is run by statistician Nathan Yau, author of

Coursera offers a number of online statistics courses including:

Recommended Books on Statistics:


Data & Technology Blogs

ProPublica Nerd Blog, secrets of data journalists and newsroom developersData Store

Data Blog, the Guardian’s blog on computer-assisted reporting

Nacion Data, Spanish-language data journalism blog of the Argentinian daily La Nación.

Open Knowledge Foundation, global movement to open up knowledge around the world and see it used and useful

Toledol, a Portuguese-language blog about computer-assisted reporting

Computational and Data Journalism, news and technology articles about data journalism

Computational Reporting, all about data mining

Dajore, data journalism research

Driven by Data, how data journalism is sifting through the facts

Vis4.net, random thoughts on information visualization and data journalism

Reporter’s Lab, Duke University’s blog on tools, techniques and research for public affairs reporting.

Tow Center for Digital Journalism, Columbia’s blog on how technology is changing journalism, its practice and its consumption


Books

Computer-Assisted Reporting: A Comprehensive Primer, By Fred Vallance-Jones and David McKie

Computer-Assisted Reporting: A Practical Guide, the E-version by Brant Houston

Computer-Assisted Research: Information Strategies and Tools for Journalists, By Nora Paul and Kathleen A. Hansen

The Data Journalism Handbook is an international, collaborative effort involving dozens of data journalism experts. The free guide is available for download in English, French, Georgian, Russian, and Spanish.

Mapping for Stories: A Computer-Assisted Reporting Guide, By Jennifer LaFleur and Andy Lehren

Precision Journalism: a Reporter’s Introduction to Social Science Methods, by Philip Meyer


Conferences

NICAR hosts the original annual conference on computer-assisted reporting, which is attended by hundreds, and also puts on data-specific boot camps.

Data Harvest is a collaboration between the Journalismfund.euWobbing Europe and FarmSubsidy.org. The next conference is scheduled for May 2014 in Brussels.

The International Journalism Festival in Perugia, Italy, includes a School of Data Journalism training.

The Global Investigative Journalism Conference, held every two years, hosts a broad range of data-specific trainings.

Ghana Databootcamp trains participants in Ghana on how to locate, obtain and analyze public data on the extractive industries.