How To Create a Data Journalism Team

Print More

Five years ago, reporters who looked into databases or sought help from hackers and web coders were seen as the nerds of the newsroom. The current atmosphere is different: we’re all aware of concrete stories of corruption, including global investigative projects like the Panama Papers, which have underscored the alliances between reporters and technology. That a 2017 Pulitzer Prize went to the investigation that uncovered the dark side of the offshore industry is recognition of the rigor and impact of this new way of working.

In Latin and Central America, there is a large gap between technology and journalism, but despite that, eight legacy media and six digital native sites have formed data journalism teams inspired by the experiences of The New York Times, The Guardian, ProPublica and the Los Angeles Times.

Ojo Público logo

I am a co-founder of one of these sites: Ojo Público, a digital media site comprised of six journalists and two developers specializing in data reporting. Our piece, “Memoria Robada” (Stolen Memory), received the Third Latin American Award for Investigative Journalism in 2016. It was a regional investigation that used big data to provide evidence of the scale of trafficking of cultural goods in Latin America, illustrating designation as organized crime.

The data journalism units mentioned all have distinct characteristics, in team size, profile and work dynamics. But, in a short period of time, most have also made shocking revelations and have been recognized by the prestigious Data Journalism Awards, organized by the Global Editors Network, which identify the most innovative stories supported by data.

These teams are located in Argentina, Peru, Costa Rica, Colombia, Brazil, Mexico and Chile. I have had the privilege of meeting various members and learning about their experiences, which gave me insight into some of the fundamental traits that may be useful to other journalists and techies who are looking to follow in their footsteps.

1. Size Is Relative

“One of the keys to the success of a data journalism team is the diversity of its members,” said Ricardo Brom, director of data intelligence from Argentinean newspaper La Nación. An electrical engineer by training, Brom left his post as director of technology services of the newspaper in early 2011 to become the brain behind coding at La Nación Data, the data journalism department at this paper, which has become the most respected of its kind in the region.

The team is made up of a journalist coordinator with database management skills; a reporter who is an expert in the laws around access to information; two programmers, a data analyst and a designer in charge of data visualizations. They are dedicated to medium- and long-term special projects, but also collaborate with several reporters from different sections of the newsroom who require assistance.

The La Nación data team

The La Nación Data team has been given a significant role in the paper’s investigations. However, this does not mean that only large teams can produce impactful stories with databases. The experience of small digital media sites like Ojo Público and Convoca in Peru, both winners at the Data Journalism Awards, shows that it’s possible to create excellent work with a small team. But it also depends on the extent to which journalists integrate a developer – one with sufficient knowledge of programming languages for database management– into their newsrooms.

2. The Developers

In the field of web development, there are professionals and the self-taught with diverse knowledge on computer programming. But a data journalism team needs developers with two types of profiles: the back-end, who works on the server and manages the database, and the front-end, who is responsible for the visible part of the platform. If it’s only possible to count on one of them, one must start to find and integrate the back-end.

These are the job characteristics of both:

  • The back-end developer is in charge of programming different components of the website and management of the database. They know programming languages like Java, PHP, Ruby and Python, as well as database management programs like MySQL, Postgres, SQL Server and MongoDB. The developer must also have knowledge of digital security to prevent attacks on the server.
  • The front-end developer builds the code of all visual components of a site. The developer must also know three programming languages: HTML (Hyper Text Markup Language), which allows the structure of all web platforms; CSS (Cascading Style Sheets), which organizes presentation and appearance (colors, types, font sizes); and JavaScript, which offers interactivity (animations, buttons, windows on the page).

3. The New Journalism Mentality

The fundamental challenge for reporters of a data journalism team is not to learn to use the most sophisticated digital tools, but to think differently about the issues that should be investigated to build new ways to ask and seek information, and to create hypotheses and working methods.

In traditional journalism, the reporter focused on identifying a relevant case. Data journalism looks for patterns, trends and identifiable phenomena. Your job is no longer a solitary activity, instead it involves a team — using the help of technology — that provides correct formulas and relevant sources which are used to question databases quickly and find clues or evidence previously unimaginable.

4. Collaborative Spirit, Integrated Space

All team members must have in-depth knowledge of the background of the project or research skills to contribute to the team. Each team member has a clear role, but that doesn’t mean they will be limited in their tasks. They must be willing to collaborate with ideas and solutions throughout the reporting process. This may include the best ways to collect or extract information, methods for testing and verifying and alternatives for presenting information to readers and users in the clearest manner possible.

This approach requires the group to interact in the same space. This will make it easier for reporters to learn to communicate in the language of programmers to handle big data. In turn, programmers will learn to think like journalists when they create codes to cross or analyze a mountain of information in search of understanding.

Large media companies such as The Guardian and The New York Times have data teams within newsrooms which work closely with editors and reporters from other areas. In small digital media operations specializing in investigations with big data, such as Ojo Público, newsrooms are designed as data teams.

5. Basic Work Tools

No matter what type of information the members of the team will access, it will be essential to know how to use a spreadsheet. The basic toolbox also includes a program to clean and standardize data like Open Refine and tools to create simple visualizations like Infogr.am and Tableau to complex visualizations like D3.js, a JavaScript library that allows a variety of interactive graphics.

For projects involving the management of large databases, like the OffshoreLeaks or Panama Papers, you will need more advanced programs like Neo4j, which allows identification between large quantities of data and shows them in graphics of nodes and edges, making reading relationships between different data points more intuitive. In this case it will be crucial to have input from web developers.

6. The Visual Component

Often, the findings or news originating in one or more of the crossed databases needs to be told in a language other than text to make them more understandable: fixed or interactive visualizations. Its main purpose is not aesthetic value, but clarity and comprehension.

As described above, a team of data journalists have reporters who know how to use simple tools to build visualizations. The one or more developers should be able to create a code or reuse any visual to come up with innovative ways to tell their stories. Two fundamental books that should be in the group’s library are: Facts are Sacred by Simon Rogers and The Functional Art by Alberto Cairo.

7. Ongoing Trainings

Staying updated on new available digital tools and the most innovative ways people are using technology to enhance journalistic research, training and ongoing exploration is part of the team’s daily routine. Following data blogs, such as these from The Guardian, La Nación Data and ProPublica, is as necessary as participation in hackathons, online courses and local and international workshops.

8. Integrating Local Community

Hackathon: ISOJ 2017 joined journalists and developers to create new apps for accountability journalism.

There are several organizations throughout the world that promote the best use of technology in journalism. Being part of them can foster knowledge exchange, networking and appreciating the talent and latest global trends in data journalism. Three organizations stand out for their scope:

  • Hacks/Hackers, with chapters in more than 90 countries
  • Global Editors Network, which has 1,300 members and organized the Data Journalism Awards
  • Investigative Reporters and Editors, which has promoted NICAR (National Institute for Computer-Assisted Reporting) since 1994. It holds an annual summit in the US of experts in computer-assisted reporting

9. A Transparent Method

One of the traits that readers are most thankful for, and that news teams are able to produce in data journalism, is transparency. The majority of data journalist groups, after findings are published, explain how research was carried out, what models or statistical criteria were applied and what limitations exist in the story. In addition, some groups release the databases they used in their platforms on GitHub to be reused by other journalists or people interested in exploring the topic.

We must also remember that one of the great advantages of working with programmers is the automation of several processes, such as extracting public data from government websites to build databases that were previously located in hundreds or thousands of documents in PDF or JPG format. After the team is finished using the documents, the accepted community practice is to then release all information of public interest.

10. Original Focus

The way data journalism teams work allows the design of investigations on diverse themes with original and innovative focuses, starting from how we show something is true, to the narratives for presenting the findings.

For example, how do we explain the impact of private corporate policies on citizens’ health? With this platform, you are able to know if your doctor or preferred clinic has an outstanding malpractice complaint, and who is behind it. How do we explain the impact of trafficking of cultural heritage items in the region? “Memoria Robada” (Stolen Memory) tracks 2 million data points and combines research, a database of stolen objects around the world and all available official documents. We are reaching a point where we might be able to unite rigorous methodologies (journalism and coding) to expand the boundaries of journalism.


This article originally appeared on the Journalism in the Americas blog and is cross-posted here with permission. The excerpt is part of the book Innovative Journalism in Latin America, published by the Knight Center for Journalism in the Americas, with the help of Open Society Foundations’ Program on Independent Journalism. An Arabic translation is available here.

Fabiola Torres López is a founder and editor of OjoPúblico. She is an ambassador of Sembra Media in Peru and co-author of The Swiss Army Knife for Journalists: Digital research tools in the big data era, a data journalism guide for Hispanic investigative reporters.

Leave a Reply

Your email address will not be published. Required fields are marked *