A Poor Journalist’s Text-Mining Toolkit

How can you search and analyze collections of documents on your own computer with simple tools? At DataHarvest, Robert Gebeloff and I ran a workshop to answer that question. As people seemed interested, here's a write-up of the two key tools we worked with: Apache Tika for content extraction and regular expressions in Sublime Text as an advanced search tool.


Top Ten #ddj: The Week’s Most Popular Data Journalism Links

What's the data-driven journalism crowd tweeting? Here are top links for June 10-22: Voting habits of Americans (@nytimes); Eurocup players (@br_data); FT dataviz guide (@digiday); stats for journalists (@ddjournalism); #ddj awards recap (@smfrogers); nursing homes in Germany (@source); & more.

Humanizing Data: Using Numbers AND People

With the growing relevance and popularity of data journalism, it may be easy to prioritize numbers over people, and spend our time emphasizing the data through graphics, maps, charts, and other visual products. But sometimes the faces and names behind the data get lost. Fortunately, we have some guiding lights to keep us on the path of good journalism. Consider these highlights from the panel Humanizing Numbers at IRE16...

From Research to Publication: A Snap Look at Tips from IRE16

When it's time to start a new investigation, journalists prepare themselves in different ways: from doing the research to pitching the story to building the narrative. It's not about a single formula, but about integrating different resources and strategies. Here is a selection of tips from the 2016 conference of Investigative Reporters and Editors (#IRE16), useful for starting and developing investigations.

Sheila Coronel’s Speech Gives Inspiration at IRE16

The awards luncheon at the annual IRE conference featured a moving keynote address by Columbia University's Sheila Coronel. Her speech focused on the contagious and empowering spirit of collaboration taking hold among investigative journalists worldwide.


GIJN Board Members Elected – Welcome to Jung, Nazakat

The results from last week's election of GIJN board candidates are in. Electronic voting by GIJN members elected two new board members: Syed Nazakat, representing Asia/Pacific, and Eva Jung, representing Europe. In all, seven seats were up for election, each for a two-year term. Five other board members were re-elected: Anton Harber (Wits University Investigative Journalism Programme), Carlos Eduardo Huertas (Connectas), Oleg Khomenok (Internews), Bruce Shapiro (Dart Center), and Margo Smit (VVOJ).

IRE16 Keynote

Coronel: A Golden Age of Global Muckraking at Hand

Ten years ago, when I first moved to New York and gave my first lecture at the Columbia Journalism School, I told students that I believe we are at the dawn of a Golden Age of global muckraking. They were a great class, but they didn’t believe me. But look at where we are now.

Top Ten #ddj: The Week’s Most Popular Data Journalism Links

What's the data-driven journalism crowd tweeting? Here are top links for May 31-June 9: EU Swings Right (@NYtimes); #ddj newsrooms (@niemanlab); German commutes (@zeitonline); French police violence (@BuzzFeedFrance); Dataharvest HackDay (#thesponge.eu); Ethnic America (@WashingtonPost); & more.