Tag

text mining

2 posts

Data Journalism

Top Ten #ddj: This Week’s Top Data Journalism

What’s the global #ddj community tweeting about? Our NodeXL mapping from June 5 to 11 includes #VisualTrumpery from @mcrosasb, analysis of Theresa May’s election disaster by @GuardianVisuals, dataviz structuring strategies from @eagereyes, and school enrollment woes in Delhi from @htTweets.

Reporting Tools & Tips

A Poor Journalist’s Text-Mining Toolkit

How can you search and analyze collections of documents on your own computer with simple tools? At DataHarvest, Robert Gebeloff and I ran a workshop to answer that question. As people seemed interested, here’s a write-up of the two key tools we worked with: Apache Tika for content extraction and regular expressions in Sublime Text as an advanced search tool.