Web Scraping: A Journalist’s Guide

$8 billion in just a few hours earlier this year? It was because of a web scraper, a tool companies use, as do many data reporters. A web scraper is simply a computer program that reads the HTML code from webpages and analyzes it. With such a program, or “bot,” it’s possible to extract data and information from websites.
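
To make the idea concrete, here is a minimal sketch of such a program, using only Python's standard library. It reads a piece of HTML and pulls out every link along with its text. The sample page, the `LinkScraper` class and the `scrape_links` function are hypothetical names made up for this illustration; a real scraper would first download the page and would usually need to handle messier markup.

```python
from html.parser import HTMLParser


class LinkScraper(HTMLParser):
    """Collects (text, href) pairs for every <a href=...> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the link currently being read, if any
        self._text = []      # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; links without href are skipped
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a link with an href
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def scrape_links(html):
    """Return a list of (link text, link target) pairs found in the HTML."""
    parser = LinkScraper()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical page content, standing in for HTML downloaded from a website
SAMPLE_PAGE = """
<html><body>
  <h1>Reports</h1>
  <a href="/reports/2019.csv">2019 data</a>
  <a href="/reports/2020.csv">2020 data</a>
</body></html>
"""

for text, href in scrape_links(SAMPLE_PAGE):
    print(text, "->", href)
```

The same pattern, reading tags and keeping only the pieces of interest, extends to tables, headlines or prices, which is what makes scrapers useful for gathering data that a site does not offer as a download.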

The Research Desk: Tools for Tweets, Domain History, Data

We’re back with another selection of web resources and reports that might be of interest to journalists around the world. On the list this week: new reports from the International Labour Organization, Congressional Research Service, and UK House of Commons; and tools to search domain ownership, load tweets into a spreadsheet, and search open data. Good hunting!

Techniques for Online Research and Investigation

The most essential tool for open source investigation is the search engine. Add social media, domain look-ups and traditional sources such as newspapers and telephone directories, and you can dig up a great deal of useful information for an investigative report just by searching the internet.

Online Research Tools and Investigative Techniques

Search engines are an intrinsic part of the array of commonly used “open source” research tools. Together with social media, domain name look-ups and more traditional solutions such as newspapers, effective web searching will help you find vital information. Many people find that search engines often bring up disappointing results from dubious sources. A few tricks, however, can ensure that you corner the pages you are looking for, from sites you can trust. The same goes for searching social networks and other sources to locate people.