How To Monitor Social Media for Misinformation

by Nic Dias • July 31, 2017

For journalists trying to keep an eye out for misleading claims and content on social networking sites, there is a stupefying number of channels to track and posts to read. But the value of monitoring at least some portion of this content is undeniable. Just consider the recent case in which CNN’s KFile was able to identify the original source of a Trump-CNN smackdown video tweeted by the President.

Thankfully, there are accessible tactics and tools out there to help make the task of monitoring social media more manageable. Before you can begin thinking usefully about how to monitor Facebook or Twitter, though, you have to figure out what you’re going to be monitoring — groups and/or topics. And what you choose will depend on the social platform you’re looking at.

Reddit

Reddit, although still frequently ignored by journalists, is an invaluable source to follow. According to Alexa, it’s currently among the ten most popular websites in the world and even more popular than Twitter. Misinformation that ends up circulating widely on Facebook and Twitter often appears on Reddit first. One classic example from 2013 is the theory that a missing Brown University student was a suspect in the Boston Marathon bombing.

The blue search bar at the top of the subreddit search page (top) and the subreddit suggestions given at the bottom of the general search page (bottom).

Reddit is made up of a collection of open forums called subreddits, whose subjects include everything from aardvarks to alt-right politics. To find subreddits, use the blue search bar at the top of the subreddit search page or use the general search page and look at the subreddit suggestions given at the bottom. Both search engines will look for your query throughout entire subreddits, though each will deliver a slightly different set of results.

The subreddits suggested at the bottom of the general search page, for instance, are ranked by how many times they’ve mentioned your search terms. In either case, though, it’s easy to generate valuable results. Searching for a web domain like thecanary.co will return subreddits that referenced the site. Even a search like “Obama is terrible” will return meaningful results like r/Conservative and r/The_Donald.

Once you’ve found an interesting subreddit, you can search for its name to discover similar subreddits. Also, keep an eye out for new subreddits mentioned in comments.
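If you find yourself running the same searches on both pages, a few lines of code can generate the links for you. This is only a minimal sketch: the two URL patterns and the "q" parameter are assumptions based on how the search pages behave in a browser, not something Reddit guarantees, and the function name is made up for illustration.

```python
from urllib.parse import urlencode

# Assumed URL patterns (not given in the article): both Reddit search
# pages are taken to accept the query as a "q" parameter.
SUBREDDIT_SEARCH = "https://www.reddit.com/subreddits/search"
GENERAL_SEARCH = "https://www.reddit.com/search"


def reddit_search_urls(query: str) -> dict:
    """Return one URL per Reddit search page for the same query."""
    params = urlencode({"q": query})
    return {
        "subreddit_search": f"{SUBREDDIT_SEARCH}?{params}",
        "general_search": f"{GENERAL_SEARCH}?{params}",
    }


if __name__ == "__main__":
    # The two example searches mentioned above.
    for term in ["thecanary.co", "Obama is terrible"]:
        for page, url in reddit_search_urls(term).items():
            print(page, url)
```

Opening both links side by side makes it easier to compare the slightly different result sets each page returns.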

4chan

4chan is its own beast. It’s ephemeral. It’s chaotic. It’s anonymous. It’s ugly. (Be warned: you’re going to see some disturbing stuff.) But it’s also a place where barrages of tweets have been inspired or initiated: 4chan was the first place where the #MacronLeaks documents were posted.

In one sense, 4chan is more straightforward to monitor: by watching only six boards (/b/, /pol/, /int/, /x/, /news/ and /bant/) you can cast a wide net over relevant discussions on the platform. However, these boards move rapidly. Posts also disappear after they’ve been inactive for three days, so you’ll have to use a third-party archive like Archived.Moe to read older posts. Finally, to see all the comments for a thread, you’ll have to click a button or link out to another page. As such, passively monitoring 4chan is not possible without doing some programming.

All that said, you can make the hunt for noteworthy content a bit easier. Using the “Find” feature on your browser, highlight terms like “http,” “twitter,” “facebook,” “mail,” “youtube” and the hash symbol “#” to pinpoint conversations that have a life beyond 4chan. There’s also a search bar at the top of every board, but it’s limited to matching terms in original posts or posts that initiate threads. Be careful when navigating to unfamiliar domains. Google the domain beforehand to check if it’s safe; it’s easy to pick up malware. (Note: I’ll be writing about user scripts in a later post.)
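For those who do want to try a bit of programming, the "Find" trick above can be approximated in code. The sketch below is not a finished monitor: it assumes you have already loaded a board catalog as a list of pages, each with a "threads" list whose items carry "no" (thread number), "sub" (subject) and "com" (comment HTML). That layout is modeled on 4chan's public read-only JSON API, so check the current API documentation before relying on it. The function names and sample data are hypothetical.

```python
import html
import re

# Markers suggested in the article: signs that a thread reaches beyond 4chan.
MARKERS = ("http", "twitter", "facebook", "mail", "youtube", "#")


def plain_text(comment_html: str) -> str:
    """Strip HTML tags, then decode entities such as &amp; or &#039;."""
    no_tags = re.sub(r"<[^>]+>", " ", comment_html)
    return html.unescape(no_tags)


def flag_threads(catalog, markers=MARKERS):
    """Return (thread number, matched markers) for each matching opening post.

    Assumed layout: `catalog` is a list of pages, each page a dict with a
    "threads" list of dicts holding "no", and optionally "sub" and "com".
    """
    hits = []
    for page in catalog:
        for thread in page.get("threads", []):
            raw = thread.get("sub", "") + " " + thread.get("com", "")
            text = plain_text(raw).lower()
            matched = [m for m in markers if m in text]
            if matched:
                hits.append((thread["no"], matched))
    return hits


if __name__ == "__main__":
    # Made-up sample data in the assumed catalog layout.
    sample = [{"page": 1, "threads": [
        {"no": 101, "sub": "Leak", "com": "see https://example.com/files &amp; spread on twitter"},
        {"no": 102, "com": "nothing to see here"},
        {"no": 103, "com": "it&#039;s trending under #SomeTag"},
    ]}]
    print(flag_threads(sample))
```

Like the board search bar, this only looks at the posts that open threads. Matching is by plain substring, so a marker like "mail" will also catch words such as "email".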

Twitter

There are two key ways to monitor Twitter activity: terms and lists.

Using terms to find tweets is a matter of figuring out which of them tend to be used in the relevant conversations. (I’m using “terms” broadly to refer to any string of characters, including domains, hashtags and usernames.) So, ask yourself a few questions:

Do I know any websites that produce misleading content?
Do I expect the tweets I’m looking for to include certain words or hashtags, like “snowflakes” or “#LockHerUp”?
Are there particular accounts that are likely to be mentioned in these tweets?

Go over Twitter’s Search API documentation to get a fuller sense of what you can do with your query. Encase discrete queries in parentheses and link them together with “OR.” If you start running into errors, split your query into parts.

Part of a table of Twitter search operators. Source: Twitter’s Search API documentation.

Once you’ve formed a query, throw it into the Twitter search bar and take a look at what you get back. Notice any new terms that you hadn’t thought of and add them to your list. If you see a lot of irrelevant tweets that contain the same word, eliminate those results using the “-” operator. Repeat this process until you feel you’ve built a well-calibrated search query and save it somewhere on your computer. Conversations online are always changing, however, so it’s important to update your query regularly.
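Because you’ll be revising the query again and again, it can help to keep your terms in a small script and regenerate the query string from them. Here is a minimal sketch of the parentheses-and-OR pattern plus the "-" operator. The function name, the example account and the excluded word are invented for illustration, and wrapping the whole OR expression in an extra pair of parentheses before the exclusions is my own precaution rather than anything Twitter requires.

```python
def build_query(groups, exclude=()):
    """Wrap each group of terms in parentheses, join with OR, append exclusions."""
    parts = []
    for group in groups:
        # Quote multi-word phrases so they are matched as a unit.
        terms = [f'"{t}"' if " " in t else t for t in group]
        parts.append("(" + " OR ".join(terms) + ")")
    query = " OR ".join(parts)
    if exclude:
        if len(parts) > 1:
            # Precaution: make the exclusions apply to the whole expression.
            query = "(" + query + ")"
        query += " " + " ".join(f"-{t}" for t in exclude)
    return query


if __name__ == "__main__":
    groups = [
        ["thecanary.co"],                # a domain
        ["snowflakes", "#LockHerUp"],    # words and hashtags
        ["@example_account"],            # hypothetical account name
    ]
    print(build_query(groups, exclude=["weather"]))
```

Saving the term lists rather than the finished string makes it easy to add a newly spotted hashtag or drop a noisy word and rebuild the query. If the result gets long enough to cause errors, call the function on a subset of the groups.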

Apart from term-focused searching, lists are an effective way of quickly putting together groups of accounts to monitor. Lists are created by a user when they want to follow a group of accounts as a unit and are particularly handy because they let you capitalize on the expertise of other journalists. Use other people’s public Twitter lists to skip loads of work, but be sure to only use the lists of trusted sources. Nonsense lists abound on Twitter.

You’ll have to use a Google hack to search through lists on Twitter. (See this guide on Google search operators.) Add site:twitter.com/*/lists to the search bar. Doing so limits your search to pages whose URLs match the pattern following the colon. In this case, the pattern is that of a URL for a Twitter user’s lists, and the “*” is a universal placeholder for a username. Thus, by adding this term, you can search the public lists of all Twitter users, or more specifically the names of these lists. So think about how someone might name the lists you’d be interested in, trying different iterations of the same or similar terms.

A Google search query for Twitter lists.
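Trying many variations of a list name by hand gets tedious, so here is a rough sketch that generates one Google query per guess. The site: pattern is the one given above; the function names and the example list names are made up, and the Google search URL with a "q" parameter is an assumption about how the search page is addressed.

```python
from urllib.parse import urlencode

LIST_PATTERN = "twitter.com/*/lists"  # pattern taken from the article


def list_search_queries(name_variants):
    """Build one Google query per guessed list name."""
    return [f"site:{LIST_PATTERN} {variant}" for variant in name_variants]


def google_url(query):
    """Turn a query into a Google search link (assumed URL form)."""
    return "https://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    # Hypothetical guesses at how someone might name a relevant list.
    guesses = ["fake news", "fact checkers", "factcheck"]
    for query in list_search_queries(guesses):
        print(query)
        print(google_url(query))
```

Paste the printed links into a browser one at a time, and remember that the results match list names, so short, obvious guesses tend to work best.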

In addition, look at what lists a user has created, or belongs to, by going to his or her profile, clicking “More” and then clicking “Lists.” As a last step, combine the lists you’ve found into super lists using Twitter List Copy. You may want to keep the lists you create private, if you don’t want others to know who you’re sleuthing. (More on Twitter lists here.)

Facebook

The potential to monitor Facebook is narrower than on other platforms. First, you can only see content that has been designated public by users. Second, Facebook does not support direct, programmatic access to the public feed (the stream of all public statuses) for anyone other than a few publishers. This means there is no way to flexibly monitor statuses, and you’ll have to center your search on pages and groups.

Facebook’s search bar is limiting when searching for pages and groups because your search query is only matched with names. So if you’re unsure of what a relevant page or group might be called, this approach isn’t going to get you far. (Watch this video for more on searching Facebook.) Again, Google comes in handy here. Add site:facebook.com/pages or site:facebook.com/groups to your query to search within pages’ and groups’ descriptions in addition to their names. Also, adding “-places” to a pages query will remove all places, like restaurants, from your results.
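The same idea works for the Facebook queries. This short sketch builds the pages and groups searches from a single set of terms and applies the "-places" exclusion to the pages query only. The function name and the example terms are hypothetical.

```python
def facebook_queries(terms, drop_places=True):
    """Build Google queries for Facebook pages and groups from the same terms."""
    pages = f"site:facebook.com/pages {terms}"
    if drop_places:
        # Removes places, like restaurants, from the pages results.
        pages += " -places"
    groups = f"site:facebook.com/groups {terms}"
    return {"pages": pages, "groups": groups}


if __name__ == "__main__":
    for kind, query in facebook_queries("election fraud").items():
        print(kind, "->", query)
```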

You can focus on active pages by changing Google’s date settings: click “Tools,” then “Any time,” and choose a recent range such as “Past month.” Technically, you can search for posts from pages or within public groups with Facebook’s vanilla search bar, but you’re going to have to scroll through a bunch of spam.

Concluding Notes

More and more of what First Draft is interested in tracking is coming in the form of images. Unfortunately, there aren’t yet any free, ready-to-use solutions for passively tracking images on social networking sites. Moreover, the coordination of things like Twitter campaigns increasingly happens on closed, invitation-only messaging platforms like Discord, and accessing these conversations just comes down to good, old-fashioned reporting.

This post originally appeared on First Draft and is cross-posted here with permission. First Draft is dedicated to improving skills and standards in the reporting and sharing of information that emerges online, and offers free verification resources at firstdraftnews.com.

Nic Dias is a computational journalist and a senior research fellow for First Draft. A recent graduate of the Columbia School of Journalism, he has worked on First Draft’s UK Election project and written on digital astroturfing.

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License


