An aerial image shows a meeting of a Russian nationalist group. Image: Screenshot, BBC Eye

Stories

•

Topics

Building a Multi-Agent AI System To Sift Through Thousands of Russian Social Media Posts

by Christopher Giles, Serdar Tumgoren, Chris Zubak-Skees, and Marc Perkins, The Reuters Institute • June 25, 2026

Any journalist who’s done online investigations knows there’s simply too much evidence for one human to ever collect or investigate. Too often, we are overwhelmed with a flood of information: tens of thousands of social media posts, images, and other media.

Our team from BBC Eye, which works on original documentary investigations from around the world, wanted to see if AI could help solve this problem. We opted for AI agents — a collection of large language models (LLMs) that can coordinate and execute multi-step tasks under human supervision. When connected to external environments such as the internet or databases, these agents can fetch and analyze relevant social media content at scale, performing work that our team might otherwise not have the time to undertake.

We used this approach as part of a recently published investigation on Russia’s rising nationalist vigilante movement, building a multi-agent AI system we named Haystack to help us explore new emerging forces in daily Russian life. The team included BBC Eye’s open source investigators, who are specialists in gathering and analyzing public information, and Russia-focused reporters, along with help from computational journalists at Stanford University. This is how we developed and used this system.

Why We Created Haystack

While we did extensive on-the-ground reporting in Russia for the investigation, the country’s restrictive journalism environment made this kind of analysis additionally beneficial in supporting and deepening our research.

The project began when we noticed that social movements were disrupting Russia’s domestic politics while the Kremlin was waging its full-scale invasion of Ukraine. New groups emerged promoting far-right and nationalist beliefs, and these views were disseminated across Russian social media.

Using source reporting and data journalism methods, the BBC Eye global investigations team had already revealed how Russkaya Obshina, currently the largest nationalist group, was operating a nationwide campaign against migrants and those opposed to “traditional values,” in concert with the Russian authorities. The team also saw financial documents that suggested the group had been funded by charitable foundations run by figures close to the Kremlin (a finding Russkaya Obshina rejected).

The reporting included over a dozen interviews with former and current members of the group, migrants, Russian citizens who have been targeted, and experts monitoring the situation. But to understand the scale of this movement, the team created and leveraged a new AI system to harvest, analyze, and surface leads at a scale we could never have handled using traditional methods.

We were aiming to use AI to mirror multiple tasks of an investigations team in a single application: a computational journalist interacting with a webpage or API to download data, a reporter assessing social media posts for leads, and a data specialist producing numerical findings.

It was our first foray into multi-agent work and required lots of experimentation and iteration: both in building an agent-driven workflow that made sense for reporters, and ensuring that the agents at each step were performing their tasks effectively.

We used Haystack to gather 10,000 social media posts from over 10 Russian nationalist groups and produced 55,000 assessments of their content for signals such as nationalist ideology, references to migrants, anti-migrant raids, and expressions of violence against minority groups.

With Haystack’s assistance, the team found that Russkaya Obshina appeared to have the most prolific on-the-ground presence when compared with other nationalist groups, organizing patrols across Russian towns and cities, and raids on workplaces, shops, nightclubs, and hostels.

A nationalist vigilante interviewed by the BBC explained their motivations, and why they target migrants and those who extremists say go against “traditional values.” Image: Screenshot, BBC Eye

How Haystack Works

Haystack stitches together multiple AI agents that perform a variety of tasks. Those include:

Fetching posts from Russian social media sites that are home to several nationalist groups.
Assessing image and text-based social media posts, for example: to determine if the content includes nationalist, racist, or anti-immigrant language or imagery.
Performing data analysis in response to natural language human prompts, for example: “How many posts contain references to law enforcement raids on migrant laborers at work sites?”
The team built the application using LangGraph, a programming library that enables developers to build AI workflows by connecting multiple agents that can perform these different tasks in one standalone structure. LangGraph allows developers to integrate any of the popular open source or cloud-hosted LLMs, such as OpenAI’s GPT and Google’s Gemma models.

From a reporter’s perspective, the system resembles one of those choose-your-own-adventure books. The journalist can interact with Haystack and travel down one of several well-defined paths.

An initial challenge was ensuring the supervisor agent, which delegates tasks down chosen research paths, followed the direction provided by the journalist. We solved this issue by giving the supervisor agent examples of accurate behavior and having the system ask clarifying questions to resolve unclear reporter input.

At each stage of the process, the reporter is asked to provide instructions and clarifications to agents. We had explored building a more automated system, where agents would independently take multi-step decisions, like collect, assess, and analyze data all based on a single complex prompt. But we found that having the reporter provide input at each stage of the process, determining which and how many posts an agent should assess, reduced the chance of LLMs lapsing into guesswork and taking the research down unintended routes. We also thought it was important to have a journalist in the loop, so that oversight was maintained over agent decision-making.

Once we’d designed the system, the team put it to work by creating a seed list of nationalist and far-right groups that we were interested in investigating. Drawing on this list of over 10 nationalist groups, we used the collection agent to gather posts. This process created a unique dataset that the system could then assess for journalistic leads — helping us understand the nature of the nationalist movements that appeared to be growing in Russia.

Our next step was to ensure the agent’s assessment of online content worked reliably, so we continually checked Haystack’s output and made adjustments to prompts to improve the results.

As we spent time using the system, we discovered that responses got more helpful to our reporting as we removed ambiguity from the inputs’ wording. For example, in the beginning journalists would ask the system, “Does this post contain raids? Please label it ‘definitely,’ ‘definitely not,’ ‘probably,’ ‘probably not,’. But we found that by narrowing the LLM’s options to “yes,” “no,” “not sure,” we surfaced more precise leads.

In the background, this process was building a large database where we would review the assessments and examine the reasoning the LLM applied to its decisions.

The system also allowed reporters with little or no technical training to analyze data, for example, counting the number of posts describing anti-migrant raids. Traditionally, this type of data analysis requires fluency in programming and database querying language. Now a reporter without data training can ask questions in natural language, which Haystack translates into database queries before returning a straightforward summary of the query’s results.

Once we were confident in the system we set Haystack loose to gather and assess more social media posts, surfacing hundreds of leads that we might have overlooked, which were also verified by members of our team who manually reviewed the evidence.

How Haystack Has Helped our Journalists

The system allowed us to start quantifying the degree to which nationalist groups were inciting violence against migrants, conducting on-the-ground raids, and mobilizing alongside Russian authorities in street-level actions.

This is work that would have required weeks or months of painstaking effort, sifting through thousands of social media posts manually. Haystack allowed us to cut through the noise and get at what we were really interested in: the extent to which these Russian groups talked about their real-world activities.

The system also surprised us with some unexpected benefits along the way, surfacing euphemisms and derogatory language used to describe migrant workers such as “visiting specialist” or “workaholics.”

The multi-language capabilities of the LLMs underlying Haystack allowed non-Russian speakers to uncover such language and verify the accuracy of terms with Russian experts on our team. The ability to work across languages holds huge potential for our newsroom and others that focus on cross-border investigations.

Haystack allowed us to explore the scale and nature of the nationalist movement in Russia, in particular revealing nuances about the activities of Russkaya Obshina. With Haystack, our reporters could more clearly understand the emerging trends by harvesting and analyzing a much larger volume of posts. The tool expanded our ability to scale the team’s capacity and, in the process, helped guide our reporting and clarify the story.

What’s Next for Haystack?

We designed Haystack in a way that could work with any type of investigation. As we think about future directions for the project, we’re considering expanding the number of data sources beyond Russian social media that Haystack can harvest data from and analyze. For example, other online data sources and public records and documents.

We are also thinking about how such a system aligns with existing patterns of work in the newsroom. Reporters who are experts on particular beats and geographic regions might typically search for examples of social media content that help tell a story.

Our system can harvest such information under their direction. Reporters can then spend some time reviewing and categorizing a small subset of such content, and then enlist Haystack to gather and analyze a much larger volume of posts. We believe such AI-powered systems hold potential to expand our reach, allowing us to work on stories that may otherwise never be told.

The emergence of generative AI has allowed malicious actors to pollute information environments faster than ever. The only way to counter the flood may be to find trustworthy ways to harness AI ourselves.

Editor’s Note: This story was originally published by the Reuters Institute and is republished with permission.

This story was written by Christopher Giles, a journalist, producer, and director at BBC Eye, Serdar Tumgoren, a computational journalist and lecturer at Stanford University, Chris Zubak-Skees, a freelance computational journalist and software developer at BBC Eye, and Marc Perkins the founding editor of BBC Eye investigations and now a senior commissioning editor at Channel 4.

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License

Republish our articles for free, online or in print, under a Creative Commons license.

Read other stories tagged with:

AI Cross post Reuters Institute

Republish this article

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License

Material from GIJN’s website is generally available for republication under a Creative Commons Attribution-NonCommercial 4.0 International license. Images usually are published under a different license, so we advise you to use alternatives or contact us regarding permission. Here are our full terms for republication. You must credit the author, link to the original story, and name GIJN as the first publisher. For any queries or to send us a courtesy republication note, write to hello@gijn.org.

<h2>Building a Multi-Agent AI System To Sift Through Thousands of Russian Social Media Posts</h2> by <a href="https://reutersinstitute.politics.ox.ac.uk/news/how-bbc-eye-built-multi-agent-ai-system-sift-through-ten-thousand-russian-social-media-posts">Christopher Giles, Serdar Tumgoren, Chris Zubak-Skees, and Marc Perkins, The Reuters Institute</a> for Global Investigative Journalism Network &bull; June 25, 2026 Any journalist who&rsquo;s done online investigations knows there&rsquo;s simply too much evidence for one human to ever collect or investigate. Too often, we are overwhelmed with a flood of information: tens of thousands of social media posts, images, and other media.<aside dir="ltr">The project began when we noticed that social movements were disrupting Russia&rsquo;s domestic politics while the Kremlin was waging its full-scale invasion of Ukraine.</aside>Our team from&nbsp;<a href="https://www.bbc.co.uk/programmes/m001v5nw">BBC Eye</a>, which works on original documentary investigations from around the world, wanted to see if AI could help solve this problem. We opted for AI agents &mdash; a collection of large language models (LLMs) that can coordinate and execute multi-step tasks under human supervision. When connected to external environments such as the internet or databases, these agents can fetch and analyze relevant social media content at scale, performing work that our team might otherwise not have the time to undertake.We used this approach as part of a <a href="https://www.bbc.co.uk/news/articles/cq5p306l7weo">recently published investigation</a> on Russia&rsquo;s rising nationalist vigilante movement, building a multi-agent AI system we named Haystack to help us explore new emerging forces in daily Russian life. The team included BBC Eye&rsquo;s open source investigators, who are specialists in gathering and analyzing public information, and Russia-focused reporters, along with help from computational journalists at Stanford University. This is how we developed and used this system.<h4 dir="ltr">Why We Created Haystack</h4>While we did extensive on-the-ground reporting in Russia for the investigation, the country&rsquo;s restrictive journalism environment made this kind of analysis additionally beneficial in supporting and deepening our research.The project began when we noticed that social movements were disrupting Russia&rsquo;s domestic politics while the Kremlin was waging its full-scale invasion of Ukraine. New groups emerged promoting far-right and nationalist beliefs, and these views were disseminated across Russian social media.Using source reporting and data journalism methods, the&nbsp;<a href="https://www.bbc.co.uk/programmes/m001v5nw">BBC Eye</a> global investigations team had already revealed how Russkaya Obshina, currently the largest nationalist group, was operating a nationwide campaign against migrants and those opposed to &ldquo;traditional values,&rdquo; in concert with the Russian authorities. The team also saw financial documents that suggested the group had been funded by charitable foundations run by figures close to the Kremlin (a finding Russkaya Obshina rejected).The reporting included over a dozen interviews with former and current members of the group, migrants, Russian citizens who have been targeted, and experts monitoring the situation. But to understand the scale of this movement, the team created and leveraged a new AI system to harvest, analyze, and surface leads at a scale we could never have handled using traditional methods.<aside dir="ltr">From a reporter's perspective, the system resembles one of those choose-your-own-adventure books.</aside>We were aiming to use AI to mirror multiple tasks of an investigations team in a single application: a computational journalist interacting with a webpage or API to download data, a reporter assessing social media posts for leads, and a data specialist producing numerical findings.It was our first foray into multi-agent work and required lots of experimentation and iteration: both in building an agent-driven workflow that made sense for reporters, and ensuring that the agents at each step were performing their tasks effectively.We used Haystack to gather 10,000 social media posts from over 10 Russian nationalist groups and produced 55,000 assessments of their content for signals such as nationalist ideology, references to migrants, anti-migrant raids, and expressions of violence against minority groups.With Haystack&rsquo;s assistance, the team found that Russkaya Obshina appeared to have the most prolific on-the-ground presence when compared with other nationalist groups, organizing patrols across Russian towns and cities, and raids on workplaces, shops, nightclubs, and hostels.<h4 dir="ltr">How Haystack Works</h4>Haystack stitches together multiple AI agents that perform a variety of tasks. Those include:<ul>
<li dir="ltr">Fetching posts from Russian social media sites that are home to several nationalist groups.</li>
<li dir="ltr">Assessing image and text-based social media posts, for example: to determine if the content includes nationalist, racist, or anti-immigrant language or imagery.</li>
<li dir="ltr">Performing data analysis in response to natural language human prompts, for example: &ldquo;How many posts contain references to law enforcement raids on migrant laborers at work sites?&rdquo;</li>
<li dir="ltr">
The team built the application using&nbsp;<a href="https://www.langchain.com/langgraph">LangGraph</a>, a programming library that enables developers to build AI workflows by connecting multiple agents that can perform these different tasks in one standalone structure. LangGraph allows developers to integrate any of the popular open source or cloud-hosted LLMs, such as OpenAI&rsquo;s GPT and <a href="https://deepmind.google/models/gemma/">Google&rsquo;s Gemma</a> models.
</li>
</ul>From a reporter's perspective, the system resembles one of those choose-your-own-adventure books. The journalist can interact with Haystack and travel down one of several well-defined paths.An initial challenge was ensuring the supervisor agent, which delegates tasks down chosen research paths, followed the direction provided by the journalist. We solved this issue by giving the supervisor agent examples of accurate behavior and having the system ask clarifying questions to resolve unclear reporter input.At each stage of the process, the reporter is asked to provide instructions and clarifications to agents. We had explored building a more automated system, where agents would independently take multi-step decisions, like collect, assess, and analyze data all based on a single complex prompt. But we found that having the reporter provide input at each stage of the process, determining which and how many posts an agent should assess, reduced the chance of LLMs lapsing into guesswork and taking the research down unintended routes. We also thought it was important to have a journalist in the loop, so that oversight was maintained over agent decision-making.<aside dir="ltr">We believe such AI-powered systems hold potential to expand our reach, allowing us to work on stories that may otherwise never be told.</aside>Once we&rsquo;d designed the system, the team put it to work by creating a seed list of nationalist and far-right groups that we were interested in investigating. Drawing on this list of over 10 nationalist groups, we used the collection agent to gather posts. This process created a unique dataset that the system could then assess for journalistic leads &mdash; helping us understand the nature of the nationalist movements that appeared to be growing in Russia.Our next step was to ensure the agent&rsquo;s assessment of online content worked reliably, so we continually checked Haystack&rsquo;s output and made adjustments to prompts to improve the results.As we spent time using the system, we discovered that responses got more helpful to our reporting as we removed ambiguity from the inputs&rsquo; wording. For example, in the beginning journalists would ask the system, &ldquo;Does this post contain raids? Please label it &lsquo;definitely,&rsquo; &lsquo;definitely not,&rsquo; 'probably,&rsquo; &lsquo;probably not,'. But we found that by narrowing the LLM&rsquo;s options to &ldquo;yes,&rdquo; &ldquo;no,&rdquo; &ldquo;not sure,&rdquo; we surfaced more precise leads.In the background, this process was building a large database where we would review the assessments and examine the reasoning the LLM applied to its decisions.The system also allowed reporters with little or no technical training to analyze data, for example, counting the number of posts describing anti-migrant raids. Traditionally, this type of data analysis requires fluency in programming and database querying language. Now a reporter without data training can ask questions in natural language, which Haystack translates into database queries before returning a straightforward summary of the query&rsquo;s results.Once we were confident in the system we set Haystack loose to gather and assess more social media posts, surfacing hundreds of leads that we might have overlooked, which were also verified by members of our team who manually reviewed the evidence.<h4 dir="ltr">How Haystack Has Helped our Journalists</h4>The system allowed us to start quantifying the degree to which nationalist groups were inciting violence against migrants, conducting on-the-ground raids, and mobilizing alongside Russian authorities in street-level actions.<aside dir="ltr">The ability to work across languages holds huge potential for our newsroom and others that focus on cross-border investigations.</aside>This is work that would have required weeks or months of painstaking effort, sifting through thousands of social media posts manually. Haystack allowed us to cut through the noise and get at what we were really interested in: the extent to which these Russian groups talked about their real-world activities.The system also surprised us with some unexpected benefits along the way, surfacing euphemisms and derogatory language used to describe migrant workers such as &ldquo;visiting specialist&rdquo; or &ldquo;workaholics.&rdquo;The multi-language capabilities of the LLMs underlying Haystack allowed non-Russian speakers to uncover such language and verify the accuracy of terms with Russian experts on our team. The ability to work across languages holds huge potential for our newsroom and others that focus on cross-border investigations.Haystack allowed us to explore the scale and nature of the nationalist movement in Russia, in particular revealing nuances about the activities of Russkaya Obshina. With Haystack, our reporters could more clearly understand the emerging trends by harvesting and analyzing a much larger volume of posts. The tool expanded our ability to scale the team&rsquo;s capacity and, in the process, helped guide our reporting and clarify the story.<h4 dir="ltr">What&rsquo;s Next for Haystack?</h4>We designed Haystack in a way that could work with any type of investigation. As we think about future directions for the project, we&rsquo;re considering expanding the number of data sources beyond Russian social media that Haystack can harvest data from and analyze. For example, other online data sources and public records and documents.We are also thinking about how such a system aligns with existing patterns of work in the newsroom. Reporters who are experts on particular beats and geographic regions might typically search for examples of social media content that help tell a story.Our system can harvest such information under their direction. Reporters can then spend some time reviewing and categorizing a small subset of such content, and then enlist Haystack to gather and analyze a much larger volume of posts. We believe such AI-powered systems hold potential to expand our reach, allowing us to work on stories that may otherwise never be told.The emergence of generative AI has allowed malicious actors to pollute information environments faster than ever. The only way to counter the flood may be to find trustworthy ways to harness AI ourselves.Editor's Note: This story was <a href="https://reutersinstitute.politics.ox.ac.uk/news/how-bbc-eye-built-multi-agent-ai-system-sift-through-ten-thousand-russian-social-media-posts">originally published</a> by the Reuters Institute and is republished with permission.&nbsp;<hr>This story was written by Christopher Giles, a journalist, producer, and director at BBC Eye, Serdar Tumgoren, a computational journalist and lecturer at Stanford University, Chris Zubak-Skees, a freelance computational journalist and software developer at BBC Eye, and Marc Perkins the founding editor of BBC Eye investigations and now a senior commissioning editor at Channel 4.&nbsp;
	This <a target="_blank" href="https://gijn.org/stories/using-ai-investigatee-russian-nationalist-social-media-posts/">article</a> first appeared on <a target="_blank" href="https://gijn.org">Global Investigative Journalism Network</a> and is republished here under a Creative Commons license.
	<img id="republication-tracker-tool-source" src="https://gijn.org/?republication-pixel=true&amp;post=657947&amp;ga=UA-21528033-17">

How AI Is Helping Independent Journalists Track Wartime Casualties of Russia

by Katya Bonch-Osmolovskaya • March 31, 2025

Exiled Russian media site IStories has shared with GIJN how it built an AI-powered database of Russian military war dead and missing, and why it was worth creating.

Reporting Tools & Tips

New Investigative Tools for Monitoring Social Media Platforms

by Rowan Philp • March 20, 2023

Social media platforms are among the most difficult sites to scrape for data across the internet. A recent session at NICAR23 unveiled several dynamic new tools — including Junkipedia, a possible CrowdTangle replacement — that can perform a wealth of social media monitoring tasks, from tracking down who is behind harmful ads to identifying conspiracy groups or influencers spreading disinformation.

GIJC23

Reporting on Russia from the Outside — From Investigating Ukraine to AI-Powered Analysis

by Laura Oliver • September 20, 2023

From cultivating sources to verifying information, the challenges of investigating Russia from outside the country are numerous.

How They Did It Methodology

How They Did It: Uncovering a Vast Network of Illegal Mining in Venezuela

by Mariel Lozada • June 2, 2022

A behind-the-scenes look at a reporting collaboration that uncovers the vast scale of illegal mining and illicit smuggling in Venezuela’s Amazon region.

Accessibility Settings

text size

color options

reading tools

other

Stories

Topics

Building a Multi-Agent AI System To Sift Through Thousands of Russian Social Media Posts

Why We Created Haystack

How Haystack Works

How Haystack Has Helped our Journalists

What’s Next for Haystack?

Read other stories tagged with:

Republish this article

Read Next

Data Journalism Methodology

How AI Is Helping Independent Journalists Track Wartime Casualties of Russia

GIJC23

Reporting on Russia from the Outside — From Investigating Ukraine to AI-Powered Analysis

How They Did It Methodology

How They Did It: Uncovering a Vast Network of Illegal Mining in Venezuela

Stories

Topics

Building a Multi-Agent AI System To Sift Through Thousands of Russian Social Media Posts

Related Resources

SECOND COURSE ADDED: AI Without the Guesswork: Smarter Journalism Without Prompting and Hallucinations — FEW TICKETS LEFT

Tips for Mining Social Media Platforms with Henk van Ess

Investigating — And Embracing — the AI Revolution

GIJC23 – What Does AI Have to Do with Investigative Journalism? Everything!

Share

Why We Created Haystack

How Haystack Works

How Haystack Has Helped our Journalists

What’s Next for Haystack?

Related Resources

SECOND COURSE ADDED: AI Without the Guesswork: Smarter Journalism Without Prompting and Hallucinations — FEW TICKETS LEFT

Tips for Mining Social Media Platforms with Henk van Ess

Investigating — And Embracing — the AI Revolution

GIJC23 – What Does AI Have to Do with Investigative Journalism? Everything!

Related Stories

How AI Is Helping Independent Journalists Track Wartime Casualties of Russia

New Investigative Tools for Monitoring Social Media Platforms

Reporting on Russia from the Outside — From Investigating Ukraine to AI-Powered Analysis

How They Did It: Uncovering a Vast Network of Illegal Mining in Venezuela

Read other stories tagged with:

Republish this article

Read Next

Data Journalism Methodology

How AI Is Helping Independent Journalists Track Wartime Casualties of Russia

Reporting Tools & Tips

New Investigative Tools for Monitoring Social Media Platforms

GIJC23

Reporting on Russia from the Outside — From Investigating Ukraine to AI-Powered Analysis

How They Did It Methodology

How They Did It: Uncovering a Vast Network of Illegal Mining in Venezuela