Nearly half of participants in a controlled experiment failed to reliably distinguish AI-generated social media bots from real human users - even among people who considered themselves digitally literate. The finding, from research conducted as part of a master's program at Malmö University and supported by cybersecurity firm Surfshark, raises pointed questions about the effectiveness of human judgment as a first line of defense against automated manipulation online.
What the Study Found - and What It Implies
The experiment involved 710 participants. Of those, only 53 percent managed to correctly identify bots more often than they incorrectly identified real humans as bots - a threshold that, on reflection, is not especially demanding. It means nearly half the group, 47 percent, could not clear even that modest bar. The study did not measure expert performance against layperson performance in isolation, which makes the self-reported confidence of participants all the more striking: people who believed they were capable of spotting artificial accounts were, in many cases, wrong.
This is not a failure of intelligence. It is a failure of available signals. Social media platforms are designed to present all accounts - automated or human - through identical visual templates: a profile picture, a display name, a post, a timestamp. The surface-level cues that people instinctively rely on to assess authenticity are, by design, available to anyone writing a bot. When the interface itself offers no meaningful distinction, human perception has very little to work with.
Why Modern Bots Are Genuinely Difficult to Detect
The generation of convincing social media personas has changed substantially with the wide availability of large language models. Earlier bots were often detectable by obvious tells: stilted phrasing, repetitive posting patterns, lack of conversational coherence. Contemporary AI-generated content can produce contextually appropriate replies, adjust tone, and mimic the informal register of genuine online conversation. A bot can now express apparent enthusiasm, feign uncertainty, and construct plausible personal anecdotes - all characteristics that people associate with human presence.
Beyond language quality, bot operators have also become more sophisticated in mimicking behavioral patterns. Posting at irregular intervals, engaging across multiple threads, and maintaining consistent apparent interests over time are all strategies that reduce the behavioral markers typically used to flag automated accounts. Platform-level detection tools can flag high-volume or coordinated activity, but individual users browsing a feed have no access to those analytical layers.
The Broader Stakes for Information and Public Discourse
The significance of this difficulty extends well beyond the inconvenience of being fooled. Bots have been documented as key instruments in coordinated influence operations - used to amplify fringe viewpoints, create a false impression of consensus, and suppress the perceived credibility of opposing voices. When real users cannot reliably identify which accounts are artificial, they are effectively operating in an information environment they cannot accurately read.
This is particularly consequential in contexts where social proof matters: political discussions, public health messaging, consumer decisions. A viewpoint that appears to have broad organic support carries more weight than one that seems marginal. Bots exploit precisely that dynamic. If the apparent weight of opinion is being manufactured, and users cannot detect the manufacture, their judgments are being shaped by stimuli they have no way to evaluate accurately.
The Surfshark-backed study adds to a body of evidence suggesting that individual media literacy, while valuable, is not sufficient on its own. Calls for platform-level transparency - clearer disclosure of account origins, better labeling of AI-generated content, more accessible audit tools - have grown louder in policy circles, particularly in the European Union, where the Digital Services Act imposes new obligations on large platforms around systemic risk and algorithmic accountability. Whether those structural measures prove more effective than user education alone remains an open question, but the data from Malmö suggests that education alone has real limits.
What Users Can Realistically Do
No single behavioral signal reliably distinguishes a bot from a person, but a combination of factors can shift the odds. Accounts with very short posting histories, profiles that engage almost exclusively on one narrow topic, and comments that read as generically applicable to multiple contexts - rather than specifically responsive to a particular conversation - are worth treating with heightened skepticism. So are accounts that appear to post at consistent intervals around the clock without variation, or that have accumulated large follower counts in an implausibly short time.
More fundamentally, the study is a reminder that confidence in one's own digital judgment is not the same as accuracy. The participants who failed were not, by their own reckoning, naïve users. Recognizing the limits of intuitive assessment - and treating social media signals with structural skepticism rather than case-by-case snap judgments - is likely more protective than any checklist of specific bot behaviors, which evolve as detection awareness grows.