Half a million health records offered for sale on Alibaba—Misryoum report

Confidential, “de-identified” UK Biobank data from 500,000 volunteers was listed on Alibaba, then removed after UK government action. Privacy experts warn that anonymized health datasets can still be re-identified.
A UK government update says confidential health records from half a million volunteers were advertised for sale on an Alibaba marketplace in China—an alarming reminder that even “de-identified” medical data can still be sensitive.
The focus of the case is UK Biobank, a major research resource built from long-term volunteer participation.. Misryoum reports that the listings were discovered after the UK Biobank charity informed the government that participation data appeared to be up for sale on Alibaba’s platform.. The UK technology minister told parliament that the records had since been removed. and there is no belief that any sale took place.
The records described in the listings were linked to UK Biobank’s participants and included information researchers typically value for studies—ranging from genetic data to health and imaging records.. Officials said the data was “de-identified. ” meaning it did not include direct identifiers such as names. addresses. or precise dates of birth.. But Misryoum notes that de-identification is not the same as risk elimination: combinations of medical attributes. dates. geography at coarse resolution. and study-linked details can sometimes be enough to narrow identities when paired with other information.
Misryoum understands that the incident was reported as multiple listings appeared. with at least one dataset reportedly containing data from all 500. 000 volunteers.. While the government worked with Chinese authorities and Alibaba to remove the material. the episode adds to a growing pattern of concerns about how large-scale health datasets are stored. accessed. and ultimately protected.
That pattern matters because UK Biobank data is treated as a “jewel” for UK science: scientists at universities and companies worldwide apply for access. and the project has become a cornerstone for biomedical and public health research.. In practical terms. the datasets help researchers study long-term disease risk. track biomarkers over time. and test links between genetics and outcomes.. When such information is mishandled, the harm isn’t abstract.. Volunteers expect their contribution to advance medicine without exposing them to unwanted surveillance or misuse.
In recent months. Misryoum reports that sensitive UK Biobank information has been exposed online repeatedly. raising fresh questions about whether safeguards have kept up with the scale and value of modern data.. The parliamentary committee chair described the incident as “incredibly serious” and framed it as a further blow to public trust at a time when governments are pushing for digital systems to speed up research and healthcare services.
Privacy experts and researchers also point to a technical tension at the center of this story: how to allow scientific access while preventing raw or sensitive extracts from leaving controlled environments.. Since 2024, scientists have been required to conduct analysis in UK Biobank’s cloud-based research platform.. Misryoum reports that. while researchers must sign agreements not to download raw participant data. there has been no technical block on downloading—an approach privacy specialists have criticized as unusually risky.
The most serious implication is that “de-identified” does not necessarily mean “un-linkable.” Misryoum notes that a prior UK Biobank leak reportedly enabled re-identification of a single participant by combining information in a way that researchers and data stewards say should not be feasible under robust protections.. Once health data is out in open channels—especially marketplaces—control becomes extremely difficult.. Even if one listing is removed quickly, copies may already exist, and additional sales routes can appear elsewhere.
Misryoum also highlights the governance response: UK Biobank referred itself to the Information Commissioner’s Office. revoked access for research institutions identified as sources of the data. and temporarily suspended broader access.. Prof Rory Collins. the chief executive and principal investigator of UK Biobank. said the organization had suspended access immediately. apologized for the concern. and is taking further steps.. Those steps include putting in place additional technology and processes. reviewing the incident at board level. and upgrading defenses to help prevent de-identified data from being taken out of the platform.. UK Biobank also said it had taken its research platform offline while it completes an upgrade. with work expected to take several weeks.
For volunteers, the lesson is personal even when the data is stripped of names.. Many people sign up for research because they believe the system protects them.. Misryoum’s reporting shows how quickly that trust can be tested when marketplaces can treat health information like a commodity.. For policymakers and the research community. the case underscores that the value of biomedical datasets is precisely why protections must be engineered to resist misuse. not just rely on contracts and procedures.