VEIL.AI'S next-generation data anonymization
is utilized to create an external control arm
in Bayer’s "Future Clinical Trials" project

This case study demonstrates:

  • How anonymized data can be of such a high quality that the same conclusions can be drawn from it, as from traditional pseudonymized, individual-level research data
  • How sensitive data can be anonymized, after which it can be transferred to another country to be enriched and further utilized

This is a significant achievement. In our study, we could draw the same conclusions from anonymized data as from traditional pseudonymized, individual-level research data.

Jussi Leinonen,

Strategic Project Lead

Bayer

BACKGROUND AND TARGETS

Global life science company Bayer carried out a three-year multi-million-euro development project called Future Clinical Trials (FCT), which utilized AI-enhanced next-generation anonymization technology from VEIL.AI

The international project was aiming to find ways to:

speed up clinical trials

reduce drug’s time to market

increase patient safety

reduce costs


In the beginning of the project Bayer benchmarked European companies for data anonymization and selected VEIL.AI to be their partner for next-generation anonymization & synthetic data generation. VEIL.AI has collaborated with Bayer for over four years.

WHY DOES BAYER NEED ANONYMIZED, PRIVACY COMPLIANT REAL-WORLD EVIDENCE?

Patient recruitment and retention in clinical trials are major challenges in the development of new drugs. If there is high-quality anonymized data available, part of the control patient group could be replaced in some instances with an external control arm – i.e., with a group formed based on external health data, most commonly data from real-world data like electronic health records, registries or previous clinical trials.

 
This can have many positive impacts like increasing efficiency, reducing delays, speeding up a drug's time-to-market and lowering costs in the evaluation of new therapies.

 
Bayer also wanted to build the capabilities for embedding RWD into clinical development programs. The idea was to anonymize the pseudonymized external RWD, which enables the data to be transferred to Bayer’s own environment, where the anonymized data could be merged with Bayer’s own anonymized legacy RCT data.

STEPS TO BUILD AN EXTERNAL CONTROL ARM

DATA FROM REGISTER HOLDERS

When talking about external control arms, Bayer wanted to create a methodology for providing virtual or synthetic controls for clinical trials based on RWD and legacy clinical trial data.
The FCT project was implemented in Finland, which has good health data registers and the first legislation for secondary use of health data introduced in Europe. The name of the health data permit authority is Findata.
The new data needed for Bayer's external control group was collected from national registers and databases directly into Findata's secure operating environment, where it was cleaned, checked and processed.

HIGH-QUALITY DATA ANONYMIZATION

After that VEIL.AI anonymized the pseudonymized data with next-generation anonymization technology with very high quality. In fact, the next-generation anonymized data was of such high quality that the same conclusions could be drawn from it as from traditional, pseudonymous individual-level research data (read more about the published study article here).

VERIFICATION OF ANONYMITY BEFORE DATA RELEASE

So now Bayer had new high-quality anonymized register data in Findata's data secure environment. But how can an authority or a Data Privacy Officer (DPO) know that an anonymized dataset is really anonymous? How can anonymity be verified before anonymous data is released to be transferred to another country?

VEIL.AI CREATES A PROTOCOL TO VERIFY ANONYMITY

VEIL.AI has also solved this issue; VEIL.AI has created a protocol with which authorities and DPO’s can verify the anonymity of datasets that have gone through the anonymization process. The analysis is a transparent way to provide strong evidence of privacy protection before releasing the anonymized data for further use and helps to document what has been done during the anonymization process. 

AUTHORITY CONSENTS TO THE TRANSFER OF ANONYMIZED DATA TO ANOTHER COUNTRY

After Findata was able to verify the anonymity of the dataset, it gave consent for Bayer to export the verified anonymous data to Bayer’s own secure data science environment. 

In this way, Bayer received new high-quality anonymized data in their own Data Science environment which opened up great new possibilities for further enrichment and utilization of data, such as:

  • merging the anonymized data with legacy RCT data
  • getting more anonymized RWD and RCT data
  • data augmentation to extend the follow-up time and improve the efficiency of clinical trials

RE-USE OF BAYER’S LEGACY DATA WITH EXTERNAL PARTNERS

Another big topic was how to enable efficient data sharing and re-use of Bayer's own legacy RCT data with external partners. Re-use of legacy data provides huge opportunities and its value can increase drastically.

If the data is anonymized with high quality it can naturally also be re-used internally and in addition, collaboration and co-development with external partners is much easier.


In the FCT project, indication specific harmonized and curated datasets were created in Bayer’s data science environment. VEIL.AI's AI-enhanced next-generation anonymization technology enabled the re-use of legacy data. The anonymization results were excellent.

If legacy data is anonymized, an external partner can be granted access to the data on organization’s own server without having to send the data anywhere (outside the organization).

NEXT-GENERATION ANONYMIZATION & PERSPECTIVES FOR THE FUTURE

Bayer identified various different use case opportunities for anonymization, related to both focus areas (real-world-evidence and randomized clinical trials).

The aim is to move towards defined standards and continuous anonymization capability included in data science platforms and workflows.


"Data Scientists in data driven organizations need good data to practice high-quality data science. Continuous next-generation anonymization capability can help to do things that would otherwise not be possible." - Jussi Leinonen, Strategic Project Lead

Want to to discuss your use case with us?

Are you interested in our services and solutions? Let’s start a conversation about how we could help you.

Name (first & last)*
Company/organization email*
Message
0 of 350

*By submitting this form you agree to our Privacy policy. We are committed to your privacy. Information provided by you may be collected and automatically stored in our database and may be used for sending you additional information about VEIL.AI and our services. You may unsubscribe from these communications at any time.