Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Generative models for synthetic data generation: application to pharmacokinetic/pharmacodynamic data
 
research article

Generative models for synthetic data generation: application to pharmacokinetic/pharmacodynamic data

Jiang, Yulun  
•
García-Durán, Alberto
•
Losada, Idris Bachali
Show more
December 1, 2024
Journal of Pharmacokinetics and Pharmacodynamics

The generation of synthetic patient data that reflect the statistical properties of real data plays a fundamental role in today’s world because of its potential to (i) be enable proprietary data access for statistical and research purposes and (ii) increase available data (e.g., in low-density regions—i.e., for patients with under-represented characteristics). Generative methods employ a family of solutions for generating synthetic data. The objective of this research is to benchmark numerous state-of-the-art deep-learning generative methods across different scenarios and clinical datasets comprising patient covariates and several pharmacokinetic/pharmacodynamic endpoints. We did this by implementing various probabilistic models aimed at generating synthetic data, such as the Multi-layer Perceptron Conditioning Generative Adversarial Neural Network (MLP cGAN), Time-series Generative Adversarial Networks (TimeGAN), and a more traditional approach like Probabilistic Autoregressive (PAR). We evaluated their performance by calculating discriminative and predictive scores. Furthermore, we conducted comparisons between the distributions of real and synthetic data using Kolmogorov-Smirnov and Chi-square statistical tests, focusing respectively on covariate and output variables of the models. Lastly, we employed pharmacometrics-related metric to enhance interpretation of our results specific to our investigated scenarios. Results indicate that multi-layer perceptron–based conditional generative adversarial networks (MLP cGAN) exhibit the best overall performance for most of the considered metrics. This work highlights the opportunities to employ synthetic data generation in the field of clinical pharmacology for augmentation and sharing of proprietary data across institutions.

  • Details
  • Metrics
Type
research article
DOI
10.1007/s10928-024-09935-6
Scopus ID

2-s2.0-85202151152

PubMed ID

39192091

Author(s)
Jiang, Yulun  

École Polytechnique Fédérale de Lausanne

García-Durán, Alberto

Merck KGaA

Losada, Idris Bachali

Merck KGaA

Girard, Pascal

Merck KGaA

Terranova, Nadia

Merck KGaA

Date Issued

2024-12-01

Published in
Journal of Pharmacokinetics and Pharmacodynamics
Volume

51

Issue

6

Start page

877

End page

885

Subjects

Deep learning

•

Generative methods

•

Neural networks

•

Synthetic pharmacokinetic/Pharmacodynamic data

•

Virtual patients

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
MLBIO  
FunderFunding(s)Grant NumberGrant URL

Krunal Vasant Kavathiya of Merck Specialities Pvt. Ltd.

Merck KGaA

Available on Infoscience
January 24, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/243555
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés