TY - GEN
T1 - A Flexible and Synthetic Transplant Kidney Exchange Problem (FASTKEP) Data Generator
AU - Nau, Calvin
AU - Sankaran, Prashant
AU - McConky, Katie
AU - Sudit, Moises
AU - Khazaelpour, Payam
AU - Velasquez, Alvaro
N1 - Publisher Copyright:
© IISE Annual Conference and Expo 2024.All rights reserved.
PY - 2024
Y1 - 2024
N2 - The Kidney Exchange Problem (KEP) optimizes a pool of incompatible patient-donor pairs and non-directed donors to determine the optimal cycles and chains of length-N that maximize transplants. A direct impact of KEP optimization is an improved quality of life for transplant recipients at a lower cost to the healthcare system (as opposed to dialysis treatments). To improve this process, synthetic data is commonly used to test and develop KEP algorithms. The canonical KEP data generation process is referred to as the Saidman Generator. The generator utilizes attributes of the donors and recipients that determine compatibility, such as blood type and protein compatibility, alongside known data distributions, including the percentage of different blood types comprising an exchange, to create datasets that aim to mirror the real world. Prior published Java implementations of this generator exist. However, in this work, an open-source data generation package implemented in Python is proposed for the KEP which mirrors the Saidman generator’s output. The proposed implementation is compared to a common Java implementation for scalability and efficiency. This generator allows for future extensions that integrate additional recipient and donor features that produce data better matching real-world exchanges. The result is an open-source Python implementation of a KEP data generator. The efficient creation of directed exchange datasets enables future work, including the use of the presented data generation process by ML/AI researchers in application to large-scale machine learning model training.
AB - The Kidney Exchange Problem (KEP) optimizes a pool of incompatible patient-donor pairs and non-directed donors to determine the optimal cycles and chains of length-N that maximize transplants. A direct impact of KEP optimization is an improved quality of life for transplant recipients at a lower cost to the healthcare system (as opposed to dialysis treatments). To improve this process, synthetic data is commonly used to test and develop KEP algorithms. The canonical KEP data generation process is referred to as the Saidman Generator. The generator utilizes attributes of the donors and recipients that determine compatibility, such as blood type and protein compatibility, alongside known data distributions, including the percentage of different blood types comprising an exchange, to create datasets that aim to mirror the real world. Prior published Java implementations of this generator exist. However, in this work, an open-source data generation package implemented in Python is proposed for the KEP which mirrors the Saidman generator’s output. The proposed implementation is compared to a common Java implementation for scalability and efficiency. This generator allows for future extensions that integrate additional recipient and donor features that produce data better matching real-world exchanges. The result is an open-source Python implementation of a KEP data generator. The efficient creation of directed exchange datasets enables future work, including the use of the presented data generation process by ML/AI researchers in application to large-scale machine learning model training.
KW - Combinatorial Optimization
KW - Data Generation
KW - Graph Machine Learning
KW - Kidney Exchange Problem
KW - Organ Exchange
UR - https://www.scopus.com/pages/publications/85206570634
M3 - Conference contribution
AN - SCOPUS:85206570634
T3 - Proceedings of the IISE Annual Conference and Expo 2024
BT - Proceedings of the IISE Annual Conference and Expo 2024
A2 - Greer, A. Brown
A2 - Contardo, C.
A2 - Frayret, J.-M.
PB - Institute of Industrial and Systems Engineers, IISE
T2 - IISE Annual Conference and Expo 2024
Y2 - 18 May 2024 through 21 May 2024
ER -