Key Highlights
- AI meets biobank: Elucidata and Sapien partner to convert 300,000+ patient samples into AI-ready, multimodal datasets for drug and diagnostic innovation.
- Unlocking Asia’s data: Collaboration brings rich Indian clinical-genomic data into the global AI drug development ecosystem.
- Synthetic intelligence in action: First use-case builds models to infer omics from pathology slides, enabling tissue-sparing strategies in oncology R&D.
Transforming Biobanks into AI-Driven Engines
Elucidata, a U.S.-India based AI pioneer in life sciences, and Sapien Biosciences, India’s largest commercial biobank founded with Apollo Hospitals, are partnering to reimagine how biospecimens fuel drug discovery. With more than 300,000 patient samples across diseases like cancer, cardiovascular, autoimmune, and neurology, Sapien brings unmatched scale and diversity. Using Elucidata’s Polly platform, this massive repository will be harmonized into AI-compatible formats, enabling predictive modeling, synthetic data generation, and biomarker discovery.
Making Indian Clinical Data Global-Ready
A major aim is to unlock underrepresented patient datasets from India for global pharmaceutical R&D. More than 85,000 cancer patient records—paired with histopathology, genomics, and structured clinical data—will now power inclusive diagnostics and drug development. This helps fill critical gaps in datasets historically dominated by Western populations, accelerating equitable innovation in precision medicine.
AI-Powered Models for Omics Inference
In its first phase, the partnership will develop AI models that extract genomic and transcriptomic insights directly from digitized pathology images. Leveraging Sapien’s annotated NGS datasets, the teams aim to generate high-quality synthetic multimodal data—crucial for rare cancers and tissue-constrained samples. This breakthrough enables faster, less invasive analysis, improving R&D economics and patient outcomes.
Driving Translational Impact with Real-World Data
This partnership will explore multi-disease applications beyond oncology, creating synthetic clinicogenomic datasets across autoimmune, neuro, and cardiovascular conditions. As Elucidata scales its Polly platform across Sapien’s deep biobank, both companies position themselves as global enablers of AI-native drug discovery. With clients already advancing toward FDA milestones and national recognitions under their belt, this duo is shaping the future of scalable, real-world data science in biomedicine.
About Elucidata
Elucidata is a data-centric AI company enabling drug and diagnostic innovation by converting fragmented biomedical data into AI-ready assets. Its Polly platform integrates EHRs, genomics, imaging, and clinical trial data for downstream modeling. Headquartered in San Francisco with operations in the U.S. and India, Elucidata supports 70+ biopharma clients and contributes to drug programs progressing toward regulatory approvals.
About Sapien Biosciences
Sapien Biosciences is India’s first commercial biobank and a leader in real-world evidence for precision health. With access to over 300,000 patients’ data and samples—including pathology, FFPE, blood, and annotated clinical records—Sapien enables applications across AI/ML modeling, biomarker discovery, and translational research. It collaborates with pharma and diagnostics companies to bring greater representation and relevance to global drug development.