Artificial intelligence to diagnose genetic diseases

Researchers at Rady Children’s Institute for Genomic Medicine (RCIGM) have utilized automated machine-learning and clinical natural language processing (CNLP) to diagnose rare genetic diseases in record time. This new method is speeding answers to physicians caring for infants in intensive care and opening the door to increased use of genome sequencing as a first-line diagnostic test for babies with cryptic conditions.

“Some people call this artificial intelligence, we call it augmented intelligence,” said Stephen Kingsmore, MD, DSc, President and CEO of RCIGM. “Patient care will always begin and end with the doctor. By harnessing the power of technology, we can quickly and accurately determine the root cause of genetic diseases. We rapidly provide this critical information to intensive care physicians so they can focus on personalizing care for babies who are struggling to survive.”

The workflow and research were led by the RCIGM team in collaboration with leading technology and data-science developers —Alexion, Clinithink, Diploid, Fabric Genomics and Illumina.

Dr. Kingsmore’s team has pioneered a rapid Whole Genome Sequencing process to deliver genetic test results to neonatal and paediatric intensive care (NICU/PICU) physicians to guide medical intervention. RCIGM is the research arm of Rady Children’s Hospital-San Diego.

By reducing the need for labour-intensive manual analysis of genomic data, the supervised automated pipeline provided significant time-savings. In February 2018, the same team achieved the Guinness World Record for fastest diagnosis through whole genome sequencing. Of the automated runs, the fastest times – averaging 19 hours – were achieved using augmented intelligence.

“This is truly pioneering work by the RCIGM team—saving the lives of very sick newborn babies by using AI to rapidly and accurately analyse their whole genome sequence “ says Eric Topol, MD, Professor of Molecular Medicine at Scripps Research and author of the new book Deep Medicine.

RCIGM has optimized and integrated several time-saving technologies into a rapid Whole Genome Sequencing (rWGS) process to screen a child’s entire genetic makeup for thousands of genetic anomalies from a blood sample.

Key components in the rWGS pipeline come from Illumina, the global leader in DNA sequencing, including Nextera DNA Flex library preparation, whole genome sequencing via the NovaSeq 6000 and the S1 flow cell format. Speed and accuracy are enhanced by Illumina’s DRAGEN (Dynamic Read Analysis for GENomics) Bio-IT Platform.

Other pipeline elements include Clinithink’s clinical natural language processing platform CliX ENRICH that quickly combs through a patient’s electronic medical record to automatically extract comprehensive patient phenotype information.

Another core element of the machine learning system is MOON by Diploid. The platform automates genome interpretation using AI to automatically filter and rank likely pathogenic variants. Deep phenotype integration, based on natural language processing of the medical literature, is one of the key features driving this automated interpretation. MOON takes five minutes to suggest the causal mutation out of the 4.5 million variants in a whole genome.

In addition, Alexion’s rare disease and data science expertise enabled the translation of clinical information into a computable format for guided variant interpretation.

As part of this study, the genetic sequencing data was fed into automated computational platforms under the supervision of researchers. For comparison and verification, clinical medical geneticists on the team used Fabric Genomics’ AI-based clinical decision support software, OPAL (now called Fabric Enterprise)—to confirm the output of the automated pipeline. Fabric software is part of RCIGM’s standard analysis and interpretation workflow.

The study titled “Diagnosis of genetic diseases in seriously ill children by rapid whole-genome sequencing and automated phenotyping and interpretation,” found that automated, retrospective diagnoses concurred with expert manual interpretation (97 percent recall, 99 percent precision in 95 children with 97 genetic diseases).

Researchers concluded that genome sequen-cing with automated phenotyping and interpretation—in a median 20:10 hours—may spur use in intensive care units, thereby enabling timely and precise medical care. “Using machine-learning platforms doesn’t replace human experts. Instead it augments their capabilities,” said Michelle Clark, PhD, statistical scientist at RCIGM and the first author of the study. “By informing timely targeted treatments, rapid genome sequencing can improve the outcomes of seriously ill children with genetic diseases.”
Rady Children’s Institutewww.radygenomics.org/category/news/pr/