Breaking News
January 23, 2019 - New certified reference material for testing residual solvents in cannabis
January 23, 2019 - Gene-edited chickens could prevent future flu pandemic
January 23, 2019 - Cardiovascular disease risk begins even before birth
January 23, 2019 - Younger patients receiving kidney transplant more likely to live longer, shows data
January 23, 2019 - Skin samples hold early signs of prion disease, research suggests
January 23, 2019 - Researchers discover how body initiates repair mechanisms that limits damage to myelin sheath
January 23, 2019 - Fecal transplant from certain donors better than others
January 23, 2019 - Risk for Uninsurance in AMI Patients Reduced With Medicaid Expansion
January 23, 2019 - Readmissions reduction program may be associated with increase in patient-level mortality
January 23, 2019 - Fostering translation and communication in medicine and beyond
January 23, 2019 - To Fight Fatty Liver, Avoid Sugary Foods and Drinks
January 23, 2019 - TPU scientists develop new implants that double the rate of bone lengthening in kids
January 23, 2019 - New sessions at Pittcon 2019
January 23, 2019 - Insilico to present latest findings in AI for Drug Discovery at 3rd Annual SABPA FTD Forum
January 23, 2019 - Opioid overdose patients can be safely discharged an hour after administration of naloxone
January 23, 2019 - Scientists find bacterial extracellular vesicles in human blood
January 23, 2019 - Researchers gain new insights into development of necrotizing enterocolitis in preemies
January 23, 2019 - Medical expert advises people with epilepsy not to stockpile medicines
January 23, 2019 - Study outlines research priorities for improving pediatric patient care and safety
January 23, 2019 - Bedfont to exhibit NObreath FeNO monitor at Arab Health 2019
January 23, 2019 - Nicotinamide riboside supplementation confers significant physiological benefits to mothers and offspring
January 23, 2019 - Increasing temperatures may help preserve crop nutrition
January 23, 2019 - Many Oncologists in the Dark About LGBTQ Health Needs
January 23, 2019 - Epigenetic change causes fruit fly babies to inherit diet-induced heart disease
January 23, 2019 - Erasing memories could reduce relapse rates among drug addicts
January 23, 2019 - African Americans who smoke cigarettes are more likely to develop peripheral artery disease
January 23, 2019 - Unique data combination helps FinnGen researchers to fund links between genetic factors and health
January 23, 2019 - Parents’ mental health problems associated with reactive attachment disorder in children
January 23, 2019 - Graphene Flagship project studies impact of graphene and related materials on our health
January 23, 2019 - The connection between the Pope and contraceptive pills
January 23, 2019 - Prior dengue infection could protect children from symptomatic Zika
January 23, 2019 - Previous dengue virus infection associated with protection from symptomatic Zika
January 23, 2019 - VISTA checkpoint implicated in pancreatic cancer immunotherapy resistance
January 23, 2019 - The Tiny Camera That Could Revolutionize Cardiovascular Surgery
January 23, 2019 - Peptide isolated from soil fungi has antitumor and antibacterial properties
January 23, 2019 - TGen identifies polio-like virus as potential cause of Acute Flaccid Myelitis outbreak
January 23, 2019 - Migrants and refugees do not bring disease and are at greater health risk themselves says WHO
January 23, 2019 - Examing the effects of menopause in workplace
January 23, 2019 - Enemy number 1 – Air pollution and climate change top of WHO agenda
January 23, 2019 - Two Positive Phase III studies of Tafenoquine for the Radical Cure of Plasmodium vivax Malaria Published in The New England Journal of Medicine
January 23, 2019 - World Trade Center responders at increased risk for head and neck cancers
January 23, 2019 - Low-sugar diet leads to significant improvement in nonalcoholic fatty liver disease in boys
January 23, 2019 - Chaos in bodily regulation can optimize our immune system, finds study
January 23, 2019 - Short, text-based exercises can increase happiness for adults recovering from substance use disorders
January 23, 2019 - Body size may have greater influence on women’s lifespan than men
January 23, 2019 - Groundbreaking tool helps visualize neuronal activity with near-infrared light
January 23, 2019 - Prior dengue immunity in children may be protective against symptomatic Zika
January 23, 2019 - Holocaust survivors with PTSD and their offspring exhibit more unhealthy behavior patterns
January 23, 2019 - Scientists discover new genetic mutations causing inherited deaf-blindness
January 23, 2019 - UC team designs new naloxone-dispensing smart device
January 23, 2019 - Torrent Pharmaceuticals Limited Issues Voluntary Nationwide Recall of Losartan Potassium Tablets, USP and Losartan Potassium and Hydrochlorothiazide Tablets, USP
January 23, 2019 - Brain activity shows development of visual sensitivity in autism
January 23, 2019 - Two hour gap between dinner and sleep is overrated says Japanese research
January 23, 2019 - Fear and embarrassment are causing smear test numbers to plummet
January 23, 2019 - Protein-secreting device implanted in epileptic rats reduces seizures, improves cognition
January 23, 2019 - Reintroduction project recovers current wild population of green turtle in Cayman Islands
January 23, 2019 - Cancer survivors face greater financial burden related to medical bills
January 23, 2019 - PSA screening reduces prostate cancer deaths by 30%
January 23, 2019 - LSTM receives grant to help improve health of people living in informal settlements
January 23, 2019 - Hemochromatosis Mutation Linked to Other Morbidity
January 23, 2019 - Why early diagnosis of autism should lead to early intervention
January 23, 2019 - Aspirin May Lower Stroke Risk in Women with History of Preeclampsia
January 23, 2019 - Exposure to certain chemicals may be linked to decrease in blood pressure during pregnancy
January 23, 2019 - Bowel cancer on the rise among younger Australians
January 23, 2019 - Scientists have reversed memory loss in a mouse model of Alzheimer’s
January 23, 2019 - Defective molecular master switch could lead to age-related macular degeneration
January 23, 2019 - Researchers identify how concussions may contribute to seizures
January 23, 2019 - Short interval between last meal of the day and bedtime may not affect blood glucose levels
January 23, 2019 - Still Too Many Highway Deaths Tied to Speeding
January 23, 2019 - Prenatal valproate exposure linked to increased ADHD risk
January 23, 2019 - Compound identified that may help treat heart failure
January 23, 2019 - Undiagnosed Asthma in Urban Adolescents May Be Common
January 23, 2019 - Study describes metabolism of intestinal microbiota in babies for the first time
January 22, 2019 - Study links concussions to development of epilepsy
January 22, 2019 - Specialist-led hospital bereavement service may help restrain legal action after difficult deaths
January 22, 2019 - Genetic study reveals possible new routes to treating osteoarthritis
January 22, 2019 - Blood test may detect early signs of lung-transplant rejection
January 22, 2019 - Blood marker could aid in early prediction of Alzheimer’s progression
January 22, 2019 - Orthodontic treatment does not guarantee future dental health
January 22, 2019 - Rutgers researchers discover cause of bone loss in people with joint replacements
Promoting precision medicine using data science of large datasets

Promoting precision medicine using data science of large datasets

image_pdfDownload PDFimage_print

An interview with Dr. Rajat Mukherjee conducted by Alina Shrourou, BSc

Please give an overview of what exactly data science is, and why it’s important to promote precision medicine.

I feel that data science is a marriage between statistical science and informatics, using statistical principals of math and logic on huge volumes of data.

© Jirsak/Shutterstock.com

You have to rely on informatics to store, read, and then apply these complex statistical algorithms to make sense of huge volumes of information.

A good use of data science can lead to major breakthroughs in medical research in areas like diagnostics, precision medicine, and real-world evidence. By taking large datasets, we can study if different approaches can have a markedly different effect for different patient populations.

What are the benefits of big data analysis compared to traditional collection and analysis?

That’s a good question. In traditional collection and analysis, or randomized clinical trials, the data are often collected in very controlled environments. Things like environmental factors may be very well controlled. On the other hand, factors like genomics may not be accounted for.

© Mopic/Shutterstock.com

Nowadays, real-world data studies or even randomized clinical trials are being designed differently to accommodate the variability and heterogeneity that environmental factors and genetic factors can bring. Data science provides a platform for systematically studying the interaction with environmental factors and genetic factors and looking at the therapeutic effects.

Please outline how you and your research team use data to inform diagnostics? What other biomedical applications are there of data science?

We study biomedical signals and images to develop statistical classifiers that can be used for diagnostics. We also work in the area of precision medicine, researching either genetics or related areas and biomarkers that can help enrich populations for targeted therapeutics. This is becoming more and more popular and important with applications in oncology, rare diseases and difficult to study diseases like Alzheimer’s and Parkinson’s disease.

Another area where data science is useful is in monitoring the patient population for both minor or harmful side effects of therapeutics that are in the market, as many side effects may only become known in the long-term, which can be hard to capture in short term clinical trials.

How can biomedical signals be used as a source of data for diagnosis? Please describe how signal data is processed and transformed to help with data analytics.

Biomedical signals are high-dimensional data, and they need a lot of what we call pre-processing. Essentially, it is the process of filtering out the noise and extracting the valuable information from these signals.

The next step would be feature extraction. We use these methods because you cannot use each and every component of the high dimensional data. You must extract features from these signals and images that are informative of disease status.

Next follows feature selection, i.e. select extracted features or their combinations that have the highest association with disease status. A diagnostic classifier can then be developed and validated using an independent test or validation set. The validation set must be independent of the data set used to develop the classifier.  In general, the diagnostic development and validation using signals and images are done in two separate clinical trials. However, our team works on different seamless options that may lead to much more efficient but still statistically valid designs.

We have been involved in a few diagnostics projects where we have taken biomedical signals and images and transformed them into classifiers. In one of the biggest projects where we have a classifier now, the pivotal validation studies are ongoing. The design of the pivotal validation trial is an operationally seamless, threshold optimization, group-sequential adaptive design which has been accepted by the CDRH, FDA.

How can data science be used to inform biomarker identification and selection? What work is Cytel doing in this area?

Biomarkers are another interesting example, as they play a key role in providing precision medicine strategies. Biomarkers can be diagnostic, prognostic or predictive. Predictive biomarkers help in enrichment strategies for therapies that may only work for a particular sub-population that may be classified as biomarker-positive sub-population.

Biomarker development relies heavily on data science techniques such as filtering and reduction of data and using machine learning techniques to classify patients into biomarker positive/negative.

Please can you also describe data mining work that you are conducting, and how it can inform decision making?

We have yet to use data mining on big data, but we will in the future. However, we have used data mining to look at go/ no-go type decisions, for example, if there have been multiple early phase studies on a particular area or therapeutic, we can pull all of this early phase data together and we do some data mining to come up with these go no-go decisions either to further the pipeline or to call it to an end.

Another new area of interest for us where we have used data mining techniques is the area of pharmacovigilance where large amounts of post-marketing data are used to generate signals for adverse events.

How important are Bayesian models within data science?

As a statistician, when you talk about real world data, I automatically think about Bayesian methods which are ideally suited to apply on accumulating data and updating the information of interest.

Bayesian methods can also be used for automatic feature extraction and selection. These areas suffer from the absence of uniform methodology and Bayesian methods can fill up that gap.

Do you think data science will change the way we manage large volumes of data? What does the future hold for data science and the healthcare and drug discovery industry?

Managing large volumes of data is part of data science, so yes, data science has an integral role to play in the way we manage large volumes of data. Data science will mean having specialized people to take care of big data in the right way, so that it can be applied in real time. Data science is going to change the nature of how big volumes of data are to be stored, accessed and applied.

I think incorporating data science in a drug-development team opens doors to having effective multidisciplinary teams attacking a common problem from different directions.  

Where can readers find more information?

About Rajat Mukherjee

Rajat Mukherjee has 15 years of professional experience as an industry and academic statistician, and brings a range of expert knowledge to Cytel’s customers. This includes work in pattern recognition problems for devices and biomarker discovery, Bayesian clinical trials, adaptive designs, and design and analysis of complex epidemiological studies.

His experience and expertise also includes statistical computing, survival analysis, longitudinal analysis, nonparametric and semiparametric inference, as well as statistical classification and high-dimensional data. Rajat has a strong background and interest in development and implementation of statistical methodology to real life medical problems.

Tagged with:

About author

Related Articles