Breaking News
November 18, 2018 - FDA Approves Aemcolo (rifamycin) to Treat Travelers’ Diarrhea
November 18, 2018 - Poverty blamed on widening north-south gap in young adult deaths in England
November 18, 2018 - Progress in meningitis lags far behind other vaccine-preventable diseases, analysis shows
November 18, 2018 - Consensus Statement Issued on Management of Foot, Ankle Gout
November 18, 2018 - Fine particle air pollution is a public health emergency hiding in plain sight
November 18, 2018 - In-hospital mortality higher among patients with drug-resistant infections
November 17, 2018 - Research shines new, explanatory light on link between obesity and cancer
November 17, 2018 - FIND explores new diagnostic assays for confirmatory HCV diagnosis in community settings
November 17, 2018 - Tracking Preemies’ Head Size May Yield IQ Clues
November 17, 2018 - Scientists call for unified standards in 3-D genome and epigenetic data
November 17, 2018 - Lab Innovations 2018 has beaten all records by attracting 3,113 attendees
November 17, 2018 - Sexuality education before age 18 may reduce risk of sexual assault in college
November 17, 2018 - Lab Innovations 2018 confirmed as a major hit with visitors, exhibitors and speakers
November 17, 2018 - Largest parasitic worm genetic study hatches novel treatment possibilities
November 17, 2018 - UCLA biologists uncover how head injuries can lead to serious brain disorders
November 17, 2018 - Static and dynamic physical activities offer varying protection against heart disease
November 17, 2018 - Obesity significantly increases risk of Type 2 diabetes and coronary artery disease
November 17, 2018 - People with rare cancers can benefit from genomic profiling, shows research
November 17, 2018 - NIH awards over $1.8 million to husband-and-wife doctors to test new breast cancer approach
November 17, 2018 - Four-in-one antibody used to fight flu shows promise in mice
November 17, 2018 - New approach allows pathogens to be starved by blocking important enzymes
November 17, 2018 - Higher body mass index could cause depression even without health problems
November 17, 2018 - Protein which plays role in sensing cell damage serves as new target to treat pulmonary hypertension
November 17, 2018 - FDA Approves Adcetris (brentuximab vedotin) in Combination with Chemotherapy for Adults with Previously Untreated Systemic Anaplastic Large Cell Lymphoma or Other CD30-Expressing Peripheral T-Cell Lymphomas
November 17, 2018 - ID specialist input improves outcomes for outpatient parenteral antimicrobial therapy
November 17, 2018 - UT Southwestern scientists selected to receive 2019 Edith and Peter O’Donnell Awards
November 17, 2018 - New clinical algorithm to help individuals manage type 2 diabetes when fasting during Ramadan
November 17, 2018 - Researchers identify LZTR1 as evolutionarily conserved component of RAS pathway
November 17, 2018 - Heart Disease Leading Cause of Death in Low-Income Counties
November 17, 2018 - Estrogen Levels Test: MedlinePlus Lab Test Information
November 17, 2018 - Research reveals link between immunity, diabetes
November 17, 2018 - Research shows how to achieve improved smoking cessation outcomes within California’s Medicaid population
November 17, 2018 - New study finds less understanding and implementation of patient engagement
November 17, 2018 - New shoe insole technology could help diabetic ulcers heal better while walking
November 17, 2018 - New method to extend cell division and immortalization of avian-derived cells
November 17, 2018 - Australian Academy of Science urges parents to vaccinate children against meningococcal disease
November 17, 2018 - Hot water treatment may help improve inflammation and metabolism in sedentary people
November 17, 2018 - Researchers produce 3D chemical maps of small biological samples
November 17, 2018 - Must Blood Pressure Rise Wth Age? Remote Tribes Hold Clues
November 17, 2018 - Noonan Syndrome
November 17, 2018 - Interventions to delay and prevent type 2 diabetes are underused, researchers say
November 17, 2018 - Hackathon prize winner seeks to remotely monitor patient skin conditions
November 17, 2018 - Research team identifies Ashkenazi Jewish founder mutation for Leigh syndrome
November 17, 2018 - Gene editing could be used to halt kidney disease in patients with Joubert syndrome
November 17, 2018 - Study uncovers link between gut disruption and aging
November 17, 2018 - Teens more likely to pick up smoking after exposure from friends and family
November 17, 2018 - Nicoya designate the Institute for Stem Cell Biology and Regenerative Medicine as the OpenSPR Centre of Excellence
November 17, 2018 - new horizon in dental, oral and craniofacial research
November 17, 2018 - How does poor air quality affect your health?
November 17, 2018 - New device can regulate children’s blood glucose more like natural pancreas
November 17, 2018 - Game-Changers in Western Blotting and Protein Analysis
November 17, 2018 - FDA announces new actions to limit sale of e-cigarettes to youth
November 17, 2018 - Warmer winter temperatures related to higher crime rates
November 17, 2018 - MCO places increasing emphasis on helping people find and access healthy food
November 17, 2018 - Group of students aim to improve malaria diagnosis using old smartphones
November 17, 2018 - Transplantation of feces may protect preterm children from deadly bowel disease
November 17, 2018 - Researchers explore whether low-gluten diets can be recommended for people without allergies
November 17, 2018 - New and better marker for assessing patients after cardiac arrest
November 17, 2018 - For 7-year-old with failing bone marrow, a life-saving transplant | News Center
November 17, 2018 - New first-line treatment for peripheral T-cell lymphoma approved by FDA
November 17, 2018 - Artificial intelligence could be valuable tool to help young victims disclose traumatic testimony
November 17, 2018 - Breakthrough in the treatment of Restless Legs Syndrome
November 16, 2018 - FDA Approves Keytruda (pembrolizumab) for the Treatment of Patients with Hepatocellular Carcinoma (HCC) Who Have Been Previously Treated with Sorafenib
November 16, 2018 - Eagle Books | Native Diabetes Wellness Program
November 16, 2018 - Patients with common heart failure more likely to have lethal heart rhythms
November 16, 2018 - How AI could help veterinarians code their notes | News Center
November 16, 2018 - Bias-based bullying does more harm to students than generalized bullying
November 16, 2018 - Researchers find first direct evidence that cerebellum plays role in cognitive functions
November 16, 2018 - Non-coding genetic variant plays key role in endothelial function and disease incidence
November 16, 2018 - EMA recommends first all-oral treatment to tackle deadly sleeping sickness
November 16, 2018 - Drug used to treat dizziness may slow down growth of triple-negative breast cancer
November 16, 2018 - AHA: Icosapent Ethyl Cuts CV Risk From Elevated Triglycerides
November 16, 2018 - ‘Orphan’ RNAs make cancer deadlier, but potentially easier to diagnose
November 16, 2018 - Air Cube touches down at hospital | News Center
November 16, 2018 - CRISPR-based tool shown to enhance cell-based immunotherapy
November 16, 2018 - Mechanisms that govern HIV latency differ in the gut and blood, finds study
November 16, 2018 - Researchers unravel mystery of NPM1 protein in acute myeloid leukemia
November 16, 2018 - High school students less likely to select milk, fruit for lunch when fruit juice is available
November 16, 2018 - Football coaches with great emotional competence are more successful
November 16, 2018 - Researchers awarded $10 million grant to address root causes of asthma in Puerto Rico
Growth of genomic databases hinders efforts to identify bacteria

Growth of genomic databases hinders efforts to identify bacteria

image_pdfDownload PDFimage_print

There are many ways to slice and dice genomic data to identify a species of bacteria, or at least find its close relatives. But fast techniques to sequence genomes have flooded the public databases and in a biased fashion, containing lots of genomic data about some species and not enough about others, according to a Rice University computer scientist.

Todd Treangen and his colleagues tested taxonomic classification methods that match genomic sequences from bacteria of interest with those recorded in large databases to identify species. In the process, they charted a path toward improved accuracy and sensitivity.

Treangen is senior author of a study published this month in Genome Biology that demonstrates how changes over time in a widely used federal database, the National Center for Biotechnology Information’s RefSeq, have influenced the accuracy of metagenomic classification methods.

A primary concern for Treangen, an expert in metagenomics — the study of genetic material from environmental samples — is maintaining the ability to quickly identify bacteria that pose a threat to public health.

Big data is uniquely positioned to do this — but there’s so much of it. At present, he said, low-cost and high-throughput DNA shotgun sequencing machines, which read short DNA sequences from collections of microorganisms, have resulted in the doubling of genomic data in RefSeq every two to three years.

“I initially thought more data is always better for these methods,” said Treangen, who joined Rice this year from the University of Maryland Institute for Advanced Computer Studies. “You would expect that there would be no penalty, because database growth is good.” However, the researchers found that bacterial data in RefSeq has an outsized effect at the species level of the taxonomic hierarchy, which is growing at a breakneck pace.

That’s a problem for researchers who combine two common techniques to identify what they find. One is called k-mer-based classification, which identifies short DNA sequences from all the organisms in a bacterial sample via exact matches.

“Most of the methods that have made the problem computationally feasible rely on k-mers, which are exact matches of length ‘k,’ or a key in to the microbes contained in the database,” he said. “If a sequenced read perfectly matches something in the database, the intuition is that you can say what that is with great precision and shortcut more expensive computational approaches.”

A commonly used technique with k-mer-based classification is lowest common ancestor (LCA) assignment, he said. LCA compares samples to sequences that share a match, assigning them if necessary to a higher level in the taxonomy, such as a genus rather than a species. But this may not be specific enough for researchers trying to pin down a pathogen, he said.

In fact, the study found a k-mer-based classification tool called Bracken, which uses Bayesian statistics to infer the best match for a sequence, helped mitigate the imbalance. Even so, it struggled to identify genomes with close relatives, but not perfect matches, in the database.

Treangen said well-funded research into particular pathogens is a necessity and has greatly aided rapid-outbreak detection and tracking, but it ultimately biases public databases like RefSeq.

“For instance, there’s an immense bias toward foodborne pathogens,” he said. “Society wants to know a lot about Salmonella, and rightfully so. The FDA, and specifically GenomeTrakr, have aided in the sequencing of thousands of relevant pathogens and have added them directly to the reference database.”

However, he said that skews the reference database toward particular genera and families of microbes in a way that affects the accuracy and sensitivity of fast taxonomic-classification tools like Kraken that use k-mer and LCA-based approaches.

Treangen said the best recent example of a false positive identification is a study that initially reported evidence of anthrax bacteria in New York City’s subways. The study, based on sequenced genomes from samples, was later revised to reflect mismatches that falsely identified the sequences as Bacillus anthracis.

While a focus on public health is a key priority, Treangen said novel techniques able to cope with database growth and noise, coupled with an increased breadth of sequenced genomes, is needed for continued improvements in the field. “For example, microorganisms from the soil and ocean are severely under-sampled,” he said. “There remain a lot of microbes that we need to continue to sequence to better fill out public databases, and that will ultimately help our ability to accurately classify microbes from complex samples.”

Source:

Flood of genome data hinders efforts to ID bacteria

Tagged with:

About author

Related Articles