Growth of genomic databases hinders efforts to identify bacteria

There are many ways to slice and dice genomic data to identify a species of bacteria, or at least find its close relatives. But fast genome-sequencing techniques have flooded public databases in a biased fashion, leaving them with lots of genomic data about some species and not enough about others, according to a Rice University computer scientist.

Todd Treangen and his colleagues tested taxonomic classification methods that match genomic sequences from bacteria of interest with those recorded in large databases to identify species. In the process, they charted a path toward improved accuracy and sensitivity.

Treangen is senior author of a study published this month in Genome Biology that demonstrates how changes over time in a widely used federal database, the National Center for Biotechnology Information’s RefSeq, have influenced the accuracy of metagenomic classification methods.

A primary concern for Treangen, an expert in metagenomics — the study of genetic material from environmental samples — is maintaining the ability to quickly identify bacteria that pose a threat to public health.

Big data is uniquely positioned to do this — but there’s so much of it. At present, he said, low-cost and high-throughput DNA shotgun sequencing machines, which read short DNA sequences from collections of microorganisms, have resulted in the doubling of genomic data in RefSeq every two to three years.

“I initially thought more data is always better for these methods,” said Treangen, who joined Rice this year from the University of Maryland Institute for Advanced Computer Studies. “You would expect that there would be no penalty, because database growth is good.” However, the researchers found that bacterial data in RefSeq has an outsized effect at the species level of the taxonomic hierarchy, which is growing at a breakneck pace.

That’s a problem for researchers who combine two common techniques to identify what they find. One is called k-mer-based classification, which identifies short DNA sequences from all the organisms in a bacterial sample via exact matches.

“Most of the methods that have made the problem computationally feasible rely on k-mers, which are exact matches of length ‘k,’ or a key into the microbes contained in the database,” he said. “If a sequenced read perfectly matches something in the database, the intuition is that you can say what that is with great precision and shortcut more expensive computational approaches.”
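The exact-match idea Treangen describes can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used by any tool named in the article: the function names (`kmers`, `build_index`, `classify_read`), the toy sequences, and the choice of k = 4 are all assumptions made for the example. Real classifiers use much longer k-mers and far more compact index structures.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every overlapping substring of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_index(references, k):
    """Map each k-mer to the set of reference taxa that contain it.

    `references` is a hypothetical dict of taxon name -> genome string.
    """
    index = defaultdict(set)
    for taxon, genome in references.items():
        for kmer in kmers(genome, k):
            index[kmer].add(taxon)
    return index


def classify_read(read, index, k):
    """Count, per taxon, how many k-mers of the read match exactly."""
    hits = defaultdict(int)
    for kmer in kmers(read, k):
        for taxon in index.get(kmer, ()):
            hits[taxon] += 1
    return dict(hits)


# Toy data, invented for illustration only.
references = {
    "species_A": "ACGTACGTGA",
    "species_B": "TTGACCGTAA",
}
index = build_index(references, k=4)
hits = classify_read("ACGTGA", index, k=4)
```

Here every k-mer of the read occurs only in `species_A`, so the read points to a single taxon. When a k-mer occurs in several reference genomes, the hit sets overlap and the classifier needs a rule for resolving the ambiguity.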

A commonly used technique with k-mer-based classification is lowest common ancestor (LCA) assignment, he said. LCA compares samples to sequences that share a match, assigning them if necessary to a higher level in the taxonomy, such as a genus rather than a species. But this may not be specific enough for researchers trying to pin down a pathogen, he said.
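A rough sketch of LCA assignment, assuming the taxonomy is stored as a simple child-to-parent dictionary. The function names and the tiny taxonomy below are simplified assumptions for illustration and are not drawn from the study or from any particular tool's code.

```python
def lineage(taxon, parent):
    """Return the path from a taxon up to the root, most specific first."""
    path = [taxon]
    while taxon in parent:
        taxon = parent[taxon]
        path.append(taxon)
    return path


def lowest_common_ancestor(taxa, parent):
    """Return the most specific node shared by the lineages of all taxa."""
    taxa = list(taxa)
    if not taxa:
        return None
    common = lineage(taxa[0], parent)
    for taxon in taxa[1:]:
        other = set(lineage(taxon, parent))
        # Keep only shared ancestors, preserving most-specific-first order.
        common = [node for node in common if node in other]
    return common[0] if common else None


# Simplified toy taxonomy (child -> parent), for illustration only.
parent = {
    "Bacillus anthracis": "Bacillus",
    "Bacillus cereus": "Bacillus",
    "Bacillus": "Bacillaceae",
    "Bacillaceae": "Bacteria",
    "Salmonella enterica": "Salmonella",
    "Salmonella": "Enterobacteriaceae",
    "Enterobacteriaceae": "Bacteria",
}

# A k-mer found in two Bacillus species can only be assigned to the genus.
genus_call = lowest_common_ancestor(
    ["Bacillus anthracis", "Bacillus cereus"], parent
)
# A k-mer shared across unrelated genera falls back to a much higher level.
broad_call = lowest_common_ancestor(
    ["Bacillus anthracis", "Salmonella enterica"], parent
)
```

In this sketch, each additional close relative in the database that shares a sequence pushes the assignment up from species toward genus, which is the loss of specificity described above.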

The study found that a k-mer-based classification tool called Bracken, which uses Bayesian statistics to infer the best match for a sequence, helped mitigate the imbalance. Even so, it struggled to identify genomes that have close relatives, but not perfect matches, in the database.

Treangen said well-funded research into particular pathogens is a necessity and has greatly aided rapid-outbreak detection and tracking, but it ultimately biases public databases like RefSeq.

“For instance, there’s an immense bias toward foodborne pathogens,” he said. “Society wants to know a lot about Salmonella, and rightfully so. The FDA, and specifically GenomeTrakr, have aided in the sequencing of thousands of relevant pathogens and have added them directly to the reference database.”

However, he said that skews the reference database toward particular genera and families of microbes in a way that affects the accuracy and sensitivity of fast taxonomic-classification tools like Kraken that use k-mer and LCA-based approaches.

Treangen said the best recent example of a false positive identification is a study that initially reported evidence of anthrax bacteria in New York City’s subways. The study, based on sequenced genomes from samples, was later revised to reflect mismatches that falsely identified the sequences as Bacillus anthracis.

While a focus on public health is a key priority, Treangen said novel techniques able to cope with database growth and noise, coupled with an increased breadth of sequenced genomes, are needed for continued improvements in the field. “For example, microorganisms from the soil and ocean are severely under-sampled,” he said. “There remain a lot of microbes that we need to continue to sequence to better fill out public databases, and that will ultimately help our ability to accurately classify microbes from complex samples.”

Source:

Flood of genome data hinders efforts to ID bacteria
