Breaking News
January 19, 2019 - Highly effective protocol to prepare cannabis samples for THC/CBD analysis
January 19, 2019 - Prinston Pharmaceutical Inc. Issues Voluntary Nationwide Recall of Irbesartan and Irbesartan HCTZ Tablets Due to Detection of a Trace Amount of Unexpected Impurity, N-Nitrosodiethylamine (NDEA) in the Products
January 19, 2019 - How does solid stress from brain tumors cause neuronal loss, neurologic dysfunction?
January 19, 2019 - $14.7 million partnership to supercharge vaccine development
January 19, 2019 - Ian Fotheringham receives Charles Tennant Memorial Lecture award
January 19, 2019 - Brain vital signs detect neurophysiological impairments in players with concussions
January 19, 2019 - Lack of job and poor housing conditions increased likelihood of people attending A&E
January 19, 2019 - Novel targeted drug delivery system improves conventional cancer treatments
January 19, 2019 - Rutgers study finds gene responsible for spread of prostate cancer
January 19, 2019 - Complications Higher Than Expected for Invasive Lung Tests
January 19, 2019 - 3-D printed implant promotes nerve cell growth to treat spinal cord injury
January 19, 2019 - Automated texts lead to improved outcomes after total knee or hip replacement surgery
January 19, 2019 - Poor cardiorespiratory fitness could increase risk of future heart attack, finds new study
January 19, 2019 - Drinking soft drinks while exercising in hot weather may increase risk of kidney disease
January 19, 2019 - Formlabs 3D prints anatomical models
January 19, 2019 - Heart-Healthy Living Also Wards Off Type 2 Diabetes
January 19, 2019 - Teaching Kids to Be Smart About Social Media (for Parents)
January 19, 2019 - Metabolite produced by gut microbiota from pomegranates reduces inflammatory bowel disease
January 19, 2019 - Researchers examine how spray from showers and toilets expose us to disease causing bacteria
January 19, 2019 - Behavioral experiments confirm that additional neurons improve brain function
January 19, 2019 - New study compares performance of real-time infectious disease forecasting models
January 19, 2019 - Obesity can be risk factor for developing renal cell carcinoma, confirms study
January 19, 2019 - New regulation designs on cigarette packs direct smokers’ attention to health warnings
January 19, 2019 - QIAGEN receives first companion diagnostic approval in Japan
January 19, 2019 - Study explores role of Dunning-Kruger effect in anti-vaccine attitudes
January 19, 2019 - Newly identified subset of immune cells may be key to fighting chronic inflammation
January 19, 2019 - New immune response regulators discovered
January 18, 2019 - Poor blood oxygenation during sleep predicts chance of heart-related death
January 18, 2019 - First international consensus on the diagnosis and management of fibromuscular dysplasia
January 18, 2019 - Rapid resistance gene sequencing technology can hasten identification of antibiotic-resistant bacteria
January 18, 2019 - Researchers develop artificial enzymatic pathway for synthesizing isoprenoids in E. coli
January 18, 2019 - Scientists advise caution in immunotherapy research
January 18, 2019 - How children across the world develop language
January 18, 2019 - Columbia Medical Student Receives McDonogh Scholarship
January 18, 2019 - Secretive ‘Rebate Trap’ Keeps Generic Drugs For Diabetes And Other Ills Out Of Reach
January 18, 2019 - Plant based diet could be the best option for the planet says commission
January 18, 2019 - New conservation practice could reduce nitrogen from agricultural drainage, study shows
January 18, 2019 - UIC researchers receive $1.7 million NCI grant to study Southeast Asian fruit
January 18, 2019 - New study determines the fate of DNA derived from genetically modified food
January 18, 2019 - Scientists develop new gene therapy that prevents axon destruction in mice
January 18, 2019 - Study finds critically low HPV vaccination rates among younger adolescents in the U.S.
January 18, 2019 - Brain cells involved in memory play key role in reducing future eating behavior
January 18, 2019 - Risk for Conversion of MS Varies With Different Therapies
January 18, 2019 - Investigational cream may help patients with inflammatory skin disease
January 18, 2019 - Medical school news office receives six writing awards | News Center
January 18, 2019 - County By County, Researchers Link Opioid Deaths To Drugmakers’ Marketing
January 18, 2019 - Research reveals risk for developing more than one mental health disorder
January 18, 2019 - Scientists discover a dramatic pattern of bone growth in female mice
January 18, 2019 - Study finds link between lengthy periods of undisturbed maternal sleep and stillbirths
January 18, 2019 - New nuclear medicine method could improve detection of primary and metastatic melanoma
January 18, 2019 - Combination therapy shows high efficacy in treating people with leishmaniasis and HIV
January 18, 2019 - Health Tip: Don’t Ignore Changes in Skin Color
January 18, 2019 - Dietary Recommendations for Healthy Children
January 18, 2019 - Eliminating the latent reservoir of HIV
January 18, 2019 - Pain From The Government Shutdown Spreads. This Time It’s Food Stamps
January 18, 2019 - Newly discovered regulatory mechanism helps control fat metabolism
January 18, 2019 - New rapid blood tests could speed up TB diagnosis, save the NHS money
January 18, 2019 - Researchers develop intelligent system for ‘tuning’ powered prosthetic knees
January 18, 2019 - Monoclonal antibody pembrolizumab prolongs survival in patients with squamous cell carcinoma
January 18, 2019 - New research detects mosquito known to transmit malaria for the first time in Ethiopia
January 18, 2019 - Researchers identify new genes linked to development of age-related macular degeneration
January 18, 2019 - Computerized method helps better protect pharma patents
January 18, 2019 - New guidelines to make swallowing safer for people in Australian nursing homes
January 18, 2019 - Lumex Instruments’ RA-915AM monitor installed at Hg treatment plant in Almadén, Spain
January 18, 2019 - ACCC survey finds multiple threats to growth of cancer programs
January 18, 2019 - Meeting the challenge of engaging men in HIV prevention and treatment
January 18, 2019 - Furloughed Feds’ Health Coverage Intact, But Shutdown Still Complicates Things
January 18, 2019 - Experts discuss various aspects on health risks posed by fumigated containers
January 18, 2019 - Researchers use gene-editing tool CRISPR/Cas9 to limit impact of parasitic diseases
January 18, 2019 - Alpha neurofeedback training could be a means of enhancing learning success
January 18, 2019 - Innovative ‘light’ method demonstrates positive results in fight against malignant tumors
January 18, 2019 - The cytoskeleton of neurons found to play role in Alzheimer’s disease
January 18, 2019 - New resource-based approach to improve HIV care in low- and middle-income countries
January 18, 2019 - Bedfont appoints Dr Jafar Jafari as first member of the Gastrolyzer Medical Advisory Board
January 18, 2019 - New study shows link between secondhand smoke and cardiac arrhythmia
January 18, 2019 - DZIF scientists reveal problems with available diagnostics for Zika and chikungunya virus
January 18, 2019 - Breast cancers more likely to metastasize in young women within 10 years of giving birth
January 18, 2019 - Over 5.6 million Americans exposed to high nitrate levels in drinking water
January 18, 2019 - Blood vessels can now be created perfectly in a petri dish
January 18, 2019 - Study identifies prominent socioeconomic and racial disparities in health behavior in Indiana
Growth of genomic databases hinders efforts to identify bacteria

Growth of genomic databases hinders efforts to identify bacteria

image_pdfDownload PDFimage_print

There are many ways to slice and dice genomic data to identify a species of bacteria, or at least find its close relatives. But fast techniques to sequence genomes have flooded the public databases and in a biased fashion, containing lots of genomic data about some species and not enough about others, according to a Rice University computer scientist.

Todd Treangen and his colleagues tested taxonomic classification methods that match genomic sequences from bacteria of interest with those recorded in large databases to identify species. In the process, they charted a path toward improved accuracy and sensitivity.

Treangen is senior author of a study published this month in Genome Biology that demonstrates how changes over time in a widely used federal database, the National Center for Biotechnology Information’s RefSeq, have influenced the accuracy of metagenomic classification methods.

A primary concern for Treangen, an expert in metagenomics — the study of genetic material from environmental samples — is maintaining the ability to quickly identify bacteria that pose a threat to public health.

Big data is uniquely positioned to do this — but there’s so much of it. At present, he said, low-cost and high-throughput DNA shotgun sequencing machines, which read short DNA sequences from collections of microorganisms, have resulted in the doubling of genomic data in RefSeq every two to three years.

“I initially thought more data is always better for these methods,” said Treangen, who joined Rice this year from the University of Maryland Institute for Advanced Computer Studies. “You would expect that there would be no penalty, because database growth is good.” However, the researchers found that bacterial data in RefSeq has an outsized effect at the species level of the taxonomic hierarchy, which is growing at a breakneck pace.

That’s a problem for researchers who combine two common techniques to identify what they find. One is called k-mer-based classification, which identifies short DNA sequences from all the organisms in a bacterial sample via exact matches.

“Most of the methods that have made the problem computationally feasible rely on k-mers, which are exact matches of length ‘k,’ or a key in to the microbes contained in the database,” he said. “If a sequenced read perfectly matches something in the database, the intuition is that you can say what that is with great precision and shortcut more expensive computational approaches.”

A commonly used technique with k-mer-based classification is lowest common ancestor (LCA) assignment, he said. LCA compares samples to sequences that share a match, assigning them if necessary to a higher level in the taxonomy, such as a genus rather than a species. But this may not be specific enough for researchers trying to pin down a pathogen, he said.

In fact, the study found a k-mer-based classification tool called Bracken, which uses Bayesian statistics to infer the best match for a sequence, helped mitigate the imbalance. Even so, it struggled to identify genomes with close relatives, but not perfect matches, in the database.

Treangen said well-funded research into particular pathogens is a necessity and has greatly aided rapid-outbreak detection and tracking, but it ultimately biases public databases like RefSeq.

“For instance, there’s an immense bias toward foodborne pathogens,” he said. “Society wants to know a lot about Salmonella, and rightfully so. The FDA, and specifically GenomeTrakr, have aided in the sequencing of thousands of relevant pathogens and have added them directly to the reference database.”

However, he said that skews the reference database toward particular genera and families of microbes in a way that affects the accuracy and sensitivity of fast taxonomic-classification tools like Kraken that use k-mer and LCA-based approaches.

Treangen said the best recent example of a false positive identification is a study that initially reported evidence of anthrax bacteria in New York City’s subways. The study, based on sequenced genomes from samples, was later revised to reflect mismatches that falsely identified the sequences as Bacillus anthracis.

While a focus on public health is a key priority, Treangen said novel techniques able to cope with database growth and noise, coupled with an increased breadth of sequenced genomes, is needed for continued improvements in the field. “For example, microorganisms from the soil and ocean are severely under-sampled,” he said. “There remain a lot of microbes that we need to continue to sequence to better fill out public databases, and that will ultimately help our ability to accurately classify microbes from complex samples.”

Tagged with:

About author

Related Articles