Pathosphere.org: pathogen detection and characterization through a web-based, open source informatics platform

  • Andy Kilianski (投稿者)
  • Patrick Carcel (投稿者)
  • Shijie Yao (投稿者)
  • Pierce Roth (投稿者)
  • Josh Schulte (投稿者)
  • Greg B. Donarum (投稿者)
  • Ed T. Fochler (投稿者)
  • Jessica M. Hill (投稿者)
  • Alvin T. Liem (投稿者)
  • Michael R. Wiley (投稿者)
  • Jason T. Ladner (投稿者)
  • Bradley P. Pfeffer (投稿者)
  • Oliver Elliot (投稿者)
  • A. Petrosov (投稿者)
  • Dereje D. Jima (投稿者)
  • Tyghe G. Vallard (投稿者)
  • Melanie C. Melendrez (投稿者)
  • Evan Skowronski (投稿者)
  • Phenix Lan Quan (投稿者)
  • W. Ian Lipkin (投稿者)
  • Henry S. Gibbons (投稿者)
  • David L. Hirschberg (投稿者)
  • Gustavo Palacios (投稿者)
  • C. Nicole Rosenzweig (投稿者)

数据集

描述

Abstract Background The detection of pathogens in complex sample backgrounds has been revolutionized by wide access to next-generation sequencing (NGS) platforms. However, analytical methods to support NGS platforms are not as uniformly available. Pathosphere (found at Pathosphere.org) is a cloud - based open - sourced community tool that allows for communication, collaboration and sharing of NGS analytical tools and data amongst scientists working in academia, industry and government. The architecture allows for users to upload data and run available bioinformatics pipelines without the need for onsite processing hardware or technical support. Results The pathogen detection capabilities hosted on Pathosphere were tested by analyzing pathogen-containing samples sequenced by NGS with both spiked human samples as well as human and zoonotic host backgrounds. Pathosphere analytical pipelines developed by Edgewood Chemical Biological Center (ECBC) identified spiked pathogens within a common sample analyzed by 454, Ion Torrent, and Illumina sequencing platforms. ECBC pipelines also correctly identified pathogens in human samples containing arenavirus in addition to animal samples containing flavivirus and coronavirus. These analytical methods were limited in the detection of sequences with limited homology to previous annotations within NCBI databases, such as parvovirus. Utilizing the pipeline-hosting adaptability of Pathosphere, the analytical suite was supplemented by analytical pipelines designed by the United States Army Medical Research Insititute of Infectious Diseases and Walter Reed Army Institute of Research (USAMRIID-WRAIR). These pipelines were implemented and detected parvovirus sequence in the sample that the ECBC iterative analysis previously failed to identify. Conclusions By accurately detecting pathogens in a variety of samples, this work demonstrates the utility of Pathosphere and provides a platform for utilizing, modifying and creating pipelines for a variety of NGS technologies developed to detect pathogens in complex sample backgrounds. These results serve as an exhibition for the existing pipelines and web-based interface of Pathosphere as well as the plug-in adaptability that allows for integration of newer NGS analytical software as it becomes available.
可用的日期1 一月 2015
出版商Figshare

引用此