Technical white paper

Advancing next-generation sequencing analytics with HPE and Intel-based solutions

Executive summary

Next-generation sequencing (NGS) is delivering on its promise as a life sciences research tool and is rapidly evolving with many new developments to increase speed, improve accuracy, and reduce costs of sequencing. In this white paper, the results of two recent benchmark tests will be used to help customers select High Performance Computing (HPC) solutions that will accelerate the performance of their genomic analysis workloads, and demonstrate the high throughput rates that can be achieved with HPE ProLiant XL170r Gen10 Servers configured with Intel Xeon Gold processors, Intel Omni-Path Architecture (Intel OPA) interconnects, fast local SSD storage, and Lustre based remote storage. This prepackaged offering from HPE conforms to both the Broad-Intel Genomic Stack (BIGstack) reference architecture and the Broad Institute's Genome Analysis Toolkit (GATK) Best Practices pipeline.

Faster next-generation sequencing accelerates scientific discovery

Next-generation sequencing (NGS) is a key data-intensive technology in life sciences that is rapidly enhancing the capabilities of biotechnology, healthcare, pharmaceuticals, and life sciences organizations where rising healthcare costs demand an extraordinary rate of innovation. In fact, rapid advances in NGS technologies, HPC infrastructure, and data analytics such as artificial intelligence (AI) and deep learning (DL) are producing breakthroughs in life sciences research that are cost-effective and providing answers to long sought genomic questions.
Fortunately, there continues to be a dramatic decline in costs, even as the sequencing volume of genomic data sets is rising (Figure 1). However, the speed of scientific discovery is still limited by the scalability, flexibility, ease of use, and affordability of the HPC infrastructure used to rapidly process genomic data sets.