Argonne, Illinois – December 3, 2014 – A team of researchers from Argonne National Laboratory and DataDirect Networks (DDN) moved 65 terabytes of data in just under 100 minutes at a recent supercomputing conference. Typically, two days are needed to move this volume of data between sites with a 10 Gbps connection.
With help from Ciena, Brocade, and ICAIR, the team sustained data transfer rates in excess of 85 Gbps, with peaks at over 90 Gbps, between storage systems in Ottawa, Canada, and New Orleans, LA, over a 100 Gbps wide-area network (WAN) connection. The demonstration took place on November 19, 2014, at SC14, the leading international conference for high performance computing, networking, storage and analysis.
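The relationship between data volume, sustained rate, and transfer time quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the demonstration itself. It assumes decimal units (1 terabyte = 10^12 bytes, 1 Gbps = 10^9 bits per second) and ignores protocol overhead; the function names are illustrative.

```python
# Back-of-the-envelope transfer-time arithmetic.
# Assumptions (not from the announcement): decimal units, no protocol overhead.

TB = 10**12    # bytes in one terabyte (decimal)
GBPS = 10**9   # bits per second in one Gbps


def transfer_time_seconds(volume_bytes: float, rate_bps: float) -> float:
    """Time needed to move volume_bytes at a sustained rate of rate_bps."""
    return volume_bytes * 8 / rate_bps


def required_rate_gbps(volume_bytes: float, seconds: float) -> float:
    """Sustained rate, in Gbps, needed to move volume_bytes in the given time."""
    return volume_bytes * 8 / seconds / GBPS


volume = 65 * TB

for rate in (10, 85, 90):
    minutes = transfer_time_seconds(volume, rate * GBPS) / 60
    print(f"{rate:>3} Gbps sustained: {minutes:7.1f} minutes")

# Average rate needed to finish in exactly 100 minutes.
print(f"Rate for 100 minutes: {required_rate_gbps(volume, 100 * 60):.1f} Gbps")

# Average rate implied by a two-day transfer.
print(f"Rate for two days:    {required_rate_gbps(volume, 2 * 24 * 3600):.1f} Gbps")
```

Under these assumptions, finishing in under 100 minutes requires an average rate a little above 85 Gbps, which is consistent with the sustained and peak rates reported. A two-day transfer corresponds to an effective rate of roughly 3 Gbps, which suggests that the two-day figure reflects throughput typically achieved in practice on a 10 Gbps connection rather than its line rate.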
This unprecedented achievement required combining the embedded file system and virtual machine capabilities of the DDN storage controller, the high-speed wide-area data transfer capabilities of the Globus GridFTP server, and an advanced 100G wide-area network.
"Embedding the GridFTP servers in virtual machines on DDN's storage controller eliminates the need for external data transfer nodes and network adapters," explained Raj Kettimuthu, principal software development specialist at Argonne National Laboratory. "We sustained a data transfer rate of 85 Gbps for over 60 minutes—and occasionally for as long as 90 minutes—several times during the SC14 conference."
Achieving 90+ Gbps for memory-to-memory transfers using a benchmarking tool like iperf is straightforward and has been demonstrated several times in the past. Achieving similar rates for disk-to-disk transfers, however, presents a number of challenges, including choosing a block size that works well for both disk I/O and network I/O, and selecting the combination of parallel storage I/O threads and parallel TCP streams that gives the best end-to-end performance.
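One way to picture this tuning problem is as a search over three parameters: block size, number of storage I/O threads, and number of TCP streams. The sketch below is purely illustrative and is not the team's method. The `toy_measure` function is a made-up model with invented numbers that stands in for a real timed transfer, and every name in it is hypothetical.

```python
from itertools import product
from typing import Callable, Iterable, Tuple

MIB = 1024 * 1024

# (block_size_bytes, io_threads, tcp_streams)
Config = Tuple[int, int, int]


def best_config(
    block_sizes: Iterable[int],
    io_threads: Iterable[int],
    tcp_streams: Iterable[int],
    measure: Callable[[int, int, int], float],
) -> Tuple[Config, float]:
    """Try every combination once and return the one with the highest rate."""
    best, best_rate = None, float("-inf")
    for config in product(block_sizes, io_threads, tcp_streams):
        rate = measure(*config)  # in practice: run and time a real transfer
        if rate > best_rate:     # strict '>' keeps the first of any ties
            best, best_rate = config, rate
    return best, best_rate


def toy_measure(block_size: int, threads: int, streams: int) -> float:
    """Made-up throughput model in Gbps. All constants are invented."""
    # Hypothetical: disk throughput grows with threads and favors large blocks.
    disk = min(12.0 * threads, 100.0) * min(1.0, block_size / (4 * MIB))
    # Hypothetical: network throughput grows with streams, capped below 100.
    net = min(6.0 * streams, 95.0) * min(1.0, block_size / (1 * MIB))
    # The end-to-end rate is limited by the slower of the two sides.
    return min(disk, net)


config, rate = best_config(
    block_sizes=[256 * 1024, 1 * MIB, 4 * MIB, 16 * MIB],
    io_threads=[1, 2, 4, 8, 16],
    tcp_streams=[1, 2, 4, 8, 16, 32],
    measure=toy_measure,
)
print(f"best (block size, I/O threads, TCP streams): {config}")
print(f"modeled rate: {rate:.1f} Gbps")
```

The toy model takes the minimum of the disk side and the network side, so improving only one of them stops helping once the other becomes the limit. This is why the parameters have to be chosen together rather than one at a time.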
Network experts often claim that storage is the bottleneck in end-to-end transfers on high-speed networks, while storage experts claim that the network is often the bottleneck on transfers between sites with high-performance parallel file systems. "This demonstration was aimed at bringing together the experts and latest developments in all aspects concerning disk-to-disk WAN data movement, including network, storage, and data movement tools," said Kettimuthu.
The team expects that the approach can be used to achieve 100+ Gbps wide-area transfer rates between storage systems using multiple WAN paths and additional storage resources in the end systems.
Team members were Kevin Harms, Eun-Sung Jung, Raj Kettimuthu, and Linda Winkler from Argonne National Laboratory and the University of Chicago, and Mark Adams from DataDirect Networks. They received help from Jim Chen and Joe Mambretti from ICAIR; Doug Hogg and Marc Lyonnais from Ciena; Wilbur Smith from Brocade; Jon Dugan and Brian Tierney from ESnet; Ian Foster and Mike Link from Argonne National Laboratory and the University of Chicago; and Clayton Walker, Laura Shepard, Susan Presley, and Bob Vassar from DDN.
About Argonne National Laboratory
Argonne National Laboratory seeks solutions to pressing national problems in science and technology. The nation's first national laboratory, Argonne conducts leading-edge basic and applied scientific research in virtually every scientific discipline. Argonne researchers work closely with researchers from hundreds of companies, universities, and federal, state and municipal agencies to help them solve their specific problems, advance America's scientific leadership and prepare the nation for a better future. With employees from more than 60 nations, Argonne is managed by UChicago Argonne, LLC for the U.S. Department of Energy's Office of Science. For more information, visit www.anl.gov.
About DataDirect Networks
DataDirect Networks (DDN) is the world leader in massively scalable storage. DDN's data storage and processing solutions and professional services enable content-rich and high growth IT environments to achieve the highest levels of systems scalability, efficiency and simplicity. DDN enables enterprises to extract value and deliver business results from their information. DDN's customers include the world's leading online content and social networking providers, high performance cloud and grid computing, life sciences, media production, and security and intelligence organizations. Deployed in thousands of mission critical environments worldwide, DDN's solutions have been designed, engineered and proven in the world's most scalable data centers to ensure competitive business advantage for today's information powered enterprise. For more information, go to www.ddn.com.
About SC14
SC14, sponsored by the IEEE Computer Society and the Association for Computing Machinery, offers a complete technical education program and exhibition to showcase the many ways high performance computing, networking, storage and analysis lead to advances in scientific discovery, research, education and commerce. This premier international conference includes a globally attended technical program, workshops, tutorials, a world class exhibit area, demonstrations and opportunities for hands-on learning. For more information on SC14, please visit: http://sc14.supercomputing.org.