NDM 2012‎ > ‎

Keynote Speech

Title: "Data-intensive and Cloud Applications in Large-scale Data Center Systems" [slides]

AbstractData-intensive applications have been evolving from their original focus on offline mining of business data into broader domains, including the online inspection and analysis of large-scale web data used for rapid response to current conditions. Processing such 'data in motion' brings new challenges to the domain of data intensive computing. This talk will articulate some of those challenges, present representative solutions, and describe potential avenues for future work, in lieu of several constraints seen for this broad class of datacenter applications, including their use of shared underlying datacenter infrastructure, their support by datacenter operators, and the time-constrained operation inherent in their execution. Future research opportunities in this space include application acceleration via GPGPUs as well as new ways to enrich the open source infrastructures used to run these codes.


Regents' Professor
Director, CERCS Research Center
Georgia Institute of Technology

Prof. Karsten Schwan is a Regents’ Professor in the College of Computing at the Georgia Institute of Technology. He also a Director of the Center for Experimental Research in Computer Systems (CERCS), with co-directors from both GT's College of Computing and School of Electrical and Computer Engineering. The NSF-sponsored CERCS research center's faculty conduct research in experimental computer systems in the domains of Enterprise, High Performance, and Embedded/Pervasive Systems, with members from industry and from federal agencies. Prof. Schwan's M.S. and Ph.D. degrees are from Carnegie-Mellon University in Pittsburgh, Pennsylvania. His Ph.D. concerned high performance computing, addressing operating and programming systems support for the Cm* multiprocessor, following which he conducted extensive research in real-time and distributed systems. His current work ranges from topics in operating systems, to middleware, to parallel and distributed systems, focusing on information-intensive distributed applications in the enterprise domain (e.g., the operational information systems supporting large enterprises) and
in the high performance domain (e.g., high performance I/O). Specific techncal topics include (1) scalable techniques for virtualizing and using future multi-core and multi-machine platforms, (2) efficient methods for managing applications and services in datacenter and cloud computing systems , including new techniques for runtime performance and behavior monitoring and understanding, (3) middleware for high performance data movement, addressing I/O in future petascale machines and QoS-sensitive data streaming in pervasive and wide area systems, and (4) experimentation with representative applications in the HPC,
enterprise, and pervasive domains.