GTC 2012 Program Guide - GPU Technology Conference

present an efficient method for directly tessellating NURBS surfaces using the NVIDIA CUDA computing API.
Speaker(s): Brent Oster (Applied Engineer, NVIDIA)
Topic(s): Computer Graphics (Advanced)

TUESDAY, MAY 15, 15:00 (50 MINUTES)
ROOM A3
S0407 A High Level Programming Environment for Accelerated Computing
One of the critical hurdles for the widespread adoption of accelerated computing in HPC is programming difficulty. Users need a simple programming model that is portable and is not significantly different from the approaches used on current multi-core x86 processors. In this talk I will present Cray’s approach to accelerator programming, which is based on a high level programming environment with tightly coupled compilers, libraries, and tools. Ease of use comes from compilers that make it feasible for users to write applications in Fortran, C, and C++, tools that help users port and optimize for accelerators, and auto-tuned scientific libraries.
Speaker(s): Luiz DeRose (Director of Programming Environment, Cray Inc.)
Topic(s): Development Tools & Libraries, Parallel Programming Languages & Compilers (Intermediate)

TUESDAY, MAY 15, 15:00 (50 MINUTES)
ROOM A5
S0413 Delivering 3D Professional Graphics from the Cloud with Citrix XenDesktop
Recent technological advances have made it practical to deliver 3D professional graphics applications from the Cloud (private or public) with a high quality user experience and at an attractive cost. Organizations can keep their intellectual property safe in the data center since only fully-rendered screen images are sent over the network. Users in remote locations no longer have to wait for large file transfers. And they can access 3D models from a wide variety of devices, including iPads and Android tablets. Learn how Citrix XenDesktop, XenServer and Receiver technologies have made all of this a reality for many organizations today.
Speaker(s): Derek Thorslund (Director of Product Management, Citrix Systems, Inc.)
Topic(s): Cloud Computing, Computer Graphics, Visualization (Beginner)

TUESDAY, MAY 15, 15:00 (25 MINUTES)
ROOM A7
S0436 Integrated GPU Acceleration With Real Time Visualization Of Terabyte Data
Computation and visualization don’t necessarily have to act as two separate entities. This talk explains the integration of real-time compute with real-time visualization. Industry and academia have provided attractive solutions for compiler-directive optimized code for computations. To support cases that involve massive yet ad-hoc data I/O and computation with interactive visualization, Hue developed a different model which bridges the gap between “complete system rewrite” and “compiler directive optimized code”. The talk explains how highly optimized data I/O mechanisms coupled with predefined input and output definitions for kernels provide excellent scalability and interactivity during runtime.
Speaker(s): Kelly Walker (Senior Software Developer, Hue)
Topic(s): Visualization, Energy Exploration (Beginner)

TUESDAY, MAY 15, 15:00 (25 MINUTES)
ROOM B
S0611 Edge-Aware Shaders for Real-Time Computer Graphics
The most common approach in rendering is to define behavior at a point in terms of material properties and incident illumination. That approach works well when the geometry and material properties are well-known, and the light physics are simulated accurately. We present a technique to help in situations where the model and/or physics is incomplete. This technique augments shaders with information about nearby edges, such as corners and boundaries between materials, and makes it natural to add richness procedurally near these visually critical regions.
Speaker(s): Peter-Pike Sloan (Principal Research Scientist, NVIDIA)
Topic(s): Computer Graphics (Intermediate)

TUESDAY, MAY 15, 15:00 (50 MINUTES)
ROOM M
S0620 VSIPL++: A High-Level Programming Model for Productivity and Performance (Presented by Mentor Graphics)
Learn how VSIPL++ can improve your productivity and provide software portability, without sacrificing performance. We will describe how VSIPL++’s open-standard high-level programming model addresses the challenges of writing high-performance embedded software on GP-GPUs and other heterogeneous hardware, using advanced C++ techniques and data abstraction, and how we make this work in the real world. We will also present a comparison of performance results from various configurations of CPU and GP-GPU processing engines for a signal processing application developed using VSIPL++.
Speaker(s): Brooks Moses, Ph.D. (Sourcerer, Mentor Graphics Corporation)
Topic(s): Supercomputing (Beginner)

TUESDAY, MAY 15, 15:00 (25 MINUTES)
ROOM A2
S0625 S3D Direct Numerical Simulation - Preparations for the 10-100PF Era
The evolution of supercomputing into the mid-petaflop era has been typified by heterogeneous compute nodes with the majority of the compute capability delivered by a large number of lightweight cores. In order to prepare for the extension of this trend, the DNS code S3D has been retooled in anticipation of a target architecture offering tens of thousands of heterogeneous nodes containing many x86 cores as well as GPU-derived accelerators. Movement of outer loops to the highest level in the code facilitates hybrid MPI-OpenMP performance and an elegant path to accelerated kernels using OpenACC. It is anticipated that relevant scientific simulations at this scale will have a per-node footprint that can be contained entirely on the accelerator, so provision is made to maintain primary solution variables in accelerator memory with specific regions moved to the CPU for inter-node communication and workload balancing. With the current performance it is estimated that the new code will make it possible to meet early science goals with the full build-out of the anticipated Titan system as well as provide a platform to transition into the exascale software research space.
Speaker(s): Ray Grout (National Renewable Energy Laboratory)
Topic(s): Supercomputing (Beginner)

TUESDAY, MAY 15, 15:00 (50 MINUTES)
ROOM NVIDIA NSIGHT LAB
S0802 CUDA Profiler Training on Windows
Nsight offers a comprehensive set of performance analysis tools. From tracing system-wide multi-core CPU and multi-GPU activity to profiling CUDA kernels with precise profiling experiments, developers can identify system level optimization opportunities as well as expensive and inefficient CUDA kernels requiring in-depth analysis with the CUDA profiler. Through a set of comprehensive exercises, the attendee will be able to utilize these features to become fully proficient at optimizing complex CUDA applications.
Speaker(s): NVIDIA Developer Tools Team
Topic(s): Development Tools & Libraries (Beginner)

TUESDAY, MAY 15, 15:30 (25 MINUTES)
ROOM K
S0152 Accurate Sequence Alignment using Distributed Filtering on GPU Clusters
Learn how GPUs enable new ways to rethink a complex bioinformatics problem: accurate sequence alignment. What was once prohibitive to compute can become the basic block of novel GPU-based algorithms. Modern DNA sequencing machines generate enormous amounts of short sequences within minutes, and they should be aligned to a reference genome in real time. Most solutions only find a few locations that match a short sequence. We introduce a new technique to find all matching locations inside a reference sequence for a given number of mismatches. Our technique is based on a distributed filtering scheme and GPU-based processing.
Speaker(s): Reza Farivar (PhD Student, University of Illinois at Urbana-Champaign), Shivaram Venkataraman (PhD Student, UC Berkeley)
Topic(s): Bioinformatics, Algorithms & Numerical Techniques (Intermediate)

TUESDAY, MAY 15, 15:30 (25 MINUTES)
ROOM J3
S0316 Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation
This presentation describes our development of a GPU-accelerated backpropagation implementation for Synthetic Aperture Sonar systems that supports multiple nodes via MPI and multi-GPU nodes. This implementation can form a complex-valued gigapixel image in one hour on a single C2050. We further scale this implementation to the Keeneland system where we can form the same gigapixel image in 21 seconds on 48 nodes with 144 C2070 Tesla GPUs. Our talk will discuss the details of our implementation, including our optimizations and scaling results for various node and GPU configurations, as well as the applicability to other domains, including Synthetic Aperture Radar.
Speaker(s): Thomas Benson (Research Engineer II, Georgia Tech Research Institute)
Topic(s): Application Design & Porting Techniques (Intermediate)

TUESDAY, MAY 15, 15:30 (50 MINUTES)
ROOM J1
S0366 OptiX Out-of-Core and CPU Rendering
OptiX has broken some major barriers recently by enabling out-of-GPU-core memory rendering and by adding a CPU rendering back-end when an OptiX-capable GPU is not present in the system. OptiX users and CUDA developers will be interested in how we accomplished these feats within the existing GPU architecture. This talk will provide a brief introduction to OptiX and then dive into what the new features provide. We will then go under the covers and show how we pulled it off.
Speaker(s): David McAllister (OptiX Manager, NVIDIA, OptiX group)
Topic(s): Ray Tracing, Computer Graphics (Intermediate)

TUESDAY, MAY 15, 15:30 (50 MINUTES)
ROOM B
S0409 Stochastic Rasterization
Learn how to render transparency, motion blur, and depth of field effects in real time using random sampling. These effects combine multiple objects in each pixel, making them expensive to compute directly. But recent research shows that, with stratified sampling and clever reconstruction, good image quality can be achieved with surprisingly small numbers of samples per pixel. We will explain how to do this on the GPU, and explore trade-offs of performance, quality, accuracy, and noise.
Speaker(s): Eric Enderton (Research Scientist, NVIDIA), Morgan McGuire (Visiting Professor, NVIDIA and Williams College)
Topic(s): Computer Graphics, Digital Content Creation & Film (Intermediate)

TUESDAY, MAY 15, 15:30 (25 MINUTES)
ROOM A7
S0444 Explore New Techniques in Volume Rendering/Segmentation with Open Inventor
The goal of this session is to show the improvements in quality, performance and flexibility of the volume rendering implementation of Open Inventor. The latest GPU techniques, such as virtual textures and ray casting, have been combined into a flexible shader API and applied to out-of-core data. The techniques of volume rendering, sugarcube rendering, basic and complex clipping, sculpting, editing and segmentation will be demonstrated using examples from a geobody extraction workflow. The ease and flexibility of the shader pipeline API will be illustrated, and we will discuss the broad future perspectives of that technology.
Speaker(s): Mike Heck (Technology Advisor, VSG)
Topic(s): Computer Graphics (Advanced)

TUESDAY, MAY 15, 15:30 (25 MINUTES)
ROOM A2
S0654 Fusion Energy Sciences & Computing at the Extreme Scale
The fusion energy sciences community has made excellent progress in developing advanced codes for which computer run-time and problem size scale well with the number of processors on massively parallel supercomputers. A good example is the effective usage of the full power of modern leadership class computational platforms from the terascale to the petascale and beyond to produce nonlinear particle-in-cell simulations which have accelerated progress in understanding the nature of plasma turbulence in magnetically confined high temperature plasmas. Illustrative results provide great encouragement for being able to include increasingly realistic dynamics in extreme-scale computing campaigns to enable predictive simulations with unprecedented physics fidelity.
Speaker(s): William Tang (Fusion Simulation Program at the Princeton Plasma Physics Laboratory (PPPL), Princeton)
Topic(s): Supercomputing (Intermediate)