GTC 2012 Program Guide - GPU Technology Conference

More documents

Recommendations

Info

TUESDAY TUESDAY, MAY 15, 09:00 (50 MINUTES) ROOM C S0419A Optimizing Application Performance with CUDA Profiling Tools NVIDIA provides two powerful profiling tools that you can use to maximize your application’s performance. The NVIDIA Visual Profiler helps you understand your application’s behavior with a detailed timeline and data from GPU performance counters. The Visual Profiler also provides an automatic, data-driven analysis engine that provides suggestions on potential optimization strategies for your application. Nvprof is a command-line profiler that provides gprof-like functionality for the GPU. Nvprof provides summary information about where your application is spending the most time, so that you can focus your optimization efforts. This session will provide a step-by-step walk through of both of these profiling tools, showing how you can use these tools to identify optimization opportunities at the application, kernel, and source-line levels. This session will repeat Wednesday at 14:00 (S0419B). Speaker(s): David Goodwin (Software Engineer, NVIDIA) Topic(s): Development Tools & Libraries (Beginner) TUESDAY, MAY 15, 09:00 (25 MINUTES) ROOM J2 S0527 GPUs and the Next-Generation Aerial Surveillance Graphics processors are already used for computationally intensive video tasks in many ISR (Intelligence, Surveillance, Reconnaissance) applications; GPU-based system for video enhancement and analytics outperforms a similarly priced CPU-based system 5-to-1 at HD resolutions. Our initial tests on 64 megapixel Wide Area Aerial Surveillance (WAAS) data show at least 10x speedup with tasks such as super-resolution or moving target indication. In this talk, we’ll discuss unique design and implementation challenges of real-time processing of very large video data sets. We will demonstrate our existing GPU-based software, IKENA ISR, and discuss its video-processing pipeline and innovative processing solutions that are promising to dramatically expand capabilities of emerging aerial surveillance platforms. Speaker(s): Nikola Bozinovic (CTO, MotionDSP) Topic(s): General Interest (Beginner) TUESDAY, MAY 15, 09:00 (25 MINUTES) ROOM A1 S0607 High Performance 3D Perception The path to general purpose graphics programming was driven by computer graphics: the process of rendering 3d models into 2d viewpoints. With the advent of flexible programming of GPGPU processing, this process can be reversed. 3D perception is the problem of inferring structure and motion of the physical world from 2d and 3d measurements. In this talk, we will demonstrate the role GPGPU plays in a diverse set of applications in high speed 3d perception and discuss optimization of these techniques for the GPGPU. We also demonstrate several capabilities of future systems which are enabled by GPGPU technologies. Speaker(s): Chris Slaughter (President, University of Texas Perception, Lynx Labs) Topic(s): Computer Vision (Beginner) TUESDAY, MAY 15, 09:30 (25 MINUTES) ROOM J2 S0040 Introducing CUDA in KBE Applications for Digital Vehicle Development Programs Get the latest development in Next Generation Knowledge Based Engineering (KBE) software which provides real results over the traditional design approach. Today there exist numerous KBE applications in the field of vehicle ergonomics, suspension, NVH, safety, regulations etc which deal with huge number of iterations and mathematical algorithm. With GPU computing and CUDA the KBE kernel is restructured to incorporate parallel programming model which helps the applications run faster and achieving time reduction from hours to seconds. KBE geometry kernel also gets benefited by enabling CUDA in topology based operations which take lot of time when performed on CPU. Speaker(s): Avijit Santra (Project Manager, Knowledge Based Engineering, Tata Motors Limited) Topic(s): General Interest (Intermediate) TUESDAY, MAY 15, 09:30 (25 MINUTES) ROOM K S0083 Swift: A GPU-based Smith-Waterman Sequence Alignment Program This session describes Swift, a GPU-based Smith-Waterman implementation for aligning short DNA sequences to large genomes. Swift has been designed to reduce computation time and lower hardware cost. Also, unlike other leading GPU-based Smith-Waterman sequence alignment programs like CUDASW++ and SWCUDA which focus on protein sequence alignment, Swift has been developed for DNA sequence alignment. Swift performs 200x faster than CUDASW++ using a test data set containing 1000 reads (100 bases each) and 1000 references (1000 bases each), and it performs 11x faster than the CPU-based implementation of Smith-Waterman using 24 million reads (100 bases each) and human chromosome 1. Speaker(s): Pankaj Gupta (Bioinformatics Application Developer, St Jude Children’s Research Hospital) Topic(s): Bioinformatics (Beginner) TUESDAY, MAY 15, 09:30 (25 MINUTES) ROOM A7 S0258 Sailfish: Lattice Boltzmann Fluid Simulations with GPUs and Python Learn how Run-Time Code Generation (RTCG) techniques allowed for fast development of a lattice Boltzmann (LB) fluid dynamics solver called Sailfish. Sailfish is completely open source, supports a wide variety of LB models (single and multiple relaxation times, the entropic model; single and binary fluids) and can take advantage of multiple GPUs. Even though the project is written predominantly in Python, no performance compromises are made. This talk will introduce the basic design principles of Sailfish and illustrate how RTCG allows to exploit the power of GPUs with minimal programmer effort. Speaker(s): Michal Januszewski (PhD Student/Software Engineer, University of Silesia in Katowice/Google Switzerland) Topic(s): Computational Fluid Dynamics, Computational Physics, Development Tools & Libraries (Intermediate) TUESDAY, MAY 15, 9:30 (25 MINUTES) ROOM J3 S0329 Using GPUs to Speedup Computational Lithography In this paper we show how GPUs can be used to significantly speedup computational lithography, which is heavily used in the Electronic Design Automation (EDA) industry. In particular, we demonstrate a noticeable performance increase in several basic optical lithography algorithms as well as the speedup of the full-chip verification software, crucial parts of which were ported
to NVIDIA’s GPUs. We summarize the advantages, disadvantages and challenges of using GPUs and compare it to more traditional multithreading and distributed computing alternatives for the same applications. Speaker(s): Constantin Chuyeshov (Algorithm Engineer, Cadence Design Systems) Topic(s): Electronic Design Automation (Intermediate) TUESDAY, MAY 15, 09:30 (50 MINUTES) ROOM A1 S0404 Computer Vision Libraries with GPUs Learn how Computer Vision libraries can take advantage of GPUs. Computer Vision algorithms are extremely well suited for GPU architectures because they demand large computational power that GPUs offer over CPUs. This talk provides an overview of the different GPU libraries such as (OpenCV, GPUCV, PCL, and NPP Libraries) and online resources (GPU4Vision and OpeNVIDIA) available for developers today. Examples and demonstrations of practical applications making use of these libraries will also be shown throughout the talk. Speaker(s): Eric Young (Manager of Developer Technology Profesional and Consumer Applications, NVIDIA) Topic(s): Computer Vision, Audio, Image and Video Processing (Beginner) TUESDAY, MAY 15, 09:30 (50 MINUTES) ROOM B S0430 Developing Next-Generation CUDA Acceleration in Wolfram’s Mathematica with Parallel Nsight Since version 8, Mathematica offers advanced support for GPU acceleration with optimized CUDA functions and a built-in framework for developing scientific CUDA kernel code. In this session, the Wolfram development team will share their experience developing their next-generation CUDA support in Mathematica. From the unique ability of Parallel Nsight to attach its CUDA debugger to a running process, the new parallel Warp Watch for warp-wide variable views and expression evaluation, to the latest runtime CUDA profiling experiments; they will demonstrate how they were able to take advantage of Parallel Nsight to get the most out of CUDA and the GPU. Speaker(s): Abdul Dakkak (Kernel Developer, Wolfram), Sebastien Domine (Sr. Director, Software Engineering, Developer Tools, NVIDIA), Ulises Cervantel-Pimentel (Senior Kernel Developer, Wolfram) Topic(s): Development Tools & Libraries (Intermediate) TUESDAY, MAY 15, 09:30 (50 MINUTES) ROOM M S0618 Best Practices of a 800TFlop Hybrid Supercomputer Implementation (Presented by Appro) Learn about the “Frontier Computing System”, deployed by Appro for the University Of Tsukuba Center Of Computational Sciences in Japan containing over half a million GPU cores. Learn how reliability, availability, manageability and compatibility were essential for this successful 800TF hybrid supercomputing implementation. Explore new techniques in how HA-PACS is accelerating large scale parallel code by combining CPU/GPU processing cluster configurations for scientific research, such as astrophysics and climate modeling. Learn how to improve data I/O performance and memory size limitations in hybrid systems configured with Lustre File System offering the best performance per dollar and excellent memory capacity per/FLOP. Speaker(s): Taisuke Boku (Deputy Director of Center for Computational Sciences at University of Tsukuba), Steve Lyness (VP of HPC Solutions Engineering, Appro) Topic(s): Supercomputing, Astronomy & Astrophysics (Intermediate) TUESDAY, MAY 15, 09:30 (50 MINUTES) ROOM NVIDIA NSIGHT LAB S0800 NVIDIA Nsight Lounge Come to the NVIDIA Nsight Lounge to meet the Nsight development team! Whether you would like a private meeting to discuss specific product features or test out your application with the latest version of Nsight, or you just want to hang out with the team after attending one of the exciting training session, the lab is great place to learn everything you ever wanted to know about the tool. Speaker(s): NVIDIA Developer Tools Team Topic(s): Development Tools & Libraries (Beginner) TUESDAY, MAY 15, 10:00 (25 MINUTES) ROOM J2 S0013 GPUs for Fast Triggering in NA62 Experiment We discuss an approach for using commercial graphic processors (GPUs) at the earliest trigger stages in high-energy physics experiments, and study its implementation on a real trigger system in preparation. In particular we focus on the possibility to reconstruct rings in a Cherenkov detector as building block of a selective trigger condition for rare decay search. Latency and processing rate measurements on several state-of-the-art devices are presented, and the potential issues related to processing time jitter and data transfer throughput are discussed. Speaker(s): Gianluca Lamanna (Researcher, CERN), Marco Sozzi (Associate Professor, Physics Department of Pisa) Topic(s): General Interest (Intermediate) TUESDAY, MAY 15, 10:00 (25 MINUTES) ROOM A8 S0031 Unstructured Grid Numbering Schemes for GPU Coalescing Requirements Learn how to achieve high performance for computational fluid dynamics (CFD) solvers over unstructured grids using numbering schemes tailored for GPU coalescing requirements. Using these techniques, unstructured grid CFD solvers can make more effective use of memory bandwidth, which is an otherwise significant performance bottleneck that has so far led to relatively limited performance gains on GPUs in comparison to structured grid CFD solvers. Performance benchmarks will be shown using the Jet Engine Noise Reduction (JENRE) code. Speaker(s): Andrew Corrigan (Research Mathematician, Naval Research Laboratory), Johann Dahm (University of Michigan) Topic(s): Computational Fluid Dynamics, Algorithms & Numerical Techniques, Computational Physics (Advanced) TUESDAY, MAY 15, 10:00 (25 MINUTES) ROOM A7 S0251 RANS CFD Solver on Fermi SJTU-NS3D is an in-house CFD code co-developed by SJTU and COMAC for large civil airplane, solving 3D Reynolds Average Navier-Stokes (RANS) equations on structured grids by finite volume method, which could be used in designing wing model. In this talk, we will present the design and further optimization of CUDA version of SJTU-NS3D, and it achieves 20-fold speedup for standard M6 wing model and 37-fold speedup for wing model candidate from COMAC on single Fermi C2050. 29 CONFERENCE GUIDE TUESDAY
Page 1 and 2: PRESENTED BY PLATINUM SPONSORS MAY
Page 3 and 4: WELCOME TO GTC Dear GTC Attendees,
Page 5 and 6: IMPORTANT INFORMATION If there is a
Page 7 and 8: 1 3 6 10 20 23 27 47 69 83 103 145
Page 9 and 10: SUNDAY, MAY 13 08:30 to 17:35 InPar
Page 11 and 12: Welcome to NVIDIA’s Emerging Comp
Page 13 and 14: 14:00 to 14:50 S2003 Emerging Compa
Page 15 and 16: CEO ON STAGE LISTING BIODIGITAL Bio
Page 17 and 18: MERSIVE Since it was founded in 200
Page 19 and 20: REALVIEW IMAGING LTD. RealView Imag
Page 21 and 22: CONFERENCE GUIDE 19
Page 23 and 24: AGENDA WEDNESDAY, MAY 16 Plenary Se
Page 25 and 26: SESSION INFORMATION - PRE-CONFERENC
Page 27 and 28: models with companion tools (e.g. f
Page 29: SESSION INFORMATION TUESDAY, MAY 15
Page 33 and 34: Speaker(s): James Lin (Assistant Pr
Page 35 and 36: with extremely high memory bandwidt
Page 37 and 38: presenters showcasing research code
Page 39 and 40: TUESDAY, MAY 15, 15:00 (50 MINUTES)
Page 41 and 42: TUESDAY, MAY 15, 15:00 (50 MINUTES)
Page 43 and 44: TUESDAY, MAY 15, 16:00 (50 MINUTES
Page 45 and 46: Speaker(s): Mark Harris (Chief Tech
Page 47 and 48: Speaker(s): Nathan Bell (Senior Res
Page 49 and 50: SESSION INFORMATION WEDNESDAY, MAY
Page 51 and 52: WEDNESDAY, MAY 16, 09:30 (25 MINUTE
Page 55 and 56: collective behavior, helping reveal
Page 61: Speaker(s): Bjoern Landmann (Develo
Page 64 and 65: WEDNESDAY significantly beyond the
Page 66 and 67: WEDNESDAY Speaker(s): Cliff Woolley
Page 68 and 69: WEDNESDAY WEDNESDAY, MAY 16, 17:00
Page 71 and 72: SESSION INFORMATION THURSDAY, MAY 1
Page 73 and 74: THURSDAY, MAY 17, 09:30 (25 MINUTES
Page 75: applications. Furthermore guideline
Page 78 and 79: THURSDAY Speaker(s): Peter Zaspel (
Page 80 and 81:
THURSDAY porting a numerical code t
Page 82 and 83:
THURSDAY THURSDAY, MAY 17, 16:00 (5
Page 84 and 85:
GPU Consolidation and Virtualizatio
Page 86 and 87:
POSTER LISTINGS AN09 - Parallel Pro
Page 88 and 89:
POSTER LISTINGS accuracy. Beside th
Page 90 and 91:
with High SGI® Performance GPU NVI
Page 92 and 93:
POSTER LISTINGS will allow to inclu
Page 94 and 95:
POSTER LISTINGS CV08 - Orientation
Page 96 and 97:
Covering the fastest computers in t
Page 98 and 99:
POSTER LISTINGS mesh. To this end,
Page 100 and 101:
POSTER LISTINGS speed enhancement w
Page 102 and 103:
POSTER LISTINGS this work addresses
Page 104 and 105:
What you need to know. Now. Dr. Dob
Page 106 and 107:
SPEAKERS AND PANELISTS Arutyun Avet
Page 108 and 109:
SPEAKERS AND PANELISTS with emphasi
Page 110 and 111:
SPEAKERS AND PANELISTS computationa
Page 112 and 113:
SPEAKERS AND PANELISTS Iain Couzin
Page 114 and 115:
SPEAKERS AND PANELISTS Concept and
Page 116 and 117:
SPEAKERS AND PANELISTS GE as a GPGP
Page 118 and 119:
SPEAKERS AND PANELISTS h Session(s)
Page 120 and 121:
SPEAKERS AND PANELISTS purpose comp
Page 122 and 123:
SPEAKERS AND PANELISTS Osman Kent C
Page 124 and 125:
SPEAKERS AND PANELISTS active contr
Page 126 and 127:
SPEAKERS AND PANELISTS Rainald Lohn
Page 128 and 129:
SPEAKERS AND PANELISTS Naoya Maruya
Page 130 and 131:
SPEAKERS AND PANELISTS models, at A
Page 132 and 133:
SPEAKERS AND PANELISTS Taro Okamoto
Page 134 and 135:
SPEAKERS AND PANELISTS Loukas Petri
Page 136 and 137:
SPEAKERS AND PANELISTS Erwin Roth R
Page 138 and 139:
SPEAKERS AND PANELISTS Arroyo Video
Page 140 and 141:
SPEAKERS AND PANELISTS John Stone S
Page 142 and 143:
SPEAKERS AND PANELISTS Hoang-Tron M
Page 144 and 145:
SPEAKERS AND PANELISTS Elmar Westph
Page 146 and 147:
SPEAKERS AND PANELISTS Hongwei Zhou
Page 148 and 149:
SPONSORS AND EXHIBITORS PLATINUM SP
Page 150 and 151:
SPONSORS AND EXHIBITORS GOLD SPONSO
Page 152 and 153:
SPONSORS AND EXHIBITORS PLATINUM ME
Page 154 and 155:
SPONSORS AND EXHIBITORS Bright Comp
Page 156 and 157:
SPONSORS AND EXHIBITORS MBA Science
Page 158 and 159:
SPONSORS AND EXHIBITORS Polywell PQ
Page 160 and 161:
SPONSORS AND EXHIBITORS Ubitus Inc.
Page 162 and 163:
STAY EDUCATED! GTC is comprised of
Page 164:
FIRST FLOOR TO ST. CLAIRE HOTEL BAL
show all

GTC 2012 Program Guide - GPU Technology Conference

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?