
Amanda Hughes, a PhD student of Dr. Van Briesen, had developed a MATLAB code that could, on her laptop, run 330 MC simulations before running out of memory. Over the last quarter, her original serial MATLAB code was ported to the MATLAB Star-P platform on Pople and parallelized. As part of the porting process:

1) The original code structure was thoroughly reviewed.
2) The code was restructured so that the parallelizable parts were consolidated into a single for loop.
3) During the restructuring, additional vectorization and other code tweaks were performed to improve the serial performance.
4) Finally, the code was parallelized with Star-P by moving the contents of the parallelizable for loop into a function and calling ppeval on that function.

This has enabled simulations with up to 10,000 MC runs. The scaling of computational time with increasing core counts was also found to be very satisfactory.

Note that because the code was restructured to put everything to be parallelized into a single for loop, it will be trivial to run it in parallel on other parallel MATLAB platforms. For example, with the MathWorks Parallel Computing Toolbox, no ppeval call is needed; the for loop simply needs to be replaced by a parfor loop.
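The restructuring pattern itself is not specific to Star-P: isolate the independent iterations in a single loop, move the loop body into a function, and let the parallel runtime distribute the iterations. The actual code is MATLAB, so the sketch below is only a loose analogue of that pattern in C with OpenMP; the function name, variable names, and workload are hypothetical placeholders.

#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for one Monte Carlo run: each call is
 * independent of the others, which is what makes the single
 * consolidated loop parallelizable. */
static double run_one_mc(int run_id)
{
    double acc = 0.0;
    for (int i = 1; i <= 1000; i++)
        acc += 1.0 / (run_id + i);
    return acc;
}

int main(void)
{
    const int n_runs = 10000;               /* number of MC runs */
    double *results = malloc(n_runs * sizeof *results);

    /* All parallelizable work sits in this one loop; the OpenMP
     * directive plays the role that ppeval/parfor plays in MATLAB. */
    #pragma omp parallel for
    for (int r = 0; r < n_runs; r++)
        results[r] = run_one_mc(r);

    printf("first result: %f\n", results[0]);
    free(results);
    return 0;
}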

PI: Brasseur (Penn State, Biophysics). Villous Motility as a Critical Mechanism for Efficient Nutrient Absorption in the Small Intestine. Continuing through 03/11. Lonnie Crosby (NICS), for computational work, and Amit Chourasia (SDSC), for visualization work, are involved in this ASTA project. This quarter, effort focused on the computational work by Lonnie. The group's code, Intestine3D, initially produced a large amount of metadata traffic on the file system. This traffic was due to the I/O pattern employed by the application, in which files are opened and closed for each read/write operation. Twelve files were identified that could benefit from remaining open for the duration of the application's runtime. This change decreased the application's runtime by 33%.
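The I/O change itself is straightforward to illustrate. The sketch below is not the Intestine3D source; using a hypothetical file name and placeholder data, it contrasts the open/close-per-write pattern that generated the metadata traffic with the keep-the-file-open pattern that replaced it.

#include <stdio.h>

#define NSTEPS 1000

/* Original pattern: every write opens and closes the file,
 * generating a metadata operation on each of the NSTEPS iterations. */
static void write_open_close(void)
{
    for (int step = 0; step < NSTEPS; step++) {
        FILE *fp = fopen("diag.dat", "a");   /* hypothetical file name */
        fprintf(fp, "%d %f\n", step, step * 0.5);
        fclose(fp);
    }
}

/* Revised pattern: the file stays open for the whole run,
 * so the metadata server is touched only on open and close. */
static void write_keep_open(void)
{
    FILE *fp = fopen("diag.dat", "w");
    for (int step = 0; step < NSTEPS; step++)
        fprintf(fp, "%d %f\n", step, step * 0.5);
    fclose(fp);
}

int main(void)
{
    write_open_close();   /* slow: NSTEPS open/close pairs */
    write_keep_open();    /* fast: one open/close pair */
    return 0;
}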

A full performance profile of the application was generated and sent to members of the project. This profile identified an inherent load imbalance in the application; addressing it could improve performance by an estimated 10%. Additionally, implementing MPI-IO parallel I/O to produce concatenated binary files, instead of individual (one per process or group) ASCII files, could improve application performance by about 10%. Suggestions and instructions for implementing MPI-IO parallel I/O were sent to members of the project.
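As a minimal sketch of the MPI-IO suggestion (assuming a fixed-size block of doubles per rank and a hypothetical output file name, not the project's actual data layout), each rank writes its block at a rank-dependent offset into one shared binary file with a collective call, instead of each rank writing its own ASCII file:

#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* Hypothetical per-rank output: a fixed-size block of doubles. */
    const int nlocal = 1024;
    double *buf = malloc(nlocal * sizeof *buf);
    for (int i = 0; i < nlocal; i++)
        buf[i] = rank + 0.001 * i;

    /* All ranks open one shared file instead of one file per rank. */
    MPI_File fh;
    MPI_File_open(MPI_COMM_WORLD, "output.bin",
                  MPI_MODE_CREATE | MPI_MODE_WRONLY, MPI_INFO_NULL, &fh);

    /* Each rank writes its block at a rank-dependent byte offset,
     * producing a single concatenated binary file. */
    MPI_Offset offset = (MPI_Offset)rank * nlocal * sizeof(double);
    MPI_File_write_at_all(fh, offset, buf, nlocal, MPI_DOUBLE,
                          MPI_STATUS_IGNORE);

    MPI_File_close(&fh);
    free(buf);
    MPI_Finalize();
    return 0;
}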

PI: Engel (U. North Carolina, Physics). Systematics of Nuclear Surface Vibrations in Deformed Nuclei. Continuing through 03/11. ASTA staff for this project are Meng-Shiou Wu (NICS) and Victor Eijkhout (TACC). During the first quarter of this ASTA project, the focus was on gathering detailed specifications and understanding the structure of the project's code. Meng-Shiou has been working with the group to explore possible approaches to improving the code's efficiency on Kraken. Jointly with the PI's group, they identified and discussed why the group's approach of using ScaLAPACK was not working and what the possible options are for performing diagonalization on a multi-core XT5 node. Both a shared-memory approach (using threaded libraries) and a distributed-memory approach (using a sub-communicator and redesigning the memory management in the code) were discussed. Several scientific libraries that support multi-core architectures were tested (Cray's LibSci, AMD's ACML, and ATLAS), but very limited or no performance improvement was observed. Work is currently focused on integrating the group's code with a code segment, provided by another research team, that uses master-slave style programming with an MPI sub-communicator.
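The contributed code segment itself is not reproduced in this report. As a generic illustration of the sub-communicator idea, the sketch below splits MPI_COMM_WORLD with MPI_Comm_split so that rank 0 acts as a master while the remaining ranks form a worker sub-communicator in which, for example, a distributed diagonalization could run; the split policy and all names are illustrative assumptions.

#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int world_rank, world_size;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    /* Rank 0 is the master; all other ranks join a worker
     * sub-communicator (color 1) where the distributed work
     * would be carried out. */
    int color = (world_rank == 0) ? 0 : 1;
    MPI_Comm subcomm;
    MPI_Comm_split(MPI_COMM_WORLD, color, world_rank, &subcomm);

    if (world_rank == 0) {
        printf("master: coordinating %d workers\n", world_size - 1);
    } else {
        int sub_rank, sub_size;
        MPI_Comm_rank(subcomm, &sub_rank);
        MPI_Comm_size(subcomm, &sub_size);
        printf("worker %d of %d in sub-communicator\n", sub_rank, sub_size);
    }

    MPI_Comm_free(&subcomm);
    MPI_Finalize();
    return 0;
}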
