UC Riverside Undergraduate Research Journal

More documents

Recommendations

Info

Computational Prediction of Association Free Energies for the C3d-CR2 Complex and Comparison to Experimental Data Alexander S. Cheung methionine of C3d which is an artifact added by the protein expression system. One of the chains in the PDB file, chain C, a CR2 molecule that is not in contact with C3d, was also removed. This is because chain C is irrelevant in this study, since it is known that CR2 behaves as a monomer in solution [7]. What remains is a C3d molecule, consisting of 306 amino acids, in contact with a CR2 molecule, consisting of 129 amino acids. We used the program SPDBV (Swiss Protein Data Bank Viewer) [17], version 3.7, to renumber the amino acids and atoms in the PDB file (displacing every atom by -1, resulting in a total of 435 amino acids consisting of 3399 atoms) and subsequently separate the two components of the complex to create one PDB file with C3d alone and one PDB file with CR2 alone. After separation, amino acids and atoms were renumbered for CR2 to begin at residue 307 and atom 2413. Thus three PDB files constitute the final output of this step: one PDB file consisting of the C3d-CR2 complex, one consisting of C3d, and one consisting of CR2. These three PDB files are considered the “parent” PDB files. By generating each of the component parent PDB files from the complex parent PDB file in this way, it is ensured that the atomic coordinates of each component of the complex in each of their respective component files are identical to their atomic coordinates in the complex file. This is crucial for the accurate calculation of free energy differences. Finally, we used the program WHATIF [18] to add the missing C-terminal oxygen atom of C3d in the C3d-CR2 complex. The third step (Fig. 2) was the construction of the 23 specific mutants. This is done using WHATIF, a home-made python script [19] that calls WHATIF, the three parent PDB files, and three input text files, each of which list the amino acid substitutions to be made in one of the three parent PDB files. Each of the 23 mutations had to be performed twice: once on the complex PDB file and again on the individual component file containing the mutation(s). Thus, the script was run three times, each time using as inputs one of the three parent PDB files and its corresponding input text file. For the purpose of consistency, each of the parent PDB files was also run through WHATIF manually without making any mutations. The outputs of this step are 23 mutant complex PDB files, 9 mutant C3d PDB files, 14 mutant CR2 PDB files, and the three parent PDB files (49 PDB files total). These 49 PDB files comprise 24 sets (parent and 23 mutants) of 3 PDB files each (C3d, CR2, complex). The fourth step (Fig. 2) was the removal of a WHATIF-specific header added to each PDB file in the last step, and the change of the nomenclature of C-terminal oxygens from O’’, which is recognizable by WHATIF, to OXT, which is recognizable by PDB2PQR (to be used in the next step). These two tasks are accomplished using home-made python scripts [19]. The fifth step (Fig. 2) involved the use of the program, PDB2PQR [20] 1.2.1, to prepare the coordinate PDB files for use with APBS (see below). Through the use of a home-made python script, each of the 49 PDB files was run through PDB2PQR. The outputs were 49 files in PQR format, each containing three-dimensional atomic coordinate data as well as charge and van der Waals radii assigned according to the PARSE parameter file [20]. The default options for debumping and hydrogen bond optimization were left on. Debumping refers to local optimization to eliminate unfavorable van der Waals clashes (overlap or partial overlap of atomic radii). The hydrogen bond network optimization algorithm assures that optimal hydrogen bonds are present by 180 o -flipping the rings of histidine or of planar amine groups of glutamines or asparagines. This option is necessary because electron densities from X-ray diffraction data do not discriminate between the 0 o - and 180 o -flip states of these amino acid side chains. The purpose of steps 1-5 described above is to create the proper input files for use with the program APBS (Adaptive Poisson-Boltzmann Solver) [22]. The sixth step was calculation of electrostatic potentials using Figure 3. Hypothetical thermodynamic cycle. Horizontal processes represent association in vacuum (top) and in solution (bottom). Vertical processes represent solvation of the components (left) and of the complex (right). Electrostatic potential surfaces are visualized at ±30 kT/e for association in vacuum (top) and at ±1 kT/e in solution (bottom). 16 <strong>UC</strong>R Un d e r g r a d u a t e Re s e a r c h Jo u r n a l
Computational Prediction of Association Free Energies for the C3d-CR2 Complex and Comparison to Experimental Data Alexander S. Cheung APBS. The inputs for an APBS calculation are a PQR file and an input text file with the calculation parameters. The electrostatic potential of the complex and of each of the individual components was calculated at two different conditions as shown in Fig. 3. For each APBS calculation, the protein or protein complex was embedded in a box with 129 × 129 × 129 grid points having coarse grid dimensions of 140 Ǻ × 110 Ǻ × 120 Ǻ and fine grid dimensions of 105 Ǻ × 85 Ǻ × 90 Ǻ. The grid dimensions were chosen by performing preliminary test calculations using structures of the complex to ensure that there was no truncation of the largest electrostatic potential when plotted at ±1 k B T/e. For each of the 24 structures, two sets of three calculations were performed, one set being in vacuum and the other in a realistic protein-solvent environment. Each of the two sets included calculations for the complex and the two individual components. The vacuum calculations were performed using the same dielectric constant for the protein interior and solvent (ε p = ε s = 2) and at ionic strength corresponding to 0 mM ionic strength. The proteinsolvent calculations were performed using low dielectric constant for the protein interior (ε p = 2) and high dielectric constant for the solvent (ε s = 78.5), and at ionic strength corresponding to 150 mM ionic strength. To eliminate grid artifacts when comparing the results of the calculations, the individual components, C3d and CR2, were each positioned in the grid exactly as they were in the C3d-CR2 complex, respectively. A home-made python script [19] was used to automatically perform a set of 24 calculations (for parent proteins and mutants). Each of the 24 calculations includes a subset of 6 calculations, as described above. Each of the 24 APBS calculations generates a file with the electrostatic potential matrix and a log (OUT; Fig. 2) file describing the calculation progress and providing the electrostatic free energy of association in solution and the solvation free energy difference. The seventh step (Fig. 2) was visualization of the electrostatic potentials using the program VMD (Visual Molecular Dynamics) [23] version 1.8.5 and data analysis using MATLAB (The Mathworks, Inc., Natick, MA) version r2007b. Distances between mutations and the association site contact residues were measured using SPDBV. In earlier stages of this study, we performed the calculations manually, making mutations with SPDBV and performing the conversion of the PDB files to PQR files using the online version of PDB2PQR version 1.3.0 (http://agave. wustl.edu/pdb2pqr/index.html). Variations in the calculated electrostatic free energies of association in solution (without solvation effects) between the script-based high-throughput protocol and the manual online-based protocol of up to 21% were observed. Variations of up to 3% were observed in the solvation free energy differences. Results Figure 3 describes the hypothetical thermodynamic cycle we used in our calculations of electrostatic free energies of association. The horizontal steps describe association in vacuum (top) and in solution (bottom) and the vertical steps describe solvation of the free components, C3d and CR2 (left), and of the C3d-CR2 complex (right). Table 1. List of mutations, calculated solvation free energy differences, calculated association free energies in solution, experimental binding ability data, previously calculated ionization free energy differences, and distances of mutated residues from the association site. <strong>UC</strong>R Un d e r g r a d u a t e Re s e a r c h Jo u r n a l 17
Page 1 and 2: University of California, Riverside
Page 3 and 4: UCR Undergraduate Research Journal
Page 5 and 6: Zero Waste Biodiesel: Using Glyceri
Page 13 and 14: Computational Prediction of Associa
Page 15: Computational Prediction of Associa
Page 23 and 24: Phosphorylation of Crk Adaptor Prot
Page 29 and 30: Augustan Era Policy on the Rhine Fr
Page 35 and 36: Fractal Strings and Number Theory:
Page 47 and 48: Motion Based Bird Sensing Using Fra
Page 53 and 54: Love a Son, Raise a Daughter: A Cro
Page 61 and 62: Mating-type distribution of the ric
Page 67 and 68:
Secondary Organic Aerosol (Soa) And
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Bacterium-Induced Fluorescence-Enha
Page 77 and 78:
Page 79 and 80:
Page 81 and 82:
show all

UC Riverside Undergraduate Research Journal

Create successful ePaper yourself

Delete template?

Save as template?