# Molecular Solution

## Ramachandran plot

Glyala is one of the simplest molecules that exhibits some important features common to larger biomolecules.
In particular, it has more than one long-lived conformation, which we will identify in this exercise by mapping out its *potential energy surface*.

The conformations of glyala dipeptide are characterized by the dihedral angles of the backbone. Below, we color carbons in green, hydrogens in white, oxygen in red and nitrogen in blue, i.e. the torsional angle $\phi$ is N-C-C-N , while $\psi$ is C-N-C-C along the backbone.

tar -xvf glyala-epot.tar.gz

**TASK 1**Visualize the structure

`glyala.pdb`

with VMD and determine the atomic indices of the atoms defining the dihedral angles.
With this knowledge at hand, we will fix the dihedral angles and perform geometry optimization for all remaining degrees of freedom.

**TASK 2**

- The atomic indices defining the dihedral indices in the input file
`geo.in`

are missing. Replace`I1`

to`I4`

by the atomic indices determined previously.*Note:*While VMD starts counting atoms from 0, CP2K starts counting from 1, i.e. the VMD indices need to be increased by 1. - Use
`perform-gopt.sh`

to perform the grid of geometry optimizations. - Use gnuplot to plot the potential energy surface (we have provided a script
`epot.gp`

). Which are the two most favoured conformations?$ gnuplot

gnuplot > load "epot.gp"

## Water

We have prepared a CP2K input file `water.inp`

for running a MD simulation of liquid water using the force field from the first exercise (parametrized by Praprotnik et al.). Download water.tar.gz

Repeat the MD using initial temperatures 200 and 400 K. In order not to overwrite any of your previous files, it is advisable to run the new simulations in different folders.

**TASK 3**

- Check that the MD is energy conserving and
*well-behaved*. - What are the final average temperatures in each of the simulations? Why are they different from the initial ones?
- The initial atomic configuration stems from an equilibration run. At which temperature was the system (approximately) equilibrated?

Next we are going to analyze the trajectories in order to calculate the radial distribution function (rdf, $g(r)$) as a function of temperature.

VMD comes with an extension for exactly this purpose: In the VMD Main window open “Extensions → Analysis” click on “Radial Pair Distribution function $g(r)$”. In the appearing window use “Utilities → Set unit cell dimensions” to let VMD know the simulation box you used. After that use Selection 1 and 2 to define the atomic types that you want to calculate the rdf for, for example “element H”.

**TASK 4**

- Plot $g_{O-O}(r)$ at 200, 300 and 400 K into the same graph.
- What are the differences in the height of the first peak?
- What does this say about the structure of the liquid and is this expected?
- Compare to experimental data
`goo.ALS`

taken at 300 K.

Then we will calculate diffusion coefficient. It is a proportionality constant between the molar flux due to molecular diffusion and the gradient in the concentration of the species (or the driving force for diffusion), which is defined by:
**$6D=\lim_{t\to\infty} \ \frac{\delta <r^2(t)>}{\delta t}$**

To evaluate this expression, all that is needed is to evaluate at each point in time in the calculation the average of the square of the distance that each atom has traveled since the start of the production phase of the dynamics, and examining the slope of this function at a long time. By storing the initial coordinates, it is straightforward to evaluate the square of the distance. However, some care is needed due to the use of periodic boundary conditions: the program stores x, the coordinates, but in many programs, during the dynamics, if any atom has its x, y, or z coordinate become larger than box size or smaller than zero, then it is moved back into the other side of the box. This has the effect of making the raw distance traveled meaningless. The value of D is obtained from the slope, at a long time, of the right-hand side of the above equation (you need to divide by six to obtain D, take the slope, and also be careful with units).

VMD comes with an extension for exactly this purpose: In the VMD Main window open “Extensions → Analysis” click on “RMSD Trajectory Tool”. In the appearing window use “all” to let VMD know the molecule you want to track. Tick “Plot”, and press “RMSD”, you will have the RMSD plot for the water system.

**TASK 5**

- Plot RMSD for the water at 200K, 300K, 400K.
- Calculate their corresponding diffusion coefficients, are they expected?

## Glyala in water

Now, we will move to a more realistic system - Glyala in water. We will preformed a MD of glyala in water and save the trajectory.

The initial geometry provided in the PDB file is a glyala molecule solvated by 73 water molecules. The geometry is not equilibrated. You need first to equilibrate the system at 300K. When the system is equilibrated, you need to analysis the result.

**TASK 6**

- Perform the molecular dynamics simulation using NVT ensemble at 300K.
- Re-run the calculation using NVT ensemble with different TIMECON (500, 2000 fs) in the &THERMOSTAT section, and plot the total energy, temperature against time. Explain what you observe.
- Determine from which step the system is equilibrated, plot the calculated properties and explain why.
- Compute the O-O radial distribution function for water with acceptable statistics using 20 ps (after equilibration) of simulated time.

**Tip for O-O RDF for water**

In last exercise, one already knew how to calculate the RDF for the Argon system. In TASK3, you need to calculate the RDF only for water instead of whole system. Since the glyala contain two oxygen atoms, it is not reasonable to include the oxygen atoms in glyala molecule if we are only interested in O-O RDF for water. Using VMD, the O-O RDF for the water can be easily calculated. In the

Selection 1, Selection 2

, one need to specify

element O and not same residue as element C

The frames should start from the beginning of production run.