Aims

The aim of this tutorial is to introduce users to the PLUMED syntax. We will write a few simple collective variables and use them to analyze existing trajectories.

Objectives

Once this tutorial is completed students will be able to:

Write a simple PLUMED input file and use it with the PLUMED driver to analyze a trajectory.
Use the GROUP keyword to make the input file compact and easy to read and to quickly build complex atom groups.
Print collective variables such as distances (DISTANCE), torsional angles (TORSION), gyration radius (GYRATION), and coordination numbers (COORDINATION) using the PRINT action.
Compute the geometric center of a group of atoms using CENTER.
Know how to take care of periodic boundary conditions within PLUMED using WHOLEMOLECULES and WRAPAROUND, and be able to verify the result with DUMPATOMS.
Extract from a trajectory snapshots satisfying specific conditions using UPDATE_IF.

Resources

The TARBALL for this project contains the following files:

ref.pdb: A PDB file with an RNA duplex solvated in a water box and a Mg ion.
traj-whole.xtc: A trajectory for the same system in GROMACS xtc format. To make the exercise easier, the RNA duplex has already been made whole.
traj-broken.xtc: The same trajectory as it was originally produced by GROMACS. Here the RNA duplex is broken and should be fixed.

This tutorial has been tested on a pre-release version of PLUMED 2.4. However, it should not rely on any 2.4-only feature, and should therefore also work with version 2.3.

Also notice that in the .solutions directory of the tarball you will find correct input files. Please only look at these files after you have tried to solve the problems yourself.

Introduction

This tutorial asks you to compute a variety of collective variables using PLUMED for a particular trajectory and to compare the files and graphs that you obtain with the correct ones that are shown online. Compared to some of the other tutorials that are available here, this tutorial contains considerably less guidance, so in doing it you will have to learn how to consult the manual. If you would like a more guided introduction to PLUMED, it might be better to start with the tutorials Belfast tutorial: Analyzing CVs or MARVEL tutorial: Analyzing CVs. Also notice that, although this tutorial was tested using a pre-release version of PLUMED 2.4, it should be completely feasible using PLUMED 2.3.

Using PLUMED from the command line

As we will see later, PLUMED provides a library that can be combined with multiple MD codes. However, in this tutorial we will only use PLUMED to analyze trajectories that have been produced already. Once PLUMED is installed you can run a plumed executable that can be used for multiple purposes:

> plumed --help

Here we will use the driver tool, which allows you to process an already existing trajectory.

> plumed driver --help

What we will need is:

A trajectory to be analyzed (provided).
A PLUMED input file (you will write it!).

The syntax of the PLUMED input file is the same that we will use later to run enhanced sampling simulations, so everything that you learn now will be useful later when you run PLUMED coupled to an MD code. In the following we are going to see how to write an input file for PLUMED.

The structure of a PLUMED input file

The main goal of PLUMED is to compute collective variables, which are complex descriptors that can be used to analyze a conformational change or a chemical reaction. This can be done either on the fly, that is during molecular dynamics, or a posteriori, using PLUMED as a post-processing tool. In both cases one should create an input file with a specific PLUMED syntax. A sample input file is below:

# this is optional and tells VIM that this is a PLUMED file
# vim: ft=plumed
# see comments just below this input file
# Compute distance between atoms 1 and 10.
# Atoms are ordered as in the trajectory files and their numbering starts from 1.
# The distance is called "d" for future reference.
d: DISTANCE ATOMS=1,10
# Create a virtual atom in the center between atoms 20 and 30.
# The virtual atom only exists within PLUMED and is called "center" for future reference.
center: CENTER ATOMS=20,30
# Compute the torsional angle between atoms 1, 10, 20, and center.
# Notice that virtual atoms can be used as real atoms here.
# The angle is called "phi" for future reference.
phi: TORSION ATOMS=1,10,20,center
# Compute some function of previously computed variables.
# In this case we compute the cosine of angle phi and we call it "d2"
d2: MATHEVAL ...
   ARG=phi FUNC=cos(x)
   PERIODIC=NO
...
# The previous command has been split in multiple lines.
# It could have been equivalently written in a single line:
# d2: MATHEVAL ARG=phi FUNC=cos(x) PERIODIC=NO
# Print d and d2 every 10 steps on a file named "COLVAR1".
PRINT ARG=d,d2 STRIDE=10 FILE=COLVAR1
# Print phi on another file named "COLVAR2" every 100 steps.
PRINT ARG=phi STRIDE=100 FILE=COLVAR2

Note: If you are a VIM user, you might find it convenient to configure the PLUMED syntax files, see Using VIM syntax file. Syntax highlighting is particularly useful for beginners since it allows you to identify simple mistakes without the need to run PLUMED. In addition, VIM has a full dictionary of available keywords and can help you by autocompleting your commands.

In the input file above, each line defines a so-called action. An action could either compute a distance, or the center between two or more atoms, or print some value on a file. Each action supports a number of keywords, whose values are specified in the input. Action names are highlighted in green and, by clicking on them, you can go to the corresponding page in the manual that contains a detailed description of each keyword. Actions that support the keyword STRIDE are those that determine how frequently things are to be done. Notice that the default value for STRIDE is always 1. In the example above, if the STRIDE keywords were omitted, the corresponding COLVAR files would be written for every frame of the analyzed trajectory. All the other actions in the example above do not support the STRIDE keyword and are only calculated when requested. That is, d and d2 will be computed every 10 frames. Since d2 depends on phi, phi will also be computed every 10 frames, although it is only printed every 100 frames. In short, you can think that for every snapshot in the trajectory that you are analyzing PLUMED is going to execute all the listed actions, though some of them are optimized out when STRIDE is different from 1.
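
The effect of STRIDE on the output files can be mimicked with a few lines of Python. This is only an illustrative sketch, not PLUMED code: the helper name printed_frames and the 250-frame trajectory are hypothetical, and the sketch assumes that frames are numbered from 0 and that an action acts whenever the frame number is a multiple of its stride.

```python
def printed_frames(n_frames, stride=1):
    """Return the frame numbers at which an action with this STRIDE acts.

    Assumption: frames are numbered from 0 and the action is triggered
    whenever the frame number is a multiple of the stride.
    """
    if stride < 1:
        raise ValueError("stride must be a positive integer")
    return [frame for frame in range(n_frames) if frame % stride == 0]


# Hypothetical trajectory with 250 frames.
colvar1_frames = printed_frames(250, stride=10)   # frames written to COLVAR1
colvar2_frames = printed_frames(250, stride=100)  # frames written to COLVAR2
print(len(colvar1_frames), colvar2_frames)
```

With these assumptions COLVAR1 would receive one line every tenth frame, while COLVAR2 would only receive the frames that are multiples of 100.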

Also notice that PLUMED works using kJ/mol, nm, and ps as energy, length, and time units. This can be customized using UNITS, but here we will stay with the default values.

Variables should be given a name (in the example above, d, phi, and d2), which is then used to refer to these variables. Instead of a: DISTANCE ATOMS=1,2 you might equivalently use DISTANCE ATOMS=1,2 LABEL=a. Lists of atoms should be provided as comma-separated numbers, with no spaces. Virtual atoms can be created and assigned a name for later use.
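
Besides plain comma-separated numbers, PLUMED accepts ranges such as 1-5 and, as far as we understand from the Groups and Virtual Atoms page, strided ranges such as 1-10:3. The following Python sketch shows what such a list expands to. The function expand_atom_list is a hypothetical helper written for this illustration, it is not part of PLUMED, and it only handles numeric entries (not labels of groups or virtual atoms).

```python
def expand_atom_list(spec):
    """Expand an atom list such as "1-5,10,20-30:5" into serial numbers.

    Supported forms (numeric entries only):
      "7"       a single serial number
      "1-5"     an inclusive range
      "1-10:3"  an inclusive range walked with a stride
    """
    atoms = []
    for item in spec.split(","):
        bounds, _, stride = item.partition(":")
        first, _, last = bounds.partition("-")
        start = int(first)
        stop = int(last) if last else start
        step = int(stride) if stride else 1
        atoms.extend(range(start, stop + 1, step))
    return atoms


print(expand_atom_list("1-5"))
print(expand_atom_list("1,10"))
print(expand_atom_list("1-10:3"))
```

Writing the expansion out explicitly like this can be a quick way to double check a long selection before passing it to an action.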

You can find more information on the PLUMED syntax on the Getting Started page of the manual. The complete documentation for all the supported collective variables can be found on the Collective Variables page.

To analyze the trajectory provided here, you should:

Create a PLUMED input file with a text editor (let us call it plumed.dat) similar to the one above.
Run the command plumed driver --mf_xtc traj.xtc --plumed plumed.dat.

Here traj.xtc is the trajectory that you want to analyze. Notice that driver can read multiple file formats using embedded molfile plugins from VMD (that's where the mf letters come from).

Notice that you can also visualize trajectories with VMD directly. For example, the trajectory traj-whole.xtc can be visualized with the command vmd ref.pdb traj-whole.xtc.

In the following we will practice computing and printing collective variables.

Exercise 1: Computing and printing collective variables

Analyze the traj-whole.xtc trajectory and produce a colvar file with the following collective variables.

The gyration radius of the solute RNA molecule (GYRATION). Look in the ref.pdb file to find which atoms are part of the RNA (search for the first occurrence of a water molecule, residue name SOL). Remember that you don't need to list all the atoms: instead of ATOMS=1,2,3,4,5 you can write ATOMS=1-5.
The torsional angle (TORSION) corresponding to the glycosidic chi angle \(\chi\) of the first nucleotide. Since this nucleotide is a purine (guanine), the proper atoms to compute the torsion are O4' C1' N9 C4. Find their serial numbers in the ref.pdb file or learn how to select a special angle by reading the MOLINFO documentation.
The total number of contacts (COORDINATION) between all RNA atoms and all water oxygen atoms. For COORDINATION, set the reference distance R_0 to 2.5 Å (be careful with units!!). Try to be smart in selecting the water oxygen atoms without listing all of them explicitly.
Same as before but against water hydrogen atoms. Also in this case you should be smart in selecting the water hydrogen atoms. The documentation of GROUP might help.
Distance between the Mg ion and the geometric center of the RNA duplex (use CENTER and DISTANCE).
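
To get a feeling for what a coordination number measures, the following Python sketch sums a switching function over all pairs of atoms from two groups. It is a simplified illustration and not PLUMED's implementation: it assumes the rational switching function (1-(r/r_0)^n)/(1-(r/r_0)^m) with n=6 and m=12, which we believe are the COORDINATION defaults, it ignores periodic boundary conditions, and the coordinates and the 3 Å reference distance are made up for the example.

```python
import math


def angstrom_to_nm(value):
    """Convert a length from Angstrom to nm (PLUMED's default length unit)."""
    return value / 10.0


def switching(r, r_0, nn=6, mm=12):
    """Rational switching function (1-(r/r_0)^nn)/(1-(r/r_0)^mm)."""
    x = r / r_0
    if math.isclose(x, 1.0):
        return nn / mm  # limit of the 0/0 expression at r = r_0
    return (1.0 - x**nn) / (1.0 - x**mm)


def coordination(group_a, group_b, r_0):
    """Sum the switching function over all A-B pairs (no periodic images)."""
    return sum(switching(math.dist(a, b), r_0) for a in group_a for b in group_b)


r_0 = angstrom_to_nm(3.0)  # hypothetical reference distance, now in nm
# Hypothetical coordinates in nm, for illustration only.
solute = [(0.0, 0.0, 0.0)]
oxygens = [(0.12, 0.0, 0.0), (0.3, 0.0, 0.0), (1.2, 0.0, 0.0)]
print(coordination(solute, oxygens, r_0))
```

A pair much closer than r_0 contributes almost 1, a pair exactly at r_0 contributes 0.5, and a distant pair contributes almost nothing, so the sum behaves like a smooth count of contacts. Passing a distance in Å where nm is expected would shift the result by an order of magnitude in r_0, which is why the unit conversion is made explicit here.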

Notice that some of the atom selections can be made in an easier manner by using the MOLINFO keyword with a proper reference PDB file. Also read carefully the Groups and Virtual Atoms page before starting. Here you can find a sample plumed.dat file that you can use as a template. Whenever you see a highlighted __FILL__ string, this is a string that you should replace.

# First load information about the molecule.
MOLINFO __FILL__
# Notice that this is a special kind of "action" ("setup action")
# that is only used during setup. It will not be re-executed at each step.
# Define some group that will make the rest of the input more readable
# Here are the atoms belonging to RNA.
rna: GROUP ATOMS=1-258
# This is the Mg ion. A group with one atom is also useful!
mg: GROUP ATOMS=6580
# This group should contain all the atoms belonging to water molecules.
wat: GROUP ATOMS=__FILL__
# Select water oxygens only:
owat: GROUP __FILL__
# Select water hydrogens only:
hwat: GROUP __FILL__
# Compute gyration radius:
r: GYRATION ATOMS=__FILL__
# Compute the Chi torsional angle:
c: TORSION ATOMS=__FILL__
# Compute coordination of RNA with water oxygens
co: COORDINATION GROUPA=rna GROUPB=owat R_0=__FILL__
# Compute coordination of RNA with water hydrogens
ch: COORDINATION GROUPA=rna GROUPB=hwat __FILL__
# Compute the geometric center of the RNA molecule:
ce: CENTER ATOMS=__FILL__
# Compute the distance between the Mg ion and the RNA center:
d: DISTANCE ATOMS=__FILL__
# Print the collective variables on COLVAR file
# No STRIDE means "print for every step"
PRINT ARG=r,c,co,ch,d FILE=COLVAR

Once your plumed.dat file is complete, you can use it with the following command

> plumed driver --plumed plumed.dat --mf_xtc traj-whole.xtc

Scroll in your terminal to read the PLUMED log. As you can see, PLUMED gives a lot of feedback about the input that it is reading. This is the place where you can check whether PLUMED understood your input correctly.

The command above will create a file COLVAR like this one:

#! FIELDS time r c co ch d
#! SET min_c -pi
#! SET max_c pi
 0.000000 0.788694 -2.963150 207.795793 502.027244 0.595611
 1.000000 0.804101 -2.717302 208.021688 499.792595 0.951945
 2.000000 0.788769 -2.939333 208.347867 500.552127 1.014850
 3.000000 0.790232 -2.940726 211.274315 514.749124 1.249502
 4.000000 0.796395 3.050949 212.352810 507.892198 2.270682

Notice that the first line informs you about the content of each column and the second and third lines tell you that variable c (the \(\chi\) torsion) is defined between \(-\pi\) and \(+\pi\).
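
If you prefer to inspect the file with a script, the header described above is easy to parse. The following Python sketch reads the FIELDS and SET lines and stores each column under its name. The function read_colvar is a hypothetical helper written for this tutorial text, and the sample string simply repeats the first lines of the file shown above.

```python
def read_colvar(lines):
    """Parse COLVAR-style text into (fields, settings, columns).

    `lines` is any iterable of strings, for instance an open file.
    """
    fields, settings, rows = [], {}, []
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        if tokens[0] == "#!":
            if len(tokens) > 2 and tokens[1] == "FIELDS":
                fields = tokens[2:]
            elif len(tokens) > 3 and tokens[1] == "SET":
                settings[tokens[2]] = tokens[3]
            continue
        if tokens[0].startswith("#"):
            continue  # ordinary comment line
        rows.append([float(token) for token in tokens])
    columns = {name: [row[i] for row in rows] for i, name in enumerate(fields)}
    return fields, settings, columns


sample = """\
#! FIELDS time r c co ch d
#! SET min_c -pi
#! SET max_c pi
 0.000000 0.788694 -2.963150 207.795793 502.027244 0.595611
 1.000000 0.804101 -2.717302 208.021688 499.792595 0.951945
"""
fields, settings, columns = read_colvar(sample.splitlines())
print(fields, settings, columns["r"])
```

To read a real file, the same function can be called as read_colvar(open("COLVAR")).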

If you obtain different numbers, check your input: you might have made a mistake!

This file can then be plotted with gnuplot

gnuplot> p "COLVAR" u 1:2, "" u 1:3

As a final note, look at what happens if you run the exercise twice. The second time, PLUMED will back up the previously produced file so as not to overwrite it. You can also concatenate your files by using the action RESTART at the beginning of your input file.

To learn more: Combining collective variables

In this first exercise we only computed simple functions of the atomic coordinates. PLUMED is very flexible and allows you to also combine these functions to create more complicated variables. These variables can be useful when you want to describe a complex conformational change. PLUMED implements a number of functions that can be used to this aim that are described in the page Functions. Look at the following example:

# Distance between atoms 1 and 2:
d1: DISTANCE ATOMS=1,2
# Distance between atoms 1 and 3:
d2: DISTANCE ATOMS=1,3
# Distance between atoms 1 and 4:
d3: DISTANCE ATOMS=1,4
# Compute the sum of the squares of those three distances:
c: COMBINE ARG=d1,d2,d3 POWERS=2,2,2 PERIODIC=NO
# Sort the three distances:
s: SORT ARG=d1,d2,d3
# Notice that SORT creates a compound object with three components:
# s.1: the smallest distance
# s.2: the middle distance
# s.3: the largest distance
# Compute the product of the three distances:
p: MATHEVAL ARG=d1,d2,d3 FUNC=x*y*z PERIODIC=NO
# Print the sum of the squares and the largest among the three distances:
PRINT FILE=COLVAR ARG=c,s.3

In case you have many distances to combine you can also use regular expressions to select them using ARG=(d.), see Regular Expressions.

Notice that for many functions you should tell PLUMED whether the function is periodic. See Functions for a detailed explanation of how to choose this keyword.
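
Periodicity matters whenever two values are compared or subtracted. As an illustration, the following Python sketch computes the difference between two values of a variable defined between \(-\pi\) and \(+\pi\), using two consecutive values of the \(\chi\) torsion from the COLVAR file of Exercise 1. The helper periodic_difference is hypothetical and only meant to show the idea, it is not how PLUMED is implemented.

```python
import math


def periodic_difference(a, b, lower=-math.pi, upper=math.pi):
    """Shortest signed difference a - b for a variable periodic on [lower, upper)."""
    period = upper - lower
    delta = (a - b) % period      # now in [0, period)
    if delta >= period / 2:
        delta -= period           # map to [-period/2, period/2)
    return delta


# Two consecutive chi values from the COLVAR file of Exercise 1.
chi_before, chi_after = -2.940726, 3.050949
naive = chi_after - chi_before
wrapped = periodic_difference(chi_after, chi_before)
print(naive, wrapped)
```

The naive difference is close to 6 rad, whereas the two angles are actually less than 0.3 rad apart on the circle. A function whose output is not declared periodic is treated like the naive case.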

You might think that it is easier to combine the variables after you have written them already, using, e.g., an awk or python script. That's fine if you are analyzing a trajectory. However, as we will learn later, by computing variables within PLUMED you will be able to add bias potentials on those combinations, influencing their dynamics. Actually, you could implement any arbitrarily complex collective variable using just DISTANCE and MATHEVAL! Anyway, if the CV combinations that you want to use can be computed easily with some external program, do it and compare the results with the output of the PLUMED driver.
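
As an example of such an external cross-check, the following Python sketch reproduces the three combinations of the input above (sum of squares, sorted values, and product) for a set of made-up coordinates. The functions combine and sort_components are hypothetical helpers that mirror our reading of COMBINE (a sum of coefficients times arguments raised to powers) and SORT. The coordinates are not taken from the tutorial trajectory and periodic boundary conditions are ignored.

```python
import math


def combine(values, powers=None, coefficients=None):
    """Return sum_i c_i * v_i**p_i (powers and coefficients default to 1)."""
    powers = powers or [1.0] * len(values)
    coefficients = coefficients or [1.0] * len(values)
    return sum(c * v**p for v, p, c in zip(values, powers, coefficients))


def sort_components(values):
    """Return {"1": smallest, ..., "N": largest}, like the s.1 ... s.N components."""
    return {str(i + 1): v for i, v in enumerate(sorted(values))}


# Hypothetical coordinates in nm for atoms 1 to 4.
xyz = {1: (0.0, 0.0, 0.0), 2: (0.3, 0.0, 0.0), 3: (0.0, 0.4, 0.0), 4: (0.0, 0.0, 0.1)}
d1, d2, d3 = (math.dist(xyz[1], xyz[k]) for k in (2, 3, 4))

c = combine([d1, d2, d3], powers=[2, 2, 2])  # sum of the squares
s = sort_components([d1, d2, d3])            # s["1"] smallest, s["3"] largest
p = d1 * d2 * d3                             # product, as in FUNC=x*y*z
print(c, s["3"], p)
```

Applied to distances read from a COLVAR file, the same helpers would give numbers that can be compared line by line with the combinations printed by the driver.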

Exercise 1b: Combining collective variables

As an optional exercise, create a file with the following quantities:

The sum of the distances between Mg and each of the phosphorus atoms.
The distance between Mg and the closest phosphorus atom.

Notice that the serial numbers of the phosphorus atoms can be easily extracted using the following command

> grep ATOM ref.pdb | grep " P " | awk '{print $2}'

Here's a template input file to be completed by you.

# First load information about the molecule.
MOLINFO __FILL__
# Define some group that will make the rest of the input more readable
mg: GROUP ATOMS=6580 # a group with one atom is also useful!
# Distances between Mg and phosphorus atoms:
d1: DISTANCE ATOMS=mg,33
d2: DISTANCE __FILL__
__FILL__
d6: DISTANCE __FILL__
# You can use serial numbers, but you might also use MOLINFO strings
# Compute the sum of these distances
c: COMBINE __FILL__
# Compute the distance between Mg and the closest phosphorus atom
s: SORT __FILL__
# Print the requested variables
PRINT FILE=COLVAR __FILL__
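
The serial numbers needed in this template can also be collected with a short Python script instead of grep and awk. The sketch below relies on the fixed-column layout of PDB files (serial number in columns 7-11, atom name in columns 13-16). Both helper functions are hypothetical, and the records used in the demonstration are made up rather than copied from ref.pdb.

```python
def atom_serials(pdb_lines, atom_name):
    """Return serial numbers of ATOM records whose atom name matches.

    Uses the fixed-column PDB layout: serial in columns 7-11 and atom name
    in columns 13-16 (1-based).
    """
    serials = []
    for line in pdb_lines:
        if line.startswith("ATOM") and line[12:16].strip() == atom_name:
            serials.append(int(line[6:11]))
    return serials


def make_atom_line(serial, name, resname, resid, chain="A"):
    """Build a minimal ATOM record, only used to create demonstration data."""
    return "ATOM  {:5d} {:<4s} {:>3s} {}{:4d}".format(serial, name, resname, chain, resid)


# Made-up records, for illustration only.
demo = [
    make_atom_line(5, " P", "G", 2),
    make_atom_line(6, " OP1", "G", 2),
    make_atom_line(38, " P", "C", 3),
    make_atom_line(39, " C5'", "C", 3),
]
print(atom_serials(demo, "P"))
```

For the real system the function can be applied to open("ref.pdb"), and the result compared with the output of the grep command.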

Notice that using the collective variable DISTANCES you might be able to do the same with a significantly simpler input file! If you have time, also try that and compare the result.

The resulting COLVAR file should look like this one:

#! FIELDS time c s.1
 0.000000 6.655622 0.768704
 1.000000 7.264049 0.379416
 2.000000 7.876489 0.817820
 3.000000 8.230621 0.380191
 4.000000 13.708759 2.046935

Solving periodic-boundary conditions issues

While running, PLUMED can also dump the coordinates of the internally stored atoms using DUMPATOMS. This might seem useless (coordinates are already contained in the original trajectory) but can be used in the following cases:

To dump coordinates of virtual atoms that only exist within PLUMED (e.g. a CENTER).
To dump snapshots of our molecule conditioned to some value of some collective variable (see UPDATE_IF).
To dump coordinates of atoms that have been moved by PLUMED.

The last point is perhaps the most surprising one. Some of the PLUMED actions can indeed move the stored atoms to positions more suitable for the calculation of collective variables.

The previous exercise was done on a trajectory where the RNA was already whole. For the next exercise you will use the traj-broken.xtc file instead, which is a real trajectory produced by GROMACS. Open it with VMD to understand what we mean by broken

> vmd ref.pdb traj-broken.xtc

Select Graphics, then Representations, then type nucleic in the box Selected Atoms. You will see that your RNA duplex is not whole. This is not a problem during MD because of periodic boundary conditions. However, it is difficult to analyze this trajectory. In addition, some collective variables that you might want to compute could require the molecules to be whole (an example of such variables is RMSD).

You might think that there are alternative programs that can be used to reconstruct the molecules that are broken by the periodic boundary in your trajectory before analyzing it. However, you should keep in mind that if you need to compute CVs on the fly to add a bias potential on them (as we will do in the next tutorials), you will have to learn how to reconstruct the broken molecules within PLUMED. If you know alternative tools that can reconstruct the molecules that are broken by the periodic boundary, it is a good idea to also use them and compare the result with PLUMED.

Exercise 2: Solving PBC issues and dump atomic coordinates

Analyze the provided trajectory traj-broken.xtc and use the DUMPATOMS action to produce new trajectories in gro format that contain:

The RNA duplex made whole (not broken by periodic boundary conditions). You should read carefully the documentation of WHOLEMOLECULES.
The whole RNA duplex aligned to a provided template (structure reference.pdb). See FIT_TO_TEMPLATE, using TYPE=OPTIMAL. Notice that you should provide to FIT_TO_TEMPLATE a pdb file with only the atoms that you wish to align. Use the ref.pdb file as a starting point and remove the lines not containing RNA atoms. More details on PDB files in PLUMED can be found here.
The whole RNA duplex and Mg ion, but only including the snapshots where Mg is at a distance of at most 4 Å from the phosphorus atom of residue 8. Search for the serial number of the proper phosphorus atom in the PDB file and use the UPDATE_IF action to select the frames.
The whole RNA duplex plus water molecules and Mg ion wrapped around the center of the duplex. Compute first the center of the duplex with CENTER, then wrap the molecules with WRAPAROUND. Make sure that individual water molecules are not broken after the move!
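
The warning about broken water molecules in the last item can be understood with a small Python sketch of the wrapping operation. This is a conceptual illustration only, not PLUMED's implementation: the function wrap_around and its block_size parameter are hypothetical, the box is assumed to be orthorhombic, and the coordinates are made up.

```python
def wrap_around(positions, center, box, block_size=1):
    """Move atoms to the periodic image closest to `center`.

    Atoms are processed in consecutive blocks of `block_size`. The shift is
    decided from the first atom of each block and applied to the whole block,
    so a block that was whole stays whole.
    """
    wrapped = []
    for start in range(0, len(positions), block_size):
        block = positions[start:start + block_size]
        first = block[0]
        # Whole-box shift that brings the first atom closest to the center.
        shift = tuple(
            -length * round((f - c) / length)
            for f, c, length in zip(first, center, box)
        )
        wrapped.extend(tuple(x + s for x, s in zip(atom, shift)) for atom in block)
    return wrapped


box = (3.0, 3.0, 3.0)       # hypothetical cubic box, in nm
center = (0.2, 1.5, 1.5)    # hypothetical wrapping center
water = [(1.6, 1.5, 1.5), (1.75, 1.5, 1.5), (1.55, 1.5, 1.5)]  # O, H, H

print(wrap_around(water, center, box, block_size=1))  # atom by atom
print(wrap_around(water, center, box, block_size=3))  # molecule by molecule
```

When every atom is wrapped independently, one hydrogen ends up a full box length away from its oxygen, because it sits just beyond half a box from the center. Treating the three atoms as one block keeps the molecule intact.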

Here you can find a template input file to be completed by you.

# First load information about the molecule.
MOLINFO __FILL__
# Define here the groups that you need.
# Same as in the previous exercise.
rna: GROUP ATOMS=__FILL__
mg: GROUP ATOMS=__FILL__
wat: GROUP ATOMS=__FILL__
# Make RNA duplex whole.
WHOLEMOLECULES __FILL__
# Dump first trajectory in gro format.
# Notice that PLUMED understands the format based on the file extension
DUMPATOMS ATOMS=rna FILE=rna-whole.gro
# Align RNA duplex to a reference structure
# This should not be the ref.pdb file but a new file with only RNA atoms.
FIT_TO_TEMPLATE REFERENCE=__FILL__ TYPE=OPTIMAL
# Notice that before using FIT_TO_TEMPLATE we used WHOLEMOLECULES to make RNA whole
# This is necessary otherwise you would align a broken molecule!
# Dump the aligned RNA on a separate file
DUMPATOMS ATOMS=rna FILE=rna-aligned.gro
# Compute the distance between the Mg and the phosphorus from residue 8
d: DISTANCE ATOMS=mg,__FILL__ ## put the serial number of the correct phosphorus here
# here we only dump frames conditioned to the value of d
UPDATE_IF ARG=d __FILL__
DUMPATOMS ATOMS=rna,mg FILE=rna-select.gro
UPDATE_IF ARG=d __FILL__ # this command is required to close the UPDATE_IF above
# compute the center of the RNA molecule
center: CENTER ATOMS=rna
# Wrap atoms correctly
WRAPAROUND ATOMS=mg AROUND=__FILL__
WRAPAROUND ATOMS=wat AROUND=center __FILL__ # anything missing here?
# Dump the last trajectory
DUMPATOMS ATOMS=rna,wat,mg FILE=rna-wrap.gro

After you have prepared a proper plumed.dat file, you can use it with the following command

> plumed driver --plumed plumed.dat --mf_xtc traj-broken.xtc

Visualize the resulting trajectories using VMD. Since the gro files already contain atom names, you do not need to load the pdb file first. For instance, the first trajectory can be shown with

> vmd rna-whole.gro

To learn more: Mastering WHOLEMOLECULES

If you just simulate a single solute molecule in water it is easy to understand how to pick the right options for WHOLEMOLECULES. However, if you have multiple molecules it can be rather tricky. In the example above, we used WHOLEMOLECULES on the RNA molecule which is actually a duplex, that is, two separate chains. This was correct for the following reasons:

the two chains are kept together by hydrogen bonds, and
the last atom of the first chain is always close to the first atom of the second chain.

In case the two molecules can separate from each other this would be rather problematic.
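
The second condition becomes clearer if one looks at how a molecule can be rebuilt atom by atom. To our understanding of the WHOLEMOLECULES documentation, each atom in the list is placed at the periodic image closest to the atom that precedes it. The following Python sketch implements this idea under simplifying assumptions: the functions are hypothetical, the box is orthorhombic, and the coordinates are made up.

```python
def nearest_image(x, reference, box_length):
    """Shift coordinate x by whole box lengths so that it lies closest to reference."""
    return x - box_length * round((x - reference) / box_length)


def make_whole(positions, box):
    """Rebuild a molecule by placing each atom near the previous one in the list.

    positions: list of (x, y, z) tuples; box: (Lx, Ly, Lz) of an orthorhombic cell.
    """
    whole = [tuple(positions[0])]
    for atom in positions[1:]:
        previous = whole[-1]
        whole.append(
            tuple(
                nearest_image(a, p, length)
                for a, p, length in zip(atom, previous, box)
            )
        )
    return whole


# Hypothetical three-atom chain crossing the boundary of a 3 nm cubic box.
box = (3.0, 3.0, 3.0)
broken = [(2.8, 1.0, 1.0), (2.95, 1.0, 1.0), (0.1, 1.0, 1.0)]
print(make_whole(broken, box))
```

The third atom is moved from x = 0.1 to x = 3.1, next to its neighbor. The procedure only works if consecutive atoms in the list are closer than half a box length, which for the duplex requires the end of the first chain to stay near the beginning of the second one.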

We will now see what happens when using WHOLEMOLECULES on multiple molecules incorrectly.

Exercise 2b: Mistakes with WHOLEMOLECULES

Prepare a PLUMED input file that makes all the water molecules whole. Use the following template

# First load information about the molecule.
MOLINFO __FILL__
# Define here the groups that you need
rna: GROUP ATOMS=__FILL__
mg: GROUP ATOMS=__FILL__
wat: GROUP ATOMS=__FILL__
# Make RNA whole
WHOLEMOLECULES ENTITY0=rna
# Now make water whole as if it was a single molecule
WHOLEMOLECULES ENTITY0=wat
# And dump the resulting trajectory
DUMPATOMS ATOMS=rna,wat,mg FILE=wrong.gro

Now look at the resulting file with vmd wrong.gro. Can you work out what the problem is?

The important take-home message here is that when you want to reconstruct periodic boundary conditions correctly in systems with multiple molecules you should be careful and always verify with DUMPATOMS that the system is doing what you expect.

To learn more: Mastering FIT_TO_TEMPLATE

In an exercise above we used FIT_TO_TEMPLATE. This action uses as a reference a PDB file which typically contains a subset of atoms (those that are fitted). However, when you apply FIT_TO_TEMPLATE with TYPE=OPTIMAL, the whole system is translated and rotated. The whole system here means all atoms plus the vectors defining the periodic box.

Exercise 2c: Mastering FIT_TO_TEMPLATE

Check how the periodic box rotates when using FIT_TO_TEMPLATE. Use the following template:

# First load information about the molecule.
MOLINFO __FILL__
# Define here the groups that you need
rna: GROUP ATOMS=__FILL__
mg: GROUP ATOMS=__FILL__
wat: GROUP ATOMS=__FILL__
# Make RNA whole
WHOLEMOLECULES ENTITY0=rna
# Here's a compound variable with the box vectors
# computed before aligning RNA
cell_before: CELL
# Now we align RNA
FIT_TO_TEMPLATE __FILL__ TYPE=OPTIMAL
# Here's a compound variable with the box vectors
# computed after aligning RNA
cell_after: CELL
PRINT ARG=cell_before.* FILE=CELL_BEFORE
PRINT ARG=cell_after.* FILE=CELL_AFTER

You should obtain files like the ones reported below.

CELL_BEFORE should be

#! FIELDS time cell_before.ax cell_before.ay cell_before.az cell_before.bx cell_before.by cell_before.bz cell_before.cx cell_before.cy cell_before.cz
 0.000000 4.533710 0.000000 0.000000 0.000000 4.533710 0.000000 2.266860 2.266860 3.205821
 1.000000 4.533710 0.000000 0.000000 0.000000 4.533710 0.000000 2.266860 2.266860 3.205821
 2.000000 4.533710 0.000000 0.000000 0.000000 4.533710 0.000000 2.266860 2.266860 3.205821
 3.000000 4.533710 0.000000 0.000000 0.000000 4.533710 0.000000 2.266860 2.266860 3.205821
 4.000000 4.533710 0.000000 0.000000 0.000000 4.533710 0.000000 2.266860 2.266860 3.205821

CELL_AFTER should be

#! FIELDS time cell_after.ax cell_after.ay cell_after.az cell_after.bx cell_after.by cell_after.bz cell_after.cx cell_after.cy cell_after.cz
 0.000000 4.533710 -0.000059 -0.000008 0.000059 4.533710 -0.000172 2.266895 2.266952 3.205730
 1.000000 -0.396226 4.289476 -1.413481 -1.244340 1.260309 4.173460 2.249665 3.307132 2.134590
 2.000000 -3.016552 1.123968 -3.192434 -1.055123 -4.375593 -0.543533 -4.309790 -1.356178 0.375612
 3.000000 -4.083873 1.923282 -0.421306 0.339577 -0.267554 -4.513051 -3.243502 -2.069026 -2.398628
 4.000000 -4.020722 2.094622 -0.029688 -1.060483 -1.979827 3.938298 -1.263169 2.532008 3.542306

As you can see, the generating vectors of the periodic lattice before fitting are constant. On the other hand, after fitting these vectors change so as to keep RNA correctly aligned to its reference structure.
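
Since a rigid rotation preserves lengths and angles, the lengths of the box vectors and the scalar products between them should be the same in CELL_BEFORE and CELL_AFTER. The short Python sketch below checks this with the numbers printed above. The names gram, BEFORE and AFTER are illustrative choices, and the values are simply copied from the two files.

```python
def gram(cell):
    """Return the 3x3 matrix of scalar products between box vectors a, b, c.

    cell: the nine numbers ax ay az bx by bz cx cy cz, as printed by PLUMED
    """
    vecs = [cell[0:3], cell[3:6], cell[6:9]]
    return [[sum(x * y for x, y in zip(u, v)) for v in vecs] for u in vecs]


# Box vectors before fitting (identical in all frames of CELL_BEFORE)
BEFORE = (4.533710, 0.000000, 0.000000,
          0.000000, 4.533710, 0.000000,
          2.266860, 2.266860, 3.205821)

# Box vectors after fitting, one tuple per frame of CELL_AFTER
AFTER = [
    (4.533710, -0.000059, -0.000008, 0.000059, 4.533710, -0.000172,
     2.266895, 2.266952, 3.205730),
    (-0.396226, 4.289476, -1.413481, -1.244340, 1.260309, 4.173460,
     2.249665, 3.307132, 2.134590),
    (-3.016552, 1.123968, -3.192434, -1.055123, -4.375593, -0.543533,
     -4.309790, -1.356178, 0.375612),
    (-4.083873, 1.923282, -0.421306, 0.339577, -0.267554, -4.513051,
     -3.243502, -2.069026, -2.398628),
    (-4.020722, 2.094622, -0.029688, -1.060483, -1.979827, 3.938298,
     -1.263169, 2.532008, 3.542306),
]

reference = gram(BEFORE)
for frame, cell in enumerate(AFTER):
    g = gram(cell)
    # Largest change in any scalar product with respect to the unfitted box
    worst = max(abs(g[i][j] - reference[i][j])
                for i in range(3) for j in range(3))
    print(f"frame {frame}: largest deviation in scalar products = {worst:.1e}")
```

The deviations should be tiny compared with the squared box length, consistent with the box being rotated but not deformed.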

Later on you will learn how to add a bias potential on a given collective variable. In principle, you could also add a RESTRAINT to the cell_after.* variables of the last example. This would allow you to force your molecule to a specific orientation.
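
To make the remark about orientation more concrete: before fitting, box vector a lies along the x axis, so the direction of cell_after.a shows how far the fit has rotated that axis. A small Python sketch, with the values copied from CELL_BEFORE and CELL_AFTER above and a hypothetical helper named angle_deg, could look like this:

```python
import math


def angle_deg(u, v):
    """Angle in degrees between two 3D vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    # Clamp to [-1, 1] to protect acos from rounding errors
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))


# Box vector a before fitting (from CELL_BEFORE)
a_before = (4.533710, 0.000000, 0.000000)

# Box vector a after fitting, one entry per frame (from CELL_AFTER)
a_after = [
    (4.533710, -0.000059, -0.000008),
    (-0.396226, 4.289476, -1.413481),
    (-3.016552, 1.123968, -3.192434),
    (-4.083873, 1.923282, -0.421306),
    (-4.020722, 2.094622, -0.029688),
]

for time, a in enumerate(a_after):
    print(f"time {time}: a rotated by {angle_deg(a_before, a):.1f} degrees")
```

A single vector only captures part of the rotation; the full orientation is encoded by all nine cell_after components together.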

Conclusions

In summary, in this tutorial you should have learned how to use PLUMED to:

Manipulate atomic coordinates.
Compute collective variables.

All of this was done by just reading an already available trajectory. Notice that there are many alternative tools that could have been used to do the same exercise. Indeed, if you are familiar with other tools, it might be a good idea to also try them and compare the results. What makes working with PLUMED special is the following:

PLUMED implements a vast library of useful collective variables. Browse the manual and search for ideas that are suitable for your system.
PLUMED has a simple and intuitive syntax for combining collective variables, yielding descriptors capable of characterizing complex conformational changes.
And finally, the most special thing: any collective variable that can be computed within PLUMED can also be biased while you are running your MD simulation! You will learn more later about this topic.

The last point is probably the main reason why PLUMED exists and what distinguishes it from other available software.
