Action: MOLINFO

Module generic
Description Usage
This command is used to provide information on the molecules that are present in your system. used in 13 tutorialsused in 103 eggs
This action outputs data to a file. You can read more about how PLUMED manages output files here

Input

The atoms that serve as the input for this action are specified using one or more of the keywords in the following table.

Keyword Type Description
CHAIN atoms (for masochists ( mostly Davide Branduardi ) ) The atoms involved in each of the chains of interest in the structure

Further details and examples

This command is used to provide information on the molecules that are present in your system. A simple examples where this command is used to provide the information on which atoms are in the backbone of a protein to the ALPHARMSD action is shown below.

Click on the labels of the actions for more information on what each action computes
tested on2.11
#SETTINGS MOLFILE=regtest/basic/rt32/helix.pdb
MOLINFOThis command is used to provide information on the molecules that are present in your system. More details STRUCTUREa file in pdb format containing a reference structure=
regtest/basic/rt32/helix.pdb
Click here to see an extract from this file.
×

FILE: regtest/basic/rt32/helix.pdb

ATOM      1 HH31 ACE     1      -9.105  -2.402  21.804  0.00  0.00            
ATOM      2  CH3 ACE     1      -8.930  -3.352  22.308  0.00  0.00            
ATOM      3 HH32 ACE     1      -9.504  -3.501  23.223  0.00  0.00            
ATOM      4 HH33 ACE     1      -9.067  -4.173  21.604  0.00  0.00            
ATOM      5  C   ACE     1      -7.450  -3.303  22.659  0.00  0.00            
...
ATOM    129  CH3 NME    14      -1.690  -7.089  20.487  0.00  0.00            
ATOM    130 HH31 NME    14      -1.873  -6.629  21.459  0.00  0.00            
ATOM    131 HH32 NME    14      -1.152  -6.284  19.986  0.00  0.00            
ATOM    132 HH33 NME    14      -1.135  -8.020  20.602  0.00  0.00            
END
ALPHARMSDProbe the alpha helical content of a protein structure. This action is a shortcut. More details RESIDUESthis command is used to specify the set of residues that could conceivably form part of the secondary structure=all TYPE the manner in which RMSD alignment is performed=DRMSD LESS_THANcalculate the number of a residue segments that are within a certain target distance of this secondary structure type. Options for this keyword are explained in the documentation for LESS_THAN.={RATIONAL R_0=0.08 NN=8 MM=12} LABELa label for the action so that its output can be referenced in the input to other actions=a

The following example prints the distance corresponding to the hydrogen bonds in a GC Watson-Crick pair.

Click on the labels of the actions for more information on what each action computes
tested on2.11
#SETTINGS MOLFILE=regtest/basic/rt-ermsd/ref.pdb
MOLINFOThis command is used to provide information on the molecules that are present in your system. More details STRUCTUREa file in pdb format containing a reference structure=
regtest/basic/rt-ermsd/ref.pdb
Click here to see an extract from this file.
×

FILE: regtest/basic/rt-ermsd/ref.pdb

REMARK    GENERATED BY TRJCONV
TITLE     add Villa in water t=   0.00000
REMARK    THIS IS A SIMULATION BOX
CRYST1   79.904   79.904   79.904  60.00  60.00  90.00 P 1           1
MODEL        1
...
ATOM   2255 2HO' RG3    71      23.000  58.930  29.250  1.00  0.00            
ATOM   2256  O3' RG3    71      21.250  59.670  32.300  1.00  0.00            
ATOM   2257  H3T RG3    71      21.130  59.570  31.350  1.00  0.00            
TER
ENDMDL
MOLTYPE what kind of molecule is contained in the pdb file - usually not needed since protein/RNA/DNA are compatible=dna hb1: DISTANCECalculate the distance/s between pairs of atoms. More details ATOMSthe pair of atom that we are calculating the distance between=@N2-2the N2 atom in residue 2. Click here for more information. ,@O2-15the O2 atom in residue 15. Click here for more information. hb2: DISTANCECalculate the distance/s between pairs of atoms. More details ATOMSthe pair of atom that we are calculating the distance between=@N1-2the N1 atom in residue 2. Click here for more information. ,@N3-15the N3 atom in residue 15. Click here for more information. hb3: DISTANCECalculate the distance/s between pairs of atoms. More details ATOMSthe pair of atom that we are calculating the distance between=@O6-2the O6 atom in residue 2. Click here for more information. ,@N4-15the N4 atom in residue 15. Click here for more information. PRINTPrint quantities to a file. More details ARGthe labels of the values that you would like to print to the file=hb1,hb2,hb3

The last example shows you how you can use MOLINFO to calculate torsion angles

Click on the labels of the actions for more information on what each action computes
tested on2.11
#SETTINGS MOLFILE=regtest/basic/rt32/helix.pdb
MOLINFOThis command is used to provide information on the molecules that are present in your system. More details MOLTYPE what kind of molecule is contained in the pdb file - usually not needed since protein/RNA/DNA are compatible=protein STRUCTUREa file in pdb format containing a reference structure=
regtest/basic/rt32/helix.pdb
Click here to see an extract from this file.
×

FILE: regtest/basic/rt32/helix.pdb

ATOM      1 HH31 ACE     1      -9.105  -2.402  21.804  0.00  0.00            
ATOM      2  CH3 ACE     1      -8.930  -3.352  22.308  0.00  0.00            
ATOM      3 HH32 ACE     1      -9.504  -3.501  23.223  0.00  0.00            
ATOM      4 HH33 ACE     1      -9.067  -4.173  21.604  0.00  0.00            
ATOM      5  C   ACE     1      -7.450  -3.303  22.659  0.00  0.00            
...
ATOM    129  CH3 NME    14      -1.690  -7.089  20.487  0.00  0.00            
ATOM    130 HH31 NME    14      -1.873  -6.629  21.459  0.00  0.00            
ATOM    131 HH32 NME    14      -1.152  -6.284  19.986  0.00  0.00            
ATOM    132 HH33 NME    14      -1.135  -8.020  20.602  0.00  0.00            
END
t1: TORSIONCalculate one or multiple torsional angles. More details ATOMSthe four atoms involved in the torsional angle=@phi-3the four atoms that are required to calculate the phi dihedral for residue 3. Click here for more information. t2: TORSIONCalculate one or multiple torsional angles. More details ATOMSthe four atoms involved in the torsional angle=@psi-4the four atoms that are required to calculate the psi dihedral for residue 4. Click here for more information. PRINTPrint quantities to a file. More details ARGthe labels of the values that you would like to print to the file=t1,t2 FILEthe name of the file on which to output these quantities=colvar STRIDE the frequency with which the quantities of interest should be output=10

The information on the molecules in your system can either be provided in the form of a pdb file or as a set of lists of atoms that describe the various chains in your system. If a pdb file is used plumed the MOLINFO command will endeavor to recognize the various chains and residues that make up the molecules in your system using the chainIDs and resnumbers from the pdb file. You can then use this information in later commands to specify atom lists in terms residues. For example using this command you can find the backbone atoms in your structure automatically. Since the introduction of PLUMED 2.7 you have been able to use multiple MOLINFO actions. Every time you perform an atom selection, the last available MOLINFO action will be used. This allows you to provide multiple PDB files, for instance using different naming conventions (see issue 134).

[!CAUTION] Please be aware that the PDB parser in plumed is far from perfect. You should thus check the log file and examine what plumed is actually doing whenever you use the MOLINFO action. Also make sure that the atoms are listed in the pdb with the correct order. If you are using gromacs, the safest way is to use reference pdb file generated with gmx editconf -f topol.tpr -o reference.pdb.

More information of the PDB parser implemented in PLUMED can be found in the documentation for the PDB2CONSTANT command.

If the flag WHOLE is used, the reference structure is assumed to be whole, i.e. not broken by PBC. This will impact PBC calculations in all the actions following the MOLINFO action!. In particular, actions that reconstruct molecules locally will do the reconstruction using a minimum spanning tree rather than the order in which atoms are provided. Notice that in PLUMED 2.8, 2.9, and 2.10 this would only have affected the behavior of \ref WHOLEMOLECULES actions, whereas starting with PLUMED 2.11 all actions that reconstruct PBCs are affected. If you want a variable to ignore this, you can just repeate the MOLINFO action without the WHOLE flag: all the following actions will use the traditional reconstruction algorithm, based on the order in which atoms are listed.

Providing MOLTYPE=protein, MOLTYPE=rna, or MOLTYPE=dna will instruct plumed to look for known residues from these three types of molecule. In other words, this keyword is available for historical reasons and to allow future extensions where alternative lists will be provided. As of now, you can just ignore this keyword.

Using MOLINFO extends the possibility of atoms selection using the @ special symbol. All the various ways that the @ symbol can be used are listed in the following table:

In the table above you will find the following shortcuts that allow you to select atoms that are not part of one specific residue:

@nucleic : all atoms that are part of a DNA or RNA molecule
@protein : all atoms that are part of a protein
@water : all water molecules
@ions : all the ions
@hydrogens : all hydrogen atoms (those for which the first non-number in the name is a H)
@nonhydrogens : all non hydrogen atoms (those for which the first non-number in the name is not a H)

[!CAUTION] Be careful since these choices are based on common names used in PDB files. Always check if the selected atoms are correct.

When you would like atoms from a specific residue you will use one of the atom selection commands from the above table with the following form:

@"definition"-chain_residuenum
@"definition"-chainresiduenum
@"definition"-residuenum

So for example

@psi-1 will select the atoms defining the psi torsion of residue 1
@psi-C1  or @psi-C_1 will define the same torsion for residue 1 of chain C.
@psi-3_1 will define the same torsion for residue 1 of chain 3.

Using the underscore to separate chain and residue is available as of PLUMED 2.5 and allows you to select chains with a numeric id.

Breaking the table above down we see that for protein residues, the following groups are available:

# quadruplets for dihedral angles
@phi-#
@psi-#
@omega-#
@chi1-#
@chi2-#
@chi3-#
@chi4-#
@chi5-#

# all sidechain atoms (excluding glycine, including all hydrogens)
@sidechain-#
# all backbone atoms (including hydrogens)
@back-#

while for DNA or RNA residues, the following groups are available:

# quadruplets for backbone dihedral angles
@alpha-#
@beta-#
@gamma-#
@delta-#
@epsilon-#
@zeta-#

# quadruplets for sugar dihedral angles
@v0-#
@v1-#
@v2-#
@v3-#
@v4-#

# quadruplet corresponding to the chi torsional angle
@chi-#

# backbone, sugar, and base heavy atoms
@back-#
@sugar-#
@base-#

# ordered triplets of atoms on the 6-membered ring of nucleobases
# namely:
#  C2/C4/C6 for pyrimidines
#  C2/C6/C4 for purines
@lcs-#

Notice that zeta and epsilon groups should not be used on 3' end residue and alpha and beta should not be used on 5' end residue.

Furthermore it is also possible to pick single atoms using the syntax atom-chain_residuenum, @atom-chainresiduenum or @atom-residuenum. As of PLUMED 2.5, this also works when the residue is not a protein/rna/dna residue. For instance, @OW-100 will select oxygen of water molecule with residue number 100.

Finally, notice that some shortcuts are available even when you not using the MOLINFO command.

[!CAUTION] If a residue-chain is repeated twice in the reference pdb only the first entry will be selected.

[!CAUTION] At the moment the HA1 atoms in a GLY residues are treated as if they are the CB atoms. This may or may not be true - GLY is problematic for secondary structure residues as it is achiral.

[!CAUTION] If you use WHOLEMOLECULES RESIDUES=1-10 for a 18 amino acid protein ( 18 amino acids + 2 terminal groups = 20 residues ) the code will fail as it will not be able to interpret terminal residue 1.

Advanced atom selection with mdtraj or MDAnalysis

Since PLUMED 2.6 it is possible to use the expressive selection syntax of mdtraj and/or MDAnalysis:

Click on the labels of the actions for more information on what each action computes
tested on2.11
#SETTINGS MOLFILE=regtest/basic/rt32/helix.pdb
MOLINFOThis command is used to provide information on the molecules that are present in your system. More details STRUCTUREa file in pdb format containing a reference structure=
regtest/basic/rt32/helix.pdb
Click here to see an extract from this file.
×

FILE: regtest/basic/rt32/helix.pdb

ATOM      1 HH31 ACE     1      -9.105  -2.402  21.804  0.00  0.00            
ATOM      2  CH3 ACE     1      -8.930  -3.352  22.308  0.00  0.00            
ATOM      3 HH32 ACE     1      -9.504  -3.501  23.223  0.00  0.00            
ATOM      4 HH33 ACE     1      -9.067  -4.173  21.604  0.00  0.00            
ATOM      5  C   ACE     1      -7.450  -3.303  22.659  0.00  0.00            
...
ATOM    129  CH3 NME    14      -1.690  -7.089  20.487  0.00  0.00            
ATOM    130 HH31 NME    14      -1.873  -6.629  21.459  0.00  0.00            
ATOM    131 HH32 NME    14      -1.152  -6.284  19.986  0.00  0.00            
ATOM    132 HH33 NME    14      -1.135  -8.020  20.602  0.00  0.00            
END
PYTHON_BIN python interpreter=python g1: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group=@mda:atom selection built using syntax for MDAnalysis. Click here for more information. backbone g2: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group={@mda:atom selection built using syntax for MDAnalysis. Click here for more information. {resnum 1 or resid 3:5}} g3: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group={@mda:atom selection built using syntax for MDAnalysis. Click here for more information. {resid 3:5} @mda:{resnum 1}} g4: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group={@mdt:atom selection built using syntax for mdtraj. Click here for more information. {protein and (backbone or resname ALA)}} g5: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group={@mdt:atom selection built using syntax for mdtraj. Click here for more information. {mass 5.5 to 20}} # masses guessed by mdtraj based on atom type! g6: GROUPDefine a group of atoms so that a particular list of atoms can be referenced with a single label in definitions of CVs or virtual atoms. More details ATOMSthe numerical indexes for the set of atoms in the group={@mda:atom selection built using syntax for MDAnalysis. Click here for more information. {resid 3:5} 1-10 @mda:{resnum 1}the resid 3:5} 1 atom in residue 10 @mda:{resnum 1}. Click here for more information. }

Here @mda: indicates that the MDAnalysis language is used, whereas @mdt: indicates that the mdtraj language is used. Notice that these languages typically select atoms in order. If you want to specify a different order, you can chain definitions as in g3 above (compare with g2). Selections can be also chained with standard PLUMED selections (see g6).

The double braces are required due to the way PLUMED parses atom lists. In particular:

  • The outer braces are needed to show PLUMED where the ATOMS=... option ends.
  • The inner braces are needed to show PLUMED where each selector ends.

MDAnalysis also supports geometric selectors based on atomic coordinates. These selectors are static and return lists computed using the coordinates stored in the MOLINFO pdb file.

In order to use this syntax you should check the following points at runtime:

  1. plumed --no-mpi config has subprocess prints subprocess on (should be ok on most UNIX systems).
  2. You have a python interpreter with mdtraj and/or MDAnalysis installed. You can check using:
  3. python -c "import mdtraj"
  4. python -c "import MDAnalysis"

In order to install these packages refer to their documentation. Pip or conda install should be ok, provided you make sure the correct python interpreter is in the execution PATH at runtime. Notice that you will only need the package(s) related to the syntax that you want to use. 3. In case you installed these modules on a python with a different name (e.g. python3.6), the correct check is: - python3.6 -c "import mdtraj" - python3.6 -c "import MDAnalysis"

If this is the case, you should set the environment variable export PYTHON_BIN=python3.6 or export PLUMED_PYTHON_BIN=python3.6 (higher priority). Alternatively, directly provide the interpreter in the PLUMED input file using MOLINFO PYTHON_BIN=python3.6 (even higher priority). 4. The PDB file that you provide to MOLINFO should have consecutive atom numbers starting from 1. This is currently enforced since reading atom numbers out of order (as PLUMED does) is not supported by other packages.

Advanced atom selection with VMD (experimental)

Similarly to the @mda: and @mdt: selectors above, you can use the two following selectors in order to access to VMD syntax for atoms selection: - @vmdexec:: This selector launches an instance of VMD, so vmd executable should be in your execution path. Might be very slow or even crash your simulation. Notice that even if vmd executable is used, the implementation is still python based and so a working python interpreter should be provided. - @vmd:: This selector tries to import the vmd python module. Notice that the best way to obtain this module is not within the standard VMD installer but rather by installing the python module that can be found at this link. The module is also available on conda. You should make sure the module is available in the python interpreter used by MOLINFO (check using the command python -c "import vmd").

These two selectors are experimental and might be removed at some point.

Syntax

The following table describes the keywords and options that can be used with this action

Keyword Type Default Description
CHAIN input none (for masochists ( mostly Davide Branduardi ) ) The atoms involved in each of the chains of interest in the structure
STRUCTURE compulsory none a file in pdb format containing a reference structure
MOLTYPE compulsory protein what kind of molecule is contained in the pdb file - usually not needed since protein/RNA/DNA are compatible
PYTHON_BIN compulsory default python interpreter
WHOLE optional false The reference structure is whole, i