Action: FARTHEST_POINT_SAMPLING

Module landmarks
Description Usage
Select a set of landmarks using farthest point sampling. used in 0 tutorialsused in 0 eggs
output value type
a vector which has as many elements as there are rows in the input matrix of dissimilarities vector

Input

The arguments that serve as the input for this action are specified using one or more of the keywords in the following table.

Keyword Type Description
ARG matrix the input matrix

Further details and examples

Select a set of landmarks using farthest point sampling.

This action is used within the LANDMARK_SELECT_FPS shortcut, which performs farthest point sampling. Farthest point sampling is a method of selecting a subset of input coordinates that works by selecting a first point at random. The remaining points are then selected by taking the unselected point in the input data set that is the furthest from all the points that have been selected thus far.

This particular action is designed to be used in conjunction with SELECT_WITH_MASK in the same way that CREATE_MASK is designed to be be used with that action. This action takes an N×N matrix of dissimilarities in input and outputs and N dimensional vector whose elements are ones and zeros. As shown in the example input below, this output vector is passed to a SELECT_WITH_MASK using the MASK keyword.

The example below thus shows how this action is used in the LANDMARK_SELECT_FPS shortcut to select 100 landmark frames from the trajectory using farthest point sampling.

Click on the labels of the actions for more information on what each action computes
tested on2.11
# This stores the positions of all the first 10 atoms in the system for later analysis
cc: COLLECT_FRAMESCollect atomic positions or argument values from the trajectory for later analysis This action is a shortcut and it has hidden defaults. More details ATOMSlist of atomic positions that you would like to collect and store for later analysis=1,2,3,4,5,6,7,8,9,10 ALIGN if storing atoms how would you like the alignment to be done can be SIMPLE/OPTIMAL=OPTIMAL

# We now compute the dissimilarities between these frames cc_dataT: TRANSPOSECalculate the transpose of a matrix More details ARGthe label of the vector or matrix that should be transposed=cc_data dd: DISSIMILARITIESCalculate the matrix of dissimilarities between a trajectory of atomic configurations. More details ARGthe label of the two matrices from which the product is calculated=cc_data,cc_dataT # Create a mask that will be used to create our landmark data mask: FARTHEST_POINT_SAMPLINGSelect a set of landmarks using farthest point sampling. This action has hidden defaults. More details ARGthe input matrix=dd NZEROSthe number of landmark points that you want to select=100
# These are the landmark points landmarks: SELECT_WITH_MASKUse a mask to select elements of an array More details ARGthe label for the value upon which you are going to apply the mask=cc_data ROW_MASKan array with ones in the rows of the matrix that you want to discard=mask # Output the landmarks to a file DUMPPDBOutput PDB file. More details ATOMSvalue containing positions of atoms that should be output=landmarks ATOM_INDICESthe indices of the atoms in your PDB output=1,2,3,4,5,6,7,8,9,10 FILEthe name of the file on which to output these quantities=traj.pdb

This only saves the coordinates of the landmark points. If you look at the shortcuts in the documentation for LANDMARK_SELECT_FPS you can see how the shortcut also saves information on the dissimilarities between the landmarks, the dissimilarities between the landmarks and all the other points and the weights of the landmarks, which are determined using a VORONOI analysis.

Syntax

The following table describes the keywords and options that can be used with this action

Keyword Type Default Description
ARG input none the input matrix
NZEROS compulsory none the number of landmark points that you want to select
SEED compulsory 1234 a random number seed
MATRIX optional not used the input matrix (can use ARG instead)