Trajectory Looping: Difference between revisions

From Laboratory of Modeling in Biology and Medicine
Jump to: navigation, search
mNo edit summary
mNo edit summary
Line 6: Line 6:




=== Download ===
=== Downloads ===
Source code of a C++ implementation of the method, called '''Looper''', can be retrieved [http://pmbm.ippt.pan.pl/software/looping/looper-v20180314.tar.gz here].
* [http://pmbm.ippt.pan.pl/software/looping/looper-v20180314.tar.gz '''Looper''' – an implementation of the method in C++]
* [http://pmbm.ippt.pan.pl/software/looping/input.bin Example input trajectory]




=== Building ===
=== On the implementation ===


'''Compiler.'''  A decent and recent C++ compiler that supports C++14 standard is required.
Trajectory looping can be implemented in two different but
The code has been confirmed to compile successfully under Linux with GNU
equivalent waysOne may first perform multiple joins of the base
g++ 8.0 and Clang++ 6.0.
contacts trajectory and in this manner obtain a looped contacts
 
trajectory of a desired length without considering chemical events.  
'''Toolchain:'''  Compilation is managed by CMakeTo build Looper under Linux, it should be
The looped contacts trajectory, consisting of the base contacts
sufficient to enter the build directory and execute 'build_release.sh'.
trajectory and index maps resulting from optimal assignments, provides
 
a complete information necessary for a subsequent simulation of chemical  
 
trajectory (this way has been presented in Figure 1 in the [[#RefZukEtAl|article]]).
===Usage ===
 
Invoke simply as: 
 
  ./Looper trajectory chemprefix
 
'''Multiple input trajectories.''' 
More than one trajectory file can be read in and used.  Multiple files should
be listed consecutively before the last invocation argument, e.g., as:
 
  ./Looper traj1.bin traj2.bin traj3.bin chemprefix
 
The order of the use of multiple trajectories is random.  The trajectories
should have the same number of molecules and the same time step.
 
 
'''Multiple simulations – seed.''' 
If more than one simulation is to be performed (by setting appropriately
variable <code>kNumOfReplicas</code> in file Settings.hpp), each simulation is seeded
sequentially starting from the value assigned to kSimulationSeedsBase (file
Settings.hpp).  This value can be overridden by setting the environmental
variable <code>LOOPER_SEED_BASE</code> prior to Looper invocation.
 
 
'''Multiple simulations – multi-threading'''&nbsp;
Multiple simulations of chemical kinetics that use the same base chemical
trajectory can run in parallel.  During run-time, the software chooses the
number of threads based on both the <code>kNumOfReplicas</code> (described above) and
hardware capabilities.  If necessary, this behavior can be tweaked manually
using variable <code>kNumOfThreads</code> (in file Settings.hpp).


The second way consists in interweaving the simulation of chemical events
with joins of the base contacts trajectory.  This way can be more convenient
for some applications because what is expanding in computer memory is the
ultimate looped chemical trajectory, which is of direct interest, and not
the intermediary looped contacts trajectory, whose desired length may be
hard to guess.  When using the second way, a stop condition can be defined
depending on the chemical state of the of the system or based on the already
collected chemical event statistics.  This implementation of trajectory
looping has been written according to the second way.




Line 75: Line 55:




=== Modifying the code ===


=== Defining a new chemistry ===
<accordion parent="accordion" heading="Detailed instructions">
=== Defining a new reaction system ===


In file Chemistry.hpp there are two systems of reactions: <code>ChemistrySystem1</code>  
In file Chemistry.hpp there are already two systems of reactions: <code>ChemistrySystem1</code>  
and <code>ChemistrySystem2</code>, which correspond to the monostable and bistable  
and <code>ChemistrySystem2</code>, which correspond to the monostable and bistable  
reaction systems, respectively, analyzed in the article.
reaction systems, respectively, analyzed in the article.
Line 87: Line 69:
To provide a new system of reactions, please follow the conventions used to
To provide a new system of reactions, please follow the conventions used to
implement <code>ChemistrySystem1</code> and <code>ChemistrySystem2</code>.
implement <code>ChemistrySystem1</code> and <code>ChemistrySystem2</code>.




Line 101: Line 82:
Variable <code>kWriteToChemStateFileEverySteps</code> in Settings.hpp
Variable <code>kWriteToChemStateFileEverySteps</code> in Settings.hpp


=== Building ===
'''Compiler.'''&nbsp; A decent and recent C++ compiler that supports C++14 standard is required.
The code has been confirmed to compile successfully under Linux with GNU
g++ 8.0 and Clang++ 6.0.
'''Toolchain.'''&nbsp; Compilation is managed by CMake.  To build Looper under Linux, it should be
sufficient to enter the build directory and execute <code>./build_release.sh</code>.
===Usage ===
Invoke simply as: 
  ./Looper trajectory chemprefix




=== Implementation details ===
'''Multiple input trajectories.'''&nbsp;
More than one trajectory file can be read in and used.  Multiple files should
be listed consecutively before the last invocation argument, e.g., as:
 
  ./Looper traj1.bin traj2.bin traj3.bin chemprefix
 
The order of the use of multiple trajectories is random.  The trajectories
should have the same number of molecules and the same time step.
 
 
'''Multiple simulations – seed.'''&nbsp;
If more than one simulation is to be performed (by setting appropriately
variable <code>kNumOfReplicas</code> in file Settings.hpp), each simulation is seeded
sequentially starting from the value assigned to <code>kSimulationSeedsBase</code> (file
Settings.hpp).  This value can be overridden by setting the environmental
variable <code>LOOPER_SEED_BASE</code> prior to Looper invocation.


In principle, trajectory looping can be applied in two different but
equivalent ways.  One may first perform multiple joins of the base
contacts trajectory and in this manner obtain a looped contacts
trajectory of a desired length without considering chemical events.
The looped contacts trajectory, consisting of the base contacts
trajectory and index maps resulting from optimal assignments, provides
a complete information necessary for a subsequent simulation of chemical
trajectory (this way has been presented in Figure 1 in the article).


The second way consists in interweaving the simulation of chemical events
'''Multiple simulations – multi-threading'''&nbsp;
with joins of the base contacts trajectory.  This way can be more convenient
Multiple simulations of chemical kinetics that use the same base chemical
for some applications because what is expanding in computer memory is the
trajectory can run in parallelDuring run-time, the software chooses the  
ultimate looped chemical trajectory, which is of direct interest, and not
number of threads based on both the <code>kNumOfReplicas</code> (described above) and
the intermediary looped contacts trajectory, whose desired length may be
hardware capabilitiesIf necessary, this behavior can be tweaked manually
hard to guessWhen using the second way, a stop condition can be defined
using variable <code>kNumOfThreads</code> (in file Settings.hpp).
depending on the chemical state of the of the system or based on the already
</accordion>
collected chemical event statisticsThis implementation of trajectory
looping has been written according to the second way.




<span id="RefZukEtAl"></span>
=== Reference ===
=== Reference ===
* <span class="pmbm">Żuk PJ</span>, <span class="pmbm">Kochańczyk M</span>, <span class="pmbm">Lipniacki T</span>. '''Trajectory looping for sampling rare events in stochastic reaction–diffusion systems''', ''Bioinformatics'' (submitted)
* <span class="pmbm">Żuk PJ</span>, <span class="pmbm">Kochańczyk M</span>, <span class="pmbm">Lipniacki T</span>. '''Trajectory looping for sampling rare events in stochastic reaction–diffusion systems''', ''Bioinformatics'' (submitted)

Revision as of 00:12, 15 March 2018

A method for sampling of rare events in stochastic reaction–diffusion systems


235px-Ouroboros_1.jpg


Downloads


On the implementation

Trajectory looping can be implemented in two different but equivalent ways. One may first perform multiple joins of the base contacts trajectory and in this manner obtain a looped contacts trajectory of a desired length without considering chemical events. The looped contacts trajectory, consisting of the base contacts trajectory and index maps resulting from optimal assignments, provides a complete information necessary for a subsequent simulation of chemical trajectory (this way has been presented in Figure 1 in the article).

The second way consists in interweaving the simulation of chemical events with joins of the base contacts trajectory. This way can be more convenient for some applications because what is expanding in computer memory is the ultimate looped chemical trajectory, which is of direct interest, and not the intermediary looped contacts trajectory, whose desired length may be hard to guess. When using the second way, a stop condition can be defined depending on the chemical state of the of the system or based on the already collected chemical event statistics. This implementation of trajectory looping has been written according to the second way.


Format of input trajectory

Looper reads in binary trajectories. They are expected to have a header consisting of:

  • the number of molecules (4-byte signed integer),
  • time step (double-precision real number),
  • length of the edge of the cubical simulation box (positive value) or three (negative) values for lengths of non-cubical but cuboidal box (double-precision real number(s)).

The body of the trajectory file contains multiple frames. A single frame consists of:

  • current step index (4-byte signed integer),
  • current time (double-precision real number)
  • an ordered series of molecule coordinates.

Molecule coordinates comprise six values (six double-precision real numbers), out of which three first describe absolute molecule position (x, y, z) and the remaining three are required to be present but are not used.


Modifying the code

Defining a new reaction system

In file Chemistry.hpp there are already two systems of reactions: ChemistrySystem1 and ChemistrySystem2, which correspond to the monostable and bistable reaction systems, respectively, analyzed in the article.

To define and use a new system of reactions, one has to create a class that inherits from the StochasticChemistry class. The subclass should set initial conditions in its constructor and should provide a match_events() method. To provide a new system of reactions, please follow the conventions used to implement ChemistrySystem1 and ChemistrySystem2.


Various settings

Simulation duration:   Variable kTimeEnd in Settings.hpp.

Contact diameter:   Variable kSearchRadius in Settings.hpp.

Skipping frames when writing out:   Variable kWriteToChemStateFileEverySteps in Settings.hpp


Building

Compiler.  A decent and recent C++ compiler that supports C++14 standard is required. The code has been confirmed to compile successfully under Linux with GNU g++ 8.0 and Clang++ 6.0.

Toolchain.  Compilation is managed by CMake. To build Looper under Linux, it should be sufficient to enter the build directory and execute ./build_release.sh.


Usage

Invoke simply as:

 ./Looper trajectory chemprefix


Multiple input trajectories.  More than one trajectory file can be read in and used. Multiple files should be listed consecutively before the last invocation argument, e.g., as:

 ./Looper traj1.bin traj2.bin traj3.bin chemprefix

The order of the use of multiple trajectories is random. The trajectories should have the same number of molecules and the same time step.


Multiple simulations – seed.  If more than one simulation is to be performed (by setting appropriately variable kNumOfReplicas in file Settings.hpp), each simulation is seeded sequentially starting from the value assigned to kSimulationSeedsBase (file Settings.hpp). This value can be overridden by setting the environmental variable LOOPER_SEED_BASE prior to Looper invocation.


Multiple simulations – multi-threading  Multiple simulations of chemical kinetics that use the same base chemical trajectory can run in parallel. During run-time, the software chooses the number of threads based on both the kNumOfReplicas (described above) and hardware capabilities. If necessary, this behavior can be tweaked manually using variable kNumOfThreads (in file Settings.hpp).


Reference

  • Żuk PJ, Kochańczyk M, Lipniacki T. Trajectory looping for sampling rare events in stochastic reaction–diffusion systems, Bioinformatics (submitted)