{"id":76,"date":"2015-07-01T18:12:09","date_gmt":"2015-07-01T18:12:09","guid":{"rendered":"http:\/\/statisticalbiophysicsblog.org\/?p=76"},"modified":"2022-11-02T21:07:21","modified_gmt":"2022-11-02T21:07:21","slug":"everything-is-markovian-nothing-is-markovian","status":"publish","type":"post","link":"https:\/\/statisticalbiophysicsblog.org\/?p=76","title":{"rendered":"Everything is Markovian; nothing is Markovian"},"content":{"rendered":"<p>The Markov model, without question, is one of the most powerful and elegant tools available in many fields of biological modeling and beyond.&nbsp; In my world of molecular simulation, Markov models have provided analyses more insightful than would be possible with direct simulation alone.&nbsp; And I\u2019m a user, too.&nbsp; Markov models, in their chemical-kinetics guise, play a prominent role in illustrating cellular biophysics in my online book, <a href=\"http:\/\/physicallensonthecell.org\/\">Physical Lens on the Cell<\/a>.<\/p>\n<p>Yet it\u2019s fair to say that everything is Markovian and nothing is Markovian &#8211; and we need to understand this.<\/p>\n<p>If you\u2019re new to the business, a quick word on what \u201cMarkovian\u201d means.&nbsp; A Markov process is a stochastic process where the future (i.e., the distribution of future outcomes) depends only on the present state of the system.&nbsp; Good examples would be chemical kinetics models with transition probabilities governed by rate constants or simple Monte Carlo simulation (a.k.a. Markov-chain Monte Carlo).&nbsp; To determine the next state of the system, we don\u2019t care about the past: only the present state matters.<\/p>\n<p><!--more--><\/p>\n<p>What about systems following deterministic dynamics, e.g., following classical equations of motion, Newton\u2019s laws?&nbsp; Let\u2019s say that such systems are \u201ceffectively Markovian\u201d in that the future depends only on the present state of the system.&nbsp; In such cases, we must include the full \u201cphase space\u201d of the system in the present state &#8211; i.e., velocities as well as positions of particles.<\/p>\n<p>Back to those contradictory claims &#8230;<\/p>\n<p><em>Everything is Markovian.<\/em>&nbsp; Really all the physics (and chemistry and biology) most of us are interested in, in fact, is Markovian or effectively so.&nbsp; Once the states and velocities\/momenta of all particles are accounted for, the future (or distribution of future outcomes) is determined.&nbsp; This is not to say that you <em>personally<\/em> know how to figure out the distribution &#8211; only that the distribution is determined.&nbsp; This is true for classical and quantum systems, stochastic and deterministic dynamics, you name it.&nbsp; The only trick is to be sure your definition of the present state includes all necessary information.<\/p>\n<p><em>Nothing is Markovian.<\/em>&nbsp; And yet, as you may know, for complex systems it is extremely challenging to build Markov models.&nbsp; For that matter, why are they called \u201cmodels\u201d anyway if the true dynamics are (effectively) Markovian as just claimed above?&nbsp; The answer is arguably in the eye of the beholder, or more precisely, in the subset of a system\u2019s phase space chosen for analysis.&nbsp; If only a subset of system coordinates is examined, or even if a discretized representation of the full set of coordinates is used, a systems behavior generally will <em>appear<\/em> non-Markovian.&nbsp; In such a sub-space, we will not be able to predict the distribution of future outcomes knowing only the present, because we are ignoring system details that matter in the fundamental (effectively Markovian) dynamics.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"  wp-image-79 aligncenter\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/uploads\/2015\/07\/markoveq-300x21.png\" alt=\"markoveq\" width=\"447\" height=\"45\"><\/p>\n<p>The idea that dynamics in sub-spaces appears to be non-Markovian is an old one.&nbsp; The most famous example I know of arises in Langevin dynamics.&nbsp; Standard Langevin dynamics is a Markovian process in the full phase space of all system configurational\/positional coordinates and velocities: the future depends only on the present.&nbsp; We can see this in the one-dimensional Langevin equation: <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/ql-cache\/quicklatex.com-2e1a5ab49445217165695fe9bb470b11_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#109;&#97;&#32;&#61;&#32;&#102;&#32;&#45;&#32;&#92;&#103;&#97;&#109;&#109;&#97;&#32;&#109;&#32;&#118;&#32;&#43;&#32;&#102;&#95;&#123;&#92;&#109;&#97;&#116;&#104;&#114;&#109;&#123;&#116;&#104;&#101;&#114;&#109;&#97;&#108;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"16\" width=\"192\" style=\"vertical-align: -4px;\"\/> where <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/ql-cache\/quicklatex.com-4de02fc502ed5dbd15f371728ea270a3_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#103;&#97;&#109;&#109;&#97;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"10\" style=\"vertical-align: -4px;\"\/> is the \u201cfrictional frequency\u201d opposing motion and <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/ql-cache\/quicklatex.com-77ac894b1135d369d296da3c6deb89a9_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#102;&#95;&#123;&#92;&#109;&#97;&#116;&#104;&#114;&#109;&#123;&#116;&#104;&#101;&#114;&#109;&#97;&#108;&#125;&#8203;\" title=\"Rendered by QuickLaTeX.com\" height=\"16\" width=\"55\" style=\"vertical-align: -4px;\"\/> the random force due to thermal collisions; other symbols are standard.&nbsp; If we specify the present\/initial position <em>and <\/em>velocity, we specify the future distribution of outcomes.&nbsp; However, if we examine a standard Langevin trajectory in only positional space, the behavior will appear non-Markovian: the future will depend not only on the present but also on the past &#8211; i.e., on inertial effects not represented in instantaneous positional coordinates.&nbsp; In other words, to model the effect of inertia without velocities, one requires some previous history of the system to implicitly represent the velocity.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"  wp-image-80 aligncenter\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/uploads\/2015\/07\/nonmarkovimg-300x215.png\" alt=\"nonmarkovimg\" width=\"415\" height=\"300\"><\/p>\n<p>Sometimes non-Markov effects can appear in subtle ways.&nbsp; Imagine a simple two dimensional system, such as in the sketch, where our primary interest is in transitions in the x coordinate, from low to high values of x.&nbsp; Assume that the true dynamics are Markovian in x and y, so no inertia to worry about in this abstract example.&nbsp; Because we are only interested in transitions in x, it is natural to build a Markov model by binning in (subdividing) x and estimating transition rate among the bins.&nbsp; Such a model will become more accurate as the bins become smaller.&nbsp; But interestingly, for this energy landscape, Markovian behavior will never be recovered so long as the y coordinate is not also subdivided.&nbsp; Why?&nbsp; Because the probability to move left or right clearly will depend on whether the system is in the upper or lower energy valley: the upper one slopes downward to the right and the lower one slopes to the left.&nbsp; As this example suggests, to guarantee recovery of Markovian behavior, bins must become <em>uniformly<\/em> small in all coordinates.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"  wp-image-81 aligncenter\" src=\"https:\/\/statisticalbiophysicsblog.org\/wp-content\/uploads\/2015\/07\/nonmarkovimg2-300x176.png\" alt=\"nonmarkovimg2\" width=\"421\" height=\"253\"><\/p>\n<p>The (apparently) non-Markovian behavior doesn\u2019t emerge only in higher dimensional systems.&nbsp; Consider simple diffusion in one dimension, as exemplified in the x vs. t trajectory shown.&nbsp; Inspection of the trajectory shows that the transition probability from state 3 to 4 depends on the fact that the trajectory was previously in state 2.&nbsp; Because the trajectory is strictly diffusive and Markovian in x space, when it reaches the boundary of states 2 and 3, it is equally likely to return to 2 \u2026 but less likely to get to 4 due to the finite time required to transit across bin 3.<\/p>\n<p>What does this mean for \u201creal\u201d systems of interest?&nbsp; The challenge in typical complex systems is that one has only a finite amount of trajectory data, and so making small bins &#8211; especially ones that are small in all dimensions &#8211; is effectively impossible.&nbsp; There will not be enough transition counts among the bins to enable accurate estimation of the transition rates.&nbsp; Finite states will be intrinsically non-Markovian.<\/p>\n<p>Bottom line: In a finite discrete model of a complex system, you should expect the behavior to be non-Markovian.&nbsp; It is entirely a separate question &#8211; and a topic of current research (see references) &#8211; whether a Markov <em>approximation<\/em> to the non-Markovian behavior is sufficiently accurate.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Further reading<\/strong><\/p>\n<ul>\n<li>\u201cProgress and challenges in the automated construction of Markov state models for full protein systems,\u201d Gregory R. Bowman, Kyle A. Beauchamp, George Boxer and Vijay S. Pande, J. Chem. Phys. 131, 124101 (2009). <a href=\"http:\/\/dx.doi.org\/10.1063\/1.3216567\">http:\/\/dx.doi.org\/10.1063\/1.3216567<\/a><\/li>\n<li>\u201cConstructing the equilibrium ensemble of folding pathways from short off-equilibrium simulations,\u201d Frank No\u00e9, Christof Sch\u00fctte, Eric Vanden-Eijnden, Lothar Reich and, Thomas R. Weikl, Proc. Natl. Acad. Sci. 106, 19011\u201319016 (2009). <a href=\"http:\/\/www.pnas.org\/content\/106\/45.cover.gif\">doi:10.1073\/pnas.0905466106<\/a><\/li>\n<li>\u201cStructure-guided simulations illuminate the mechanism of ATP transport through VDAC1,\u201d Choudhary, O.P., A. Paz, J.L. Adelman, J.P. Coulletier, J. Abramson, and M. Grabe, Nat. Struct. Mol. Bio. 21: 621-632 (2014). <a href=\"http:\/\/www.nature.com\/nsmb\/journal\/v21\/n7\/full\/nsmb.2841.html\">http:\/\/www.nature.com\/nsmb\/journal\/v21\/n7\/full\/nsmb.2841.html<\/a><\/li>\n<li><em>Statistical Physics of Biomolecules: An Introduction,<\/em> Daniel M. Zuckerman, CRC Press, 2010.<\/li>\n<li><em>Stochastic Methods: A Handbook for the Natural and Social Sciences,<\/em> Crispin Gardiner, Springer, 2009.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>The Markov model, without question, is one of the most powerful and elegant tools available in many fields of biological modeling and beyond.&nbsp; In my world of molecular simulation, Markov models have provided analyses more insightful than would be possible with direct simulation alone.&nbsp; And I\u2019m a user, too.&nbsp; Markov models, in their chemical-kinetics guise, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":77,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11,8],"tags":[],"class_list":["post-76","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general-biophysics","category-trajectory-physics"],"_links":{"self":[{"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/posts\/76","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=76"}],"version-history":[{"count":9,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/posts\/76\/revisions"}],"predecessor-version":[{"id":476,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/posts\/76\/revisions\/476"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=\/wp\/v2\/media\/77"}],"wp:attachment":[{"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=76"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=76"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/statisticalbiophysicsblog.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=76"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}