Summary: This module explains how to model a protein as a robotic manipulator and introduces the robotic path planning problem and algorithms for solving it, including the probabilistic roadmap method. Variations of the Probabilistic Roadmap Method have been employed for problems related to protein motion.
In the modules on protein kinematics and inverse kinematics, it was shown that, structurally and kinematically, proteins are very similar to a class of robotic manipulators consisting of arms connected by revolute joints. Because of this analogy, in the late 1990s, some robotics researchers began to speculate that methods developed for use with robots might also be applicable to the study of proteins. For the remainder of this module, the analogy between robots and proteins will be explored, and then a class of robotics algorithms, called path planners, that will be adapted in a later module to use with proteins, will be introduced.
Recall that a protein is a chain of amino acid residues. Each residue contributes two rotatable dihedral angles, designated φ and ψ, to the main chain of the protein, and may additionally have a side chain with up to five rotatable dihedral angles. Under the rigid geometry simplification, these rotatable dihedral angles are the only degrees of freedom available to the protein.
| A Dihedral Angle |
|---|
![]() |
If we replace each bond by a rigid bar and each rotatable dihedral by a revolute joint, we can build a robotic linkage kinematically equivalent to a protein under rigid geometry.
| Work Space, Two-Dimensional Disc Robot |
|---|
![]() |
| Configuration Space, Two-Dimensional Disc Robot |
|---|
![]() |
In the above figures, the configuration space is two dimensional because the robot has two degrees of freedom. If the heading of the robot mattered (i.e., if the robot were not circular), then a configuration would consist of a position and an orientation. The configuration space would therefore be three dimensional. If the robot had a rotatable joint, this would add another degree of freedom and another dimension to the C-space.
The robotic path planning problem is, given a robot, a work space, and starting and goal configurations for the robot in the work space, find a collision-free path for the robot from the starting configuration to the goal, if one exists. Otherwise determine that no such path exists. An extensive introduction to the path planning problem and existing solutions may be found in [1] .
Early approaches to path planning included:
One solution to the scalability problem was to find methods whose run time does not depend on the dimensionality of the C-space, but on some other factor. This led to sampling-based path planning. Rather than making some explicit analysis of the whole C-space, sampling based planners built their representation of C-space by sampling random configurations and using a fast collision checker to determine whether they are in collision.
The basis of many modern sampling-based planners is the Probabilistic Roadmap Method (PRM) [cite] . Although the implementation details can become complicated, the basic algorithmic framework is quite straightforward and easy to understand.
| Configuration Space for Path Planning |
|---|
| Sampling |
|---|
| Connection |
|---|
| Query |
|---|
Unlike some slower methods, there is no guarantee that PRM will find a path if one exists. A different kind of guarantee is possible, however. While not complete, PRM is probabilistically complete. As the number of samples increases, the probability of the planner finding a path if one exists approaches 1. For many real-world path-planning problems, the method is very fast and reliable in practice.
PRMS are not the only kind of robotic path planner. Building a roadmap is a time-consuming process. The advantage of doing so is that once the roadmap is built, and assuming that the obstacles are not allowed to move, it can be used to rapidly plan an arbitrary number of paths. If the goal is only to find a single path, however, much of the effort of building the roadmap is wasted. It would be preferable to use a method that is concerned with connecting the start and goal configurations rather than covering the configuration space.
Such a class of sampling-based planners exist, and they are called tree-based or single-query planners. All of these planners take the same overall approach: They begin with the start configuration for the path planning query, and build a tree data structure of samples growing away from it, with a bias toward the goal configuration.
| Operation of a Tree-Based Planner |
|---|
![]() |
With adjustments, one can apply sampling-based robotic path planning to study protein motion. First and foremost, the configuration space, which sorts conveniently into occupied and free configurations, must be replaced by the more complicated molecular conformation space. Each conformation of a molecule has a free energy, which depends on chemical and physical interactions between its component atoms and those of any other molecules (such as solvent molecules) that may be nearby. This free energy is related to the probability of a protein being in a conformation. Thus, instead of dealing with the binary problem of colliding and free conformations, the conformation space is one of continuously varying probabilities. This problem may be simplified by the use of energy cutoffs, but it is difficult to decide, for any given problem, what energy is so high that a conformation is effectively in collision. Free energy and how it is incorporated into the study of molecular motion is discussed in more detail in Motion Planning for Proteins: Biophysics and Applications.
Choset, Howie, Kevin M. Lynch, Seth Hutchinson, George Kantor, Wolfram Burgard, Lydia E. Kavraki and Sebastian Thrun. Principles of Robot Motion: Theory, Algorithms, and Implementations. Cambridge, MA: MIT Press, 2005. Chapter 1 for introduction to robotics and analogy between robots and biomolecules. Chapter 7 for a summary of sampling-based planners.