a novel algorithm to investigate domain motions in proteins.
This homepage contains the documentation of the algorithm.
It is recommended the user tests the script with the domains of interest assigned manually beforehand, then tries automatic "fas" partitioning with the resolution in the shellscript set between 50 and 100 (%). Finally the partitioning should be repeated in "slo" mode for selected cases and resolutions.
3. A brief description of the algorithm
The method will be published in the near future, please inquire about a preprint or reference at the e-mail above. The algorithm is separated in two parts: the "partitioning" and the "rotational fit" section. The "partitioning" part determines domains with preserved structure in the two compared coordinate sets, depending on a prespecified resolution. The method uses the least-squares fit method (W. Kabsch, 1976) as implemented in X-PLOR. A domain is found in a iterative procedure, in which poor matching residues are excluded from the domain and good matches are included. In the "slo" mode only the heaviest connected set is considered, maintaining the connectivity of the changing domain.
The "rotational fit" method attempts to locate a hingepoint and a rotation axis which characterize the transformation of the domain between main and comparison coordinate set as a hingerotation. The hingepoint could be anywhere on the axis, but is determined here, by construction, as closest point to the center of mass of the domain . The rotation about an axis without translation, in general, will not yield the closest fit of the Kabsch least-squares method, so the problem is to find the least-squares solution with the constraint that transformations are not allowed, only rotations about an unknown hingepoint. It turns out that this constrained problem is not easy to solve and the exact solution may be too expensive to compute, so an approximation is used in the algorithm. The accuracy of the approximation can be assessed by comparing the least-squares fit with the proposed rotational fit. Note that the (rmsd) error of the fit may be due to the error of the approximation OR to the constraint of not allowing translations.
The construction works as follows: A Kabsch least-squares fit yields an translation vector v of the COM and a rotation axis r with angle alpha. The rotation axis is then projected on the bisecting plane of v which yields a new rotation axis r' and a "projection angle" beta, defined by r and r', from which one can compute the new rotation angle alpha' = alpha * cos(beta). Using this projected rotation, one can construct a hingepoint on the bisecting plane. A rotation with r' and angle alpha' about the hingepoint then transforms the COM of the main set on the COM of the comp set. So the projection maintains (relative to the least-squares method) the removal of the COM difference between the sets, but approximates the rotation. The idea is that in hingebending motions there should be a relatively large COM separation |v|, and the rotation r should be almost parallel to the bisecting plane of v. Thus, in addition to the rmsd error of the fit, the validity of the approximation can be assessed by checking the angle beta, which should be small. One finds that the method works best for larger domains comprising several secondary structure elements.
4. Output files
There are three output files specified by the variables $oname $uname and $dname: pdb and psf files of the labeled structure, and the log file of the run. The pdb and psf files can be used to visualize the results of the algorithm: The data is labeled by segid's:
The dummy molecules show an arrow along the rotation axis with it's orientation representing the right-handed rotation about the axis. The hingepoint in the middle of the arrow is connected to the COM of the main and comparison coordinate set of the domain to illustrate the rotation angle. The rotation angle and other useful information about the run, the domains, and the accuracy of the rotational fitting can be found in the self-explanatory log file.
NOTE: The X-PLOR logfiles would contain several MBytes of data for each run, so the standard output is piped to /dev/null. The standard output should only be used for debugging of modified or augmented scripts.
5. Accuracy check and other useful info
The log file contains information about the accuracy of the fitting as outlined above in 3. The relative error (in percent) is computed as
It is recommended to run the cases within a range of resolutions between 50 and 100%. Running a particular system with a range of resolutions in "fas" mode, it was found that there exist one or more windows of optimum resolution where the relative errors were very small. Therefore it is recommended to try a range of resolutions first with the "fas" mode, find the window(s) of small error and then calculate selected resolutions in the window(s) in "slo" mode with a higher number of domains. The error of the domain fitting was found to decrease 5 times with "slo" partitioning due to the connectivity of the domains.
Recommended reading about classification of domain movements: Gerstein et al., Biochemistry 33 (1994), 6739-6749.
How to find hingeregions: For shear-type motions (see Gerstein et al.) the found effective rotation axis will intersect the boundary between the domains in most cases and the effective hinge region can be found at the intersection. In hinge-type motions, the axis will be parallel to the interface. To find 'real' hingeresidues, it is useful to investigate the proteinbackbone at the segment interface. A hingeresidue should be close to the proposed rotation axis. For this type of movement, the proposed hingepoint may be useful to find the hinge, but it should be clear that a hinge"point" can be anywhere on the axis.
Updates and changes may of the method be neccessary once in a while. To stay informed about changes send e-mail to the author. The known users will also receive a preprint of the upcoming paper. The author would appreciate 'bug reports' and any comments regarding the usefulness of the algorithm and strategies of usage. Please send your correspondence to firstname.lastname@example.org (NeXT-mail OK).
|Modified: Mon Sep 18 16:00:00 1995 GMT|
|Page accessed 6020 times since Sat Apr 17 21:34:38 1999 GMT|