[gmx-users] [Fwd: please help me with PCA questions]
David van der Spoel
spoel at xray.bmc.uu.se
Wed Nov 24 09:30:22 CET 2004
-------- Forwarded Message --------
From: Jinzhi Tan <jztan at mail.shcnc.ac.cn>
Reply-To: jztan at mail.shcnc.ac.cn
To: spoel at xray.bmc.uu.se
Subject: please help me with PCA questions
Date: Wed, 24 Nov 2004 9:29:43 +0800
Dear Prof.van der Spoel,
I am a gmx-user. A few days ago, I ask some questions on gmx-user list. I think maybe they are some very stupid questions for nobody would answer them. I hope you can help me. Thank you very much! Please see the followings:
After I run the conventional MD simulation for several nanoseconds,I want to do PCA. I encountered some problems.
Firstly, How long time should I run the conventional MD when I try to do PCA? as long as possible? I was told that the samples in the conformational space will be enough if the simulation time is long enough. But I am not sure it does work because I found some loops are mobile and they moved just at the first several hundreds picoseconds and then they hold the new position for a long time. I wonder if they can come back to their original conformation if I run long MD simulation? Another case is the protein unfolding. Some papers reported the protein unfolding after a long MD time (several nanoseconds), but I wonder if the time is long enough, the protein can fold automatically. What do we think about the effect of the force field?
Secondly, which time should I select as the initial time of PCA? Should I select the time when the RMSD of the protein tends to be level off after about two nanosecond or should I select the whole MD simulation time? But in some papers, they just run one nanosecond in total and then do PCA? Is it correct?
Thirdly, I used two methods to analyze the first eigenvector and got different results? I am not sure why they are different? If I use: g_anaeig -v eigenvec.trr -first 1 -last 1 -extr vec1_extreme.pdb, I got the following result:
1 eigenvectors selected for output: 1
Last frame 9445 time 9445.000
eigenvector Minimum Maximum
value time value time
1 -6.273994 454.0 5.266299 9429.0
Writing 2 frames along eigenvector 1 to vec1_extreme.pdb
When I use: g_anaeig -v eigenvec.trr -first 1 -last 8 -extr vec18_extreme.pdb, I got:
8 eigenvectors selected for output: 1 2 3 4 5 6 7 8
Last frame 9445 time 9445.000
eigenvector Minimum Maximum
value time value time
1 -6.273994 454.0 5.266299 9429.0
2 -4.850856 11.0 4.864636 5113.0
3 -2.722965 6113.0 2.619274 2238.0
4 -2.837103 3826.0 2.447154 8460.0
5 -3.493261 7502.0 2.076011 778.0
6 -2.219512 5995.0 2.655742 489.0
7 -1.916822 5302.0 2.395802 2613.0
8 -2.154755 62.0 1.883655 7235.0
Writing 2 frames along eigenvector 1 to vec18_extreme1.pdb
Writing 2 frames along eigenvector 2 to vec18_extreme2.pdb
Writing 2 frames along eigenvector 3 to vec18_extreme3.pdb
Writing 2 frames along eigenvector 4 to vec18_extreme4.pdb
Writing 2 frames along eigenvector 5 to vec18_extreme5.pdb
Writing 2 frames along eigenvector 6 to vec18_extreme6.pdb
Writing 2 frames along eigenvector 7 to vec18_extreme7.pdb
Writing 2 frames along eigenvector 8 to vec18_extreme8.pdb
So what is the mean of "value"? Is the time corresponding to the real simulation time? But I check the snapshot at 454.0 ps,vec1_extreme.pdb (select the minimal) and vec18_extreme1.pdb (select the minimal), they are not the same! So what is meaning of the time?
For the two results, the information of first eigenvector is the same (as above), but actually the vec1_extreme.pdb and vec18_extreme1.pdb is different. Should they be the same?
I am not sure if I am confused about the basic theory of PCA or make some other mistakes. Hope you can give me some advice. Thank you very much!
Best wishes,
Jinzhi Tan
2004-11-10
************************************
E-mail: tanjinzhi at hotmail.com
jztan at mail.shcnc.ac.cn
************************************
--
David.
________________________________________________________________________
David van der Spoel, PhD, Assoc. Prof., Molecular Biophysics group,
Dept. of Cell and Molecular Biology, Uppsala University.
Husargatan 3, Box 596, 75124 Uppsala, Sweden
phone: 46 18 471 4205 fax: 46 18 511 755
spoel at xray.bmc.uu.se spoel at gromacs.org http://xray.bmc.uu.se/~spoel
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
More information about the gromacs.org_gmx-users
mailing list