twin/pseudo-symmetry question
Hello all- I have a question regarding the possibility of twinning and or psuedo-symmetry. I have a crystal that processes well in R32, ie. less than 6% r-merge at 2.2 angstrom. We collected multiple potential derivatives and were able to get some phasing from one that was good enough to model in the helices. The strange thing is that when looking at the harker sections there were a lot of unexplained strong peaks. And there is a very narrow range of parameters that we could get the sites with Shelx and end up with interpretable maps(i know that this part might not be unusual). My first thought was that there could be twinning. Since there is no twin law for this space group, I thought that it might suggest that there was twinning in a lower symmetry space group and/or that we have a pseudo-R32 crystal. Processing works well in C2 and R3, as expected. In those cases Xtriage suggests that the real symmetry is likely to be R32. In the case of C-2, xtriage suggetss if it is twinned then it is about 45% twinned. That's the background. Fast forward with the help of phenix Autobuild and some manual correction, we have been able to refine the structure with the R32 processed data(with tls) to r_work = 0.1894, r_free = 0.2287, bonds = 0.007, angles = 0.908 at 2.2 angstrom with almost everything accounted for. I think I should be quite comfortable with this. BUT, I'm still wondering about those Harker sections. I've been reading the manuscripts of Dauter regarding twinning and I think I am at this point more curious than anything. But not leaving well enough alone, I went back to the data processed in C2. I can easily do molecular replacement, following refinement at 2.3 angstrom(I need a lot more frames for this sg so the cutoffs are a bit lower but r-merge is ~ the same as R32) with TLS I get r_work = 0.1850, r_free = 0.2249, bonds = 0.007, angles = 0.946. this doesn't seem significant to me. However if I refine with the two possible twin operations, the values drop with one being better than the other. both have the same starting model: r_work = 0.1639 r_free = 0.1880 bonds = 0.006 angles = 0.902 r_work = 0.1582 r_free = 0.1866 bonds = 0.006 angles = 0.891 The twin fraction from the refined pdb header is 0.544 for the better statistical model and the r-values seem too low for 2.3 angstrom data. Is the high twin fraction suggestive of the higher symmetry space group or inconclusive towards that. of interest is that the maps look better to me than the r32 maps. by that i mean that the maps don't have what i would call minor dots of negative density for some(not a lot) of the side chains as we see for some side chains in the R32 case. also some side chains look better in the C2 twin refined case. Lastly for some of the monomers(there are now 6 copies versus 2 in the R32 case), I can add an additional residue or two on the termini which are not seen before. I wonder if this also suggests that the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something? In the end, the models are essentially the same in either space group (with the exception of a couple of residues at the termini). So, I don't think I learn much one way or the other. But the curiocity keeps coming back. So is it likely to be R32 or C2 or C2 with a hefty twin fraction? Anyone have an explanation? Thanks in advance- Todd
On Fri, Nov 11, 2011 at 9:47 AM, Green, Todd
the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something?
I don't know enough to answer the rest of the question, but one thing to look out for is how you generate the R-free flags - which program did you use, and when you reprocessed in C2, did you generate a new set of R-free flags? If you're not careful with this, you can end up severely biasing R-free when performing twinned refinement. -Nat
For the R32 set, I generate the r-free in phenix directly from the scalepack file for data processed in R32. For C2, it was generated from the C2 processed scalepack file but in ccp4 using the scalepack2mtz script then imported to phenix(because i used molrep for the molecular replacement). But, I kept this same data file with the same r-free for all of the refinement runs before and during twin refinement. Seem reasonable?
-Todd
-----Original Message-----
From: [email protected] on behalf of Nathaniel Echols
Sent: Fri 11/11/2011 11:53 AM
To: PHENIX user mailing list
Subject: Re: [phenixbb] twin/pseudo-symmetry question
On Fri, Nov 11, 2011 at 9:47 AM, Green, Todd
the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something?
I don't know enough to answer the rest of the question, but one thing to look out for is how you generate the R-free flags - which program did you use, and when you reprocessed in C2, did you generate a new set of R-free flags? If you're not careful with this, you can end up severely biasing R-free when performing twinned refinement. -Nat _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
Hi Todd,
There are 3 possible C2's you can choose from. Try besides R32
R3
C2 (no 1)
C2 (no 2)
C2 (no 3)
P1
If you use labelit / imosflm you have full control over which
spacegroup you can choose. HKL2000 doesn't provide enough control over
which indexing solution you want to use.
Alternatively, you could have an 'obverse/reverse twin' in R32, a
special kind of reticular twin. In this twinning form, the hexagonal
sublattice causes the twinning. When you index your data, check if the
integral absences are indeed absent (when you are in the hexagonal
setting). If not, you could have an obverse reverse twin.
I have no clue about CCP4, but in phenix selecting the free set is
done properly using the full symmetry of the lattice. If this is done
incorrectly, you will get in trouble later down the line, so better
make sure that things are ok.
P
On 11 November 2011 10:03, Green, Todd
For the R32 set, I generate the r-free in phenix directly from the scalepack file for data processed in R32. For C2, it was generated from the C2 processed scalepack file but in ccp4 using the scalepack2mtz script then imported to phenix(because i used molrep for the molecular replacement). But, I kept this same data file with the same r-free for all of the refinement runs before and during twin refinement. Seem reasonable?
-Todd
-----Original Message----- From: [email protected] on behalf of Nathaniel Echols Sent: Fri 11/11/2011 11:53 AM To: PHENIX user mailing list Subject: Re: [phenixbb] twin/pseudo-symmetry question
On Fri, Nov 11, 2011 at 9:47 AM, Green, Todd
wrote: the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something?
I don't know enough to answer the rest of the question, but one thing to look out for is how you generate the R-free flags - which program did you use, and when you reprocessed in C2, did you generate a new set of R-free flags? If you're not careful with this, you can end up severely biasing R-free when performing twinned refinement.
-Nat _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- ----------------------------------------------------------------- P.H. Zwart Research Scientist Berkeley Center for Structural Biology Lawrence Berkeley National Laboratories 1 Cyclotron Road, Berkeley, CA-94703, USA Cell: 510 289 9246 BCSB: http://bcsb.als.lbl.gov PHENIX: http://www.phenix-online.org SASTBX: http://sastbx.als.lbl.gov -----------------------------------------------------------------
Regarding imosflm and full control, I remember reading about the variant choices for C2 somewhere. So somewhere I have the files from imosflm done this way. I just never went beyond the processing to see what happens. So there is room to check.
I'll also check for the integral absences to check for the obverse/reverse twin. If this is the case. Can I apply an twin operation to the refinement? I'll spend some time googling this.
Thanks also for the info regarding rfree selection (including you Nat) I can just reverse course and import the original scalepack file (or imosflm files rather) and follow the refinement as i just did.
If the current refinement in C2 (with twin) were correct, are the numbers r_work = 0.1582 r_free = 0.1866 too low to be reasonable?
Thanks -
Todd
-----Original Message-----
From: [email protected] on behalf of Peter Zwart
Sent: Fri 11/11/2011 12:28 PM
To: PHENIX user mailing list
Subject: Re: [phenixbb] twin/pseudo-symmetry question
Hi Todd,
There are 3 possible C2's you can choose from. Try besides R32
R3
C2 (no 1)
C2 (no 2)
C2 (no 3)
P1
If you use labelit / imosflm you have full control over which
spacegroup you can choose. HKL2000 doesn't provide enough control over
which indexing solution you want to use.
Alternatively, you could have an 'obverse/reverse twin' in R32, a
special kind of reticular twin. In this twinning form, the hexagonal
sublattice causes the twinning. When you index your data, check if the
integral absences are indeed absent (when you are in the hexagonal
setting). If not, you could have an obverse reverse twin.
I have no clue about CCP4, but in phenix selecting the free set is
done properly using the full symmetry of the lattice. If this is done
incorrectly, you will get in trouble later down the line, so better
make sure that things are ok.
P
On 11 November 2011 10:03, Green, Todd
For the R32 set, I generate the r-free in phenix directly from the scalepack file for data processed in R32. For C2, it was generated from the C2 processed scalepack file but in ccp4 using the scalepack2mtz script then imported to phenix(because i used molrep for the molecular replacement). But, I kept this same data file with the same r-free for all of the refinement runs before and during twin refinement. Seem reasonable?
-Todd
-----Original Message----- From: [email protected] on behalf of Nathaniel Echols Sent: Fri 11/11/2011 11:53 AM To: PHENIX user mailing list Subject: Re: [phenixbb] twin/pseudo-symmetry question
On Fri, Nov 11, 2011 at 9:47 AM, Green, Todd
wrote: the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something?
I don't know enough to answer the rest of the question, but one thing to look out for is how you generate the R-free flags - which program did you use, and when you reprocessed in C2, did you generate a new set of R-free flags? If you're not careful with this, you can end up severely biasing R-free when performing twinned refinement.
-Nat _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- ----------------------------------------------------------------- P.H. Zwart Research Scientist Berkeley Center for Structural Biology Lawrence Berkeley National Laboratories 1 Cyclotron Road, Berkeley, CA-94703, USA Cell: 510 289 9246 BCSB: http://bcsb.als.lbl.gov PHENIX: http://www.phenix-online.org SASTBX: http://sastbx.als.lbl.gov ----------------------------------------------------------------- _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
On Fri, Nov 11, 2011 at 11:05 AM, Green, Todd
If the current refinement in C2 (with twin) were correct, are the numbers r_work = 0.1582 r_free = 0.1866 too low to be reasonable?
They're not unheard of - the attached image shows the distribution of R-free values in the PDB for structures between 2.25 and 2.35 Angstrom resolution, and an R-free of 0.1866 does fall within this distribution. But it's low enough that you should probably be a little suspicious, and check to make sure you're not doing anything that might bias the statistic. (Note that the presence of NCS can also make it very difficult to obtain an unbiased R-free - you can try assigning the flags in thin resolution shells, which Phenix can also do, but this isn't a guaranteed solution.) -Nat
yes,
you should be able to use the obverse/reverse twin law when you refine
in the hexagonal setting.
Do the intensity statistics indicate that data is twinned though?
Your C2 refinement with its low r value can be due to bias if the free
set hasnt been chosen carefully.
P
On 11 November 2011 11:05, Green, Todd
Regarding imosflm and full control, I remember reading about the variant choices for C2 somewhere. So somewhere I have the files from imosflm done this way. I just never went beyond the processing to see what happens. So there is room to check.
I'll also check for the integral absences to check for the obverse/reverse twin. If this is the case. Can I apply an twin operation to the refinement? I'll spend some time googling this.
Thanks also for the info regarding rfree selection (including you Nat) I can just reverse course and import the original scalepack file (or imosflm files rather) and follow the refinement as i just did.
If the current refinement in C2 (with twin) were correct, are the numbers r_work = 0.1582 r_free = 0.1866 too low to be reasonable?
Thanks - Todd
-----Original Message----- From: [email protected] on behalf of Peter Zwart Sent: Fri 11/11/2011 12:28 PM To: PHENIX user mailing list Subject: Re: [phenixbb] twin/pseudo-symmetry question
Hi Todd,
There are 3 possible C2's you can choose from. Try besides R32
R3 C2 (no 1) C2 (no 2) C2 (no 3) P1
If you use labelit / imosflm you have full control over which spacegroup you can choose. HKL2000 doesn't provide enough control over which indexing solution you want to use.
Alternatively, you could have an 'obverse/reverse twin' in R32, a special kind of reticular twin. In this twinning form, the hexagonal sublattice causes the twinning. When you index your data, check if the integral absences are indeed absent (when you are in the hexagonal setting). If not, you could have an obverse reverse twin.
I have no clue about CCP4, but in phenix selecting the free set is done properly using the full symmetry of the lattice. If this is done incorrectly, you will get in trouble later down the line, so better make sure that things are ok.
P
On 11 November 2011 10:03, Green, Todd
wrote: For the R32 set, I generate the r-free in phenix directly from the scalepack file for data processed in R32. For C2, it was generated from the C2 processed scalepack file but in ccp4 using the scalepack2mtz script then imported to phenix(because i used molrep for the molecular replacement). But, I kept this same data file with the same r-free for all of the refinement runs before and during twin refinement. Seem reasonable?
-Todd
-----Original Message----- From: [email protected] on behalf of Nathaniel Echols Sent: Fri 11/11/2011 11:53 AM To: PHENIX user mailing list Subject: Re: [phenixbb] twin/pseudo-symmetry question
On Fri, Nov 11, 2011 at 9:47 AM, Green, Todd
wrote: the true space group is C2. Are these r-values too low? does the difference in values according to each twin law mean something?
I don't know enough to answer the rest of the question, but one thing to look out for is how you generate the R-free flags - which program did you use, and when you reprocessed in C2, did you generate a new set of R-free flags? If you're not careful with this, you can end up severely biasing R-free when performing twinned refinement.
-Nat _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- ----------------------------------------------------------------- P.H. Zwart Research Scientist Berkeley Center for Structural Biology Lawrence Berkeley National Laboratories 1 Cyclotron Road, Berkeley, CA-94703, USA Cell: 510 289 9246 BCSB: http://bcsb.als.lbl.gov PHENIX: http://www.phenix-online.org SASTBX: http://sastbx.als.lbl.gov ----------------------------------------------------------------- _______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
This email was scanned with Mcafee's Anti-Virus appliance, but this is no guarantee that no virus exists. You are asked to make sure you have virus protection and that it is up to date.
_______________________________________________ phenixbb mailing list [email protected] http://phenix-online.org/mailman/listinfo/phenixbb
-- ----------------------------------------------------------------- P.H. Zwart Research Scientist Berkeley Center for Structural Biology Lawrence Berkeley National Laboratories 1 Cyclotron Road, Berkeley, CA-94703, USA Cell: 510 289 9246 BCSB: http://bcsb.als.lbl.gov PHENIX: http://www.phenix-online.org SASTBX: http://sastbx.als.lbl.gov -----------------------------------------------------------------
On Fri, Nov 11, 2011 at 10:03 AM, Green, Todd
For the R32 set, I generate the r-free in phenix directly from the scalepack file for data processed in R32. For C2, it was generated from the C2 processed scalepack file but in ccp4 using the scalepack2mtz script then imported to phenix(because i used molrep for the molecular replacement). But, I kept this same data file with the same r-free for all of the refinement runs before and during twin refinement. Seem reasonable?
I'm not totally certain about this, but I don't think scalepack2mtz will take the higher lattice symmetry into account. When you generate the flags in Phenix, by default reflections which are related by possible twin laws will have the same flag. If you don't do this, however, when you use twinned refinement you will effectively be refining against (most of) your test set, which leads to an artificially low R-free. (I'm probably butchering the terminology here, sorry.) What I would do instead is take the MTZ file from scalepack2mtz and get rid of the R-free flags, and let Phenix generate new ones - then start refining the MR solution with the new file. -Nat
participants (3)
-
Green, Todd
-
Nathaniel Echols
-
Peter Zwart