Re: [phenixbb] model vs Wilson b-factor

22 Apr 2010

      It's also true that the Wilson B calculation assumes that all the B factors
in the crystal are the same - which is also far from the true in most macromolecular
crystals.  A person who holds to the practice of aggressively building water
molecules and loops will create a model with a higher average B than one who
uses the same data but is more restrained.  The Wilson B will, of course, be
unchanged.

   If you have a protein that is equally ordered throughout and you do not
build in weak water molecules and the crystal diffracts high enough to allow
a reasonably accurate calculation of the Wilson B, your average B and and the
Wilson B should be close to each other.  If your protein has mobile loops,
which most lower resolution crystals do (and most higher resolution crystals
for that matter) then your average B will be larger than your Wilson B.

   Since the Wilson B and the average B are such different quantities I don't
believe there are any useful conclusions that can be made by comparing the
two.  If you want to see if your model is consistent with your Wilson B you
should calculate F squared values from it and calculate a Wilson B from those.
If the calculated Wilson B doesn't match the observed Wilson B your model has
a serious problem, but I expect that every refinement program will produce
models that match quite closely - even models that are quite wrong will at
least match the Wilson B.  A discrepancy between calculated and observed
Wilson B causes a horrible increase in R values (both kinds) which is easily
fixed by the refinement program adjusting whatever B values are being refined.

Dale Tronrud

On 04/21/10 15:01, MARTYN SYMMONS wrote:
...
Perhaps worth pointing out that that Wilson B is the based on the assumption of randomly distributed atoms. This is not at all how proteins are, and in particular secondary structures give a preponderance of spacings in the 4 angstrom-ish region and a peak of mean intensity in these shells. For this reason the apparent fall-off in the Wilson in this resolution range is steeper as you are falling down off of the peak due to this seondary structure giving favoured spacings that produce a deviation from randomness in this resolution range. So it will be dependent on the secondary structure of the an individual protein . So the Wilson gets about right when you deal with spacings that tend to be unbiased by secondary structure - which unfortunately is the bit that is missing in the low resolution crystal data. Wilson fall off in low resolution looks steep because the random assumption is invalid.
Maybe you can guess the secondary content of your protein from where the bump is in the Wilson plot - beta gives a bulge in the 4 ang region - alpha in the 5 to 9 ang region.
all the best
               Martyn
Martyn Symmons
Cambridge
----- Original Message ----
From: Pavel Afonine 
To: [email protected]; PHENIX user mailing list 
Sent: Wednesday, 21 April, 2010 18:56:47
Subject: Re: [phenixbb] model vs Wilson b-factor
Hi Gino,
here are a few points:
- my understanding (please correct me if I'm wrong) is that the accuracy of Wilson B estimate drops with the resolution: lower the resolution, less accurate is the estimate;
- Wilson B is not a given calculated value - it's just an estimate;
- the total atomic B-factor includes the trace of overall anisotropic scale matrix (see Fmodel formula for the total model structure factor: Fmodel = scale_overall * exp(-h*U_overall*ht) * (Fcalc + k_sol * exp(-B_sol*s^2) * Fmask) ). You can try to disable this and see if this was the cause (use "apply_back_trace_of_b_cart=true" keyword for this).
- the things you "tried to resolve this discrepancy" will unlikely to change the average B-factor;
- assuming that you used the proper model parameterization and refinement strategy given your model and data quality,  I would just accept these values as a matter of fact.
Pavel.
...
we've solved a large structure (~20,000 residues/asymm unit), with 4-fold ncs and diffraction data to 3.3A.
The Rfree/Rfac is ~28%-24% with OK geometry with no major outliers in the Ramachandran plot.
I would think I'm done (.. after 6 years!). 
However, my refined model b-factor (~130A2) is >> Wilson b-factor (~80A2). Obviously I'm not too happy with it.
Here is what I tried to resolve this discrepancy: --> play with wxu_scale --> play with B-factor weight in ncs restraint (4-fold ncs)
--> play with number of macrocycles --> Redefine tls groups
So far nothing really works, except switching from
individual_adp to group_adp. However, this increases my Rfree by almost 3%.
Any ideas?
_______________________________________________
phenixbb mailing list
[email protected]
http://phenix-online.org/mailman/listinfo/phenixbb
_______________________________________________
phenixbb mailing list
[email protected]
http://phenix-online.org/mailman/listinfo/phenixbb