Sandwalk: Does natural selection constrain neutral diversity?

Friday, April 17, 2015

Razib Khan is an adaptationist and he's discovered a paper that gets him very excited: Selectionism Strikes Back!

Here's the paper and the abstract.

Corbett-Detig, R.B., Hartl, D.L., Sackton, T.B. (2015) Natural Selection Constrains Neutral Diversity across a Wide Range of Species. PLoS Biology, published April 10, 2015. doi: 10.1371/journal.pbio.1002112

The neutral theory of molecular evolution predicts that the amount of neutral polymorphisms within a species will increase proportionally with the census population size (Nc). However, this prediction has not been borne out in practice: while the range of Nc spans many orders of magnitude, levels of genetic diversity within species fall in a comparatively narrow range. Although theoretical arguments have invoked the increased efficacy of natural selection in larger populations to explain this discrepancy, few direct empirical tests of this hypothesis have been conducted. In this work, we provide a direct test of this hypothesis using population genomic data from a wide range of taxonomically diverse species. To do this, we relied on the fact that the impact of natural selection on linked neutral diversity depends on the local recombinational environment. In regions of relatively low recombination, selected variants affect more neutral sites through linkage, and the resulting correlation between recombination and polymorphism allows a quantitative assessment of the magnitude of the impact of selection on linked neutral diversity. By comparing whole genome polymorphism data and genetic maps using a coalescent modeling framework, we estimate the degree to which natural selection reduces linked neutral diversity for 40 species of obligately sexual eukaryotes. We then show that the magnitude of the impact of natural selection is positively correlated with Nc, based on body size and species range as proxies for census population size. These results demonstrate that natural selection removes more variation at linked neutral sites in species with large Nc than those with small Nc and provides direct empirical evidence that natural selection constrains levels of neutral genetic diversity across many species. This implies that natural selection may provide an explanation for this longstanding paradox of population genetics.
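
To make the paradox in the abstract concrete, here is a minimal sketch of the neutral expectation. It assumes the standard diploid result that expected pairwise diversity is 4 x N x mu, treats effective size as tracking census size, and uses an illustrative mutation rate. The function name and all numbers are assumptions for illustration, not values from the paper.

```python
def expected_neutral_diversity(pop_size, mutation_rate):
    """Expected pairwise nucleotide diversity for a diploid population
    under the standard neutral model: pi = 4 * N * mu."""
    return 4 * pop_size * mutation_rate


# Assumed per-site, per-generation mutation rate (illustrative only).
MU = 1e-9

if __name__ == "__main__":
    # Population sizes spanning three orders of magnitude.
    for n in (10**4, 10**5, 10**6, 10**7):
        pi = expected_neutral_diversity(n, MU)
        print(f"N = {n:>12,d}   expected pi = {pi:.1e}")
```

Under these assumptions the predicted diversity rises in step with population size, which is the proportionality the abstract says is not seen in real data.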

It is impossible for someone like me to evaluate this paper. Can someone take a look to see if it's valid?

How many selective sweeps must there be every 50,000 years in order to remove substantial amounts of neutral diversity from junk DNA?

117 comments:

Unknown, Friday, April 17, 2015 6:09:00 PM
I've been waiting for the post on this one since I saw it and you fail to disappoint by posting this. My initial reaction is preserved in a mail I sent to a colleague after reading it:
"Interesting indeed, although not really surprising.
Basically if the distribution of s-values is roughly constant, then we would expect S=2Ns values to become more extreme, i.e. for |S| to go up. In this case neutral variation should decrease (although it should be noted that neutrality is used for cases where the gene effect on fitness is 0 and also where the fitness effect on genes is 0 and these are only the same when there is no linkage. Neutral variation in the gene->fitness sense should decrease, but neutral variation in the fitness->gene sense shouldn't).
A neat result is that this affects local substitution rates, but global rates are unaffected, i.e. this effect can disturb clocklike divergence for single genes, but it's unlikely to affect the clocklike divergence for larger regions."

As for Khan's comment: Meh. As noted above, this does not have an effect that would make the neutral null invalid, and clocklike behaviour is still expected for long time intervals and large genomic datasets.
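
The point about scaled selection in this comment can be illustrated with a small sketch. It uses Kimura's diffusion approximation for the fixation probability of a new mutation in a diploid population, assuming additive selection and an effective size equal to N. The function name and the chosen values of s and N are illustrative assumptions, and scaling conventions differ between authors (the comment writes S = 2Ns).

```python
import math


def fixation_probability(s, n):
    """Kimura's approximation for a new mutation at frequency 1/(2N):
    P = (1 - exp(-2s)) / (1 - exp(-4Ns)), with P = 1/(2N) when s == 0."""
    if s == 0:
        return 1.0 / (2 * n)
    if s > 0:
        # expm1 keeps precision when s is tiny.
        return math.expm1(-2 * s) / math.expm1(-4 * n * s)
    # For s < 0, rearrange to avoid overflow in exp() for large N.
    a, b = -2 * s, -4 * n * s
    return math.expm1(a) * math.exp(-b) / -math.expm1(-b)


if __name__ == "__main__":
    s = 1e-5  # same selection coefficient in every population
    for n in (10**3, 10**5, 10**7):
        p = fixation_probability(s, n)
        neutral = fixation_probability(0, n)
        print(f"N = {n:>10,d}  2Ns = {2 * n * s:8.2f}  "
              f"P_fix / P_neutral = {p / neutral:10.2f}")
```

With s held constant, the ratio to the neutral fixation probability stays near 1 in the small population and grows large in the big one, which is the sense in which the same distribution of s values gives more extreme scaled values as N increases.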

gnomon, Friday, April 17, 2015 9:25:00 PM
check out this paper to see experimental evidence for no junk
http://www.sklmg.edu.cn/Public/Uploads/attached/file/20140830/20140830063859_64243.pdf

Yuan, D., Zhu, Z., Tan, X., Liang, J., Zeng, C., Zhang, J., Chen, J., Ma, L., Dogan, A., Brockmann, G., Goldmann, G., Medina, E., Rice, A.D., Moyer, R.W., Man, X., Yi, K., Li, Y., Lu, Q., Huang, Y. and Huang, S. (2014) Scoring the collective effects of SNPs: association of minor alleles with complex traits in model organisms. Sci China Life Sci. 57:876-888.

Abstract:
It has long been assumed that most parts of a genome and most genetic variations or SNPs are non-functional with regard to reproductive fitness. However, the collective effects of SNPs have yet to be examined by experimental science. We here developed a novel approach to examine the relationship between traits and the total amount of SNPs in panels of genetic reference populations. We identified the minor alleles (MAs) in each panel and the MA content (MAC) that each inbred strain carried for a set of SNPs with genotypes determined in these panels. MAC was nearly linearly linked to quantitative variations in numerous traits in model organisms, including life span, tumor susceptibility, learning and memory, sensitivity to alcohol and anti-psychotic drugs, and two correlated traits poor reproductive fitness and strong immunity. These results suggest that the collective effects of SNPs are functional and do affect reproductive fitness.

gnomon, Friday, April 17, 2015 9:28:00 PM
also this paper in press with more evidence for the same point:

http://www.sciencedirect.com/science/article/pii/S0888754315000725

Zhu, Z., Man, X., Huang, Y., Xia, M., Yuan, D., and Huang, S. (2015) Collective effects of SNPs on transgenerational inheritance in Caenorhabditis elegans and budding yeast. Genomics, in press

Abstract
We studied the collective effects of single nucleotide polymorphisms (SNPs) on transgenerational inheritance in C. elegans recombinant inbred advanced intercross lines (RIAILs) and yeast segregants. We divided the RIAILs and segregants into two groups of high and low minor allele content (MAC). RIAILs with higher MAC needed less generations of benzaldehyde training to gain a stable olfactory imprint and showed a greater change from normal after benzaldehyde training. Yeast segregants with higher MAC showed a more dramatic shortening of the lag phase length after ethanol exposure. The short lag phase as acquired by ethanol training was more dramatically lost after recovery in ethanol free medium for the high MAC group. We also found a preferential association between MAC and traits linked with higher number of additive QTLs. These results suggest a role for the collective effects of SNPs in transgenerational inheritance, and may help explain human variations in disease susceptibility.

John Harshman, Friday, April 17, 2015 9:28:00 PM
Haven't looked at the paper yet, but doesn't it concern only sites linked to selected sites? That should still be a fairly small proportion of the genome, and only if there's been a selective sweep within the not too distant past. Are they saying that most of the average genome is linked to a locus that undergoes frequent selective sweeps?

gnomon, Friday, April 17, 2015 9:43:00 PM
Neutrality is only an assumption, and even worse, a counterintuitive one. Why would anyone take it seriously? It never really worked in explaining nature, as this mainstream paper made clear:

“Revisiting an old riddle: what determines genetic diversity levels within species?”

-- Leffler et al., 2012, PLoS Biology

W. Benson, Saturday, April 18, 2015 1:05:00 AM
The authors are saying that enough of the genome is affected by selective sweeps to put a ceiling on standing neutral variation in species that are numerically abundant. Since hitchhiking by neutral variation will speed up fixation, the loss of variation caused by sweeps may be compensated. The two effects, loss of variability and faster fixation, may cancel out such that the rate of neutral evolution is little affected. There will be work for theoretical population geneticists.
The article doesn’t explain very well why big populations should have faster adaptive evolution. They seem to imply that, in big populations, mutations with small favorable effects will be less affected by drift and tend to evolve in a more deterministic manner. I would propose (and it may be mentioned in the paper) that if adaptive evolution is as mutation-limited as the paper seems to show, a large population will have more adaptive mutations and more repetition of adaptive mutations than a small one. As a consequence, adaptive evolution, measured by the frequency of new adaptive mutations, will go faster in populations that are large. This was a major finding of Darwin in the Origin of Species: species that are widespread and abundant evolve to be exceptionally variable.
(typos corrected)
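
The mutation-limited argument in this comment can be put as a rough back-of-the-envelope formula. This is a sketch under stated assumptions, not something taken from the paper: $\mu_b$ is a hypothetical per-genome rate of beneficial mutation, $s$ is a small selection coefficient with $4Ns \gg 1$ so that a new beneficial mutation fixes with probability of about $2s$, and sweeps are assumed not to interfere with one another.

```latex
k_{\text{adaptive}}
\;\approx\;
\underbrace{2N\mu_b}_{\text{new beneficial mutations per generation}}
\times
\underbrace{2s}_{\text{probability each one fixes}}
\;=\; 4N\mu_b s
```

Under those assumptions the rate of adaptive substitution grows linearly with $N$, so larger populations would experience more sweeps per unit time.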

Larry Moran, Monday, April 20, 2015 10:31:00 AM
Here's a brief description of the molecular clock that was based on the first sequence comparisons in the 1960s: The Modern Molecular Clock.

The existence of an approximate molecular clock is not in doubt in spite of what Shi Huang (gnomon) is saying. He is speaking nonsense.

The explanation for an approximate molecular clock is that most of the fixed alleles are neutral and the rate of fixation is equal to the mutation rate. Since the mutation rate is approximately constant in different lineages, this gives rise to a relatively constant (stochastic) rate of fixation in each branch of the phylogenetic tree.

This explanation is consistent with everything we know about population genetics. The idea that the alleles (amino acid substitutions) are neutral fits with everything we know about protein structure and evolution. You would have to be crazy to reject all of that.
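
The fixation-rate argument in this comment can be written out in one line. This is a sketch of the standard textbook result, assuming a diploid population of size $N$, a neutral mutation rate $\mu$ per generation, and that each new neutral mutation starts at frequency $1/(2N)$. The symbols are introduced here for illustration and do not appear in the comment.

```latex
k
\;=\;
\underbrace{2N\mu}_{\text{new neutral mutations per generation}}
\times
\underbrace{\frac{1}{2N}}_{\text{probability each one fixes}}
\;=\; \mu
```

The population size cancels, which is why the neutral substitution rate is expected to track the mutation rate regardless of how large the population is.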

Mong H Tan, PhD, Monday, April 20, 2015 4:49:00 PM
LAM: "The existence of an approximate molecular clock is not in doubt in spite of what Shi Huang (gnomon) is saying. He is speaking nonsense."

On the contrary, I think you both are talking past each other: i.e., the Molecular Clock (or Neutral) theory vs the Maximum Genetic Diversity (MGD) hypothesis!?

While both theories/hypotheses are theoretically and scientifically sound -- especially from the mid-20th-to-21st-century biomolecular points of view -- neither the Neutral nor the MGD theory may be scientifically or deductively used to infer or prove the "evolutionary theory" of species by Natural Selection as first globally observed and speculated by Charles Darwin in the years 1831-1858! -- Despite the rhetorical claims by the Natural Selectionists or Neo-Darwinists since the late-19th-to-mid-20th century, I proclaimed several years ago that classical Darwinism (since 1859) is a philoscientific observation-analysis of sorts, that may be classified in the 20th-century Natural Phenomenology: a philoscientific observational theory that may not be empirically proven: thus the Macroevolution vs the Microevolution, forever!?

As I read through the research projects as proposed in the MGD hypothesis by SH, I thought that the project #3 -- when its complete data are obtained -- shall answer my proclamation above!?

Best wishes, MHT.

Tom Mueller, Tuesday, April 21, 2015 11:46:00 AM
Please correct me if I am wrong, but I must be missing something.

Neutral Theory does not deny the existence or importance of selection but rather questions the relative importance of selection vs. random drift as THE major driving force of evolution. I hope I have not over-simplified this all.

OK so far? … and the champions of Neutral Theory would therefore have NO problem with data suggesting that certain categories of lineages demonstrate enhanced importance of selection vs. other lineages (the majority presumably) that exhibit the contrary.

I hope I have got this correct so far.

It seems to me that large populations are more likely to demonstrate the “Allee Effect” than small populations.

But (and this is the important bit)

An Allee effect is by definition a positive association between absolute average individual fitness and population size over some finite interval.

http://www.nature.com/scitable/knowledge/library/allee-effects-19699394

So again, please correct me if I am wrong… but I may be missing something.

The Champions of Neutral Theory should have no problem with certain populations demonstrating enhanced selection compared to others if in fact they were exhibiting a positive Allee effect, a positive effect that by definition occurs in some but not all larger populations (ergo a trend).

I hope I am not hopelessly confused again.

Donald Forsdyke, Wednesday, April 22, 2015 11:17:00 AM
George Romanes, like Shi Huang,
Who calls himself Gnomon,
Was sad there were no praises sang,
For his favorite axiom.
Advanced in distant eighteen eighties,
But admired not by his maties.

So “Intuitive and self-evident” thought
Darwin’s young associate.
Yet, fruitless battle long he fought,
For “collective variation” postulate.
This today sounds not absurd,
Though “collective mutation” is the word.

But Thomas Huxley and Thiselt’n-Dyer,
On this issue ‘came quite testy,
Romanes’ axiom did not admire,
Deplored his lack of modesty.
Circled wagons round this crank,
‘Til they died off as in dictum Planck.

Yes, one by one they all died off,
Grim reaper’s selective sweep.
Romanes had won no single prof,
Ideas now fade in dusty heap.
Ignored by modern biometricians
All intent on neutral missions.

Said, for us to do our sums,
Need mutation sans adaptation.
And we can cite our neutral chums
Befuddle rest with long equations.
Then along came proud Gnomon
With reductio ad absurdum.

He knoweth not Akiyoshi Wada,
Nor Grantham’s Genome Hypothesis,
Yet casting doubt on Kimura,
To our ears his words are bliss.
And if you endure not poetry
See Notes and Rec R. S’ciety!

Forsdyke DR (2010) Notes & Records of the Royal Society 64:139-154.
http://rsnr.royalsocietypublishing.org/content/early/2009/10/27/rsnr.2009.0045.full.pdf+html

Greg Laden, Wednesday, April 22, 2015 2:50:00 PM
It is great to see someone sticking up for selection, because it is way more interesting than neutral process. But, I have two questions about the paper, pertaining to the species selected. Maybe three.

There are a lot of domestic species (or quasi domestic) in the data base, and thus, species with strong artificial selection.

My gut feeling is that there is a disproportionate number of species that tend to have larger than average changes in population size (boom/bust). This would mean that larger population size would be associated with relatively low diversity (initially) because of founder effects.

Third (maybe), I'm not sure if mixing entirely different reproductive patterns together is wise (i.e., looking at bees alongside bighorn sheep). Seems like you could get stung (or butted) that way.

Jmac, Wednesday, April 22, 2015 4:59:00 PM
Greg,

Larry no likes selection even if it is natural. As you can tell he is in love with the G-drift. There is a reason for it. Larry is not stupid and realized that natural selection can't account for the real evolution we all keep asking for the evidence for. I'm not going to comment on the latter part of my post. I'm sorry.

Tom Mueller, Friday, April 24, 2015 11:26:00 AM
Hello everybody

FYI - Carl Zimmer has jumped on the HERV-bandwagon

http://www.nytimes.com/2015/04/23/science/ancient-viruses-once-foes-may-now-serve-as-friends.html?smid=pl-share&_r=0

which if you remember segues from an earlier version

http://blogs.discovermagazine.com/loom/2012/06/14/we-are-viral-from-the-beginning/

Raising the vexing jDNA question (or would that be begging the question) in the public forum of the hoi polloi

I offer this merely as an FYI in passing.

Tom Mueller, Friday, April 24, 2015 12:34:00 PM
John,

I would appreciate any reaction you may care to offer regarding my suggestions about the Allee Effect above.

Thanks in advance for even considering my petition.

Tom Mueller, Friday, April 24, 2015 5:15:00 PM
Hi again John

You say:

The Allee effect doesn't predict large population sizes in the abstract, or larger population sizes than other species.

I agree, unless of course "species range" is employed as a "proxy" for Nc as was done in this paper.

Tom Mueller, Sunday, April 26, 2015 7:48:00 AM
just as a postscript:

Re: “body size”

Presuming identical Biomass – shoals of smaller fish by definition would demonstrate higher Nc than shoals of larger fish.

Tom Mueller, Sunday, April 26, 2015 3:05:00 PM
Of course I meant to say

Presuming identical Biomass – smaller species of fish that exhibit shoaling over a large range by definition would demonstrate higher Nc than larger species of fish under identical circumstances