More Recent Comments

Tuesday, December 22, 2015

Waiting for multiple mutations: Michael Lynch v. Michael Behe

Casey Luskin is trying to help out a university student by describing some important ID contributions to science [No ID Research? Let's Help Out This Iowa State Student].

One of those contributions is a paper by Michael Behe and David Snoke published eleven years ago in Protein Science (Behe and Snoke, 2004). I described the result in a previous post: Waiting for multiple mutations: Intelligent Design Creationism v. population genetics.

If Behe & Snoke are correct then modern evolutionary theory cannot explain the formation of new functions that require multiple mutations.

Cassey Luskin is aware of the fact that this result has not been widely accepted. He mentions one specific criticism:
In 2008, Behe and Snoke's would-be critics tried to refute them in the journal Genetics, but found that to obtain only two specific mutations via Darwinian evolution "for humans with a much smaller effective population size, this type of change would take > 100 million years." The critics admitted this was "very unlikely to occur on a reasonable timescale."
He's referring to a paper by Durrett and Schmidt (2008). Those authors examined the situation where one transcription factor binding site was disrupted by mutation and another one nearby is created by mutation. The event requires two prespecified coordinated mutations.

Durrett and Schmidt show that in Drosophila such event can occur on a timescale of about 4,300 years. In humans with an effective population size of 10,000 the event would take about 160 million years. The authors point out that such and event—two prespecified mutations—is very unlikely. They also point out that their estimates are 5 million times more likely that those given by Michael Behe in The Edge of Evolution.

This study is not very relevant to the evolution of new functions as described in Behe and Snoke (2004).

Casey Luskin ignored the more relevant criticism of Michael Lynch (Lynch, 2005). Lynch's paper was published in the same journal as the Behe & Snoke paper and it's a direct attack on that paper.

Lynch wrote the editors when the Behe & Snoke paper appeared asking if they would be receptive to a rebuttal in the form of a new research paper. The editors agreed, subject to peer review [see Editorial and position papers].

Lynch begins with:
In a recent paper in this journal, Behe and Snoke (2004) questioned whether the evolution of protein functions dependent on multiple amino acid residues can be explained in terms of Darwinian processes. Although an alternative mechanism for protein evolution was not provided, the authors are leading proponents of the idea that some sort of external force, unknown to today's scientists, is necessary to explain the complexities of the natural world (Behe 1996; Snoke 2003). The following is a formal evaluation of their assertion that point-mutation processes are incapable of promoting the evolution of complex adaptations associated with protein sequences. It will be shown that the contrarian interpretations of Behe and Snoke are entirely an artifact of incorrect biological assumptions and unjustified mathematical oversimplification.
He continues by pointing out that, contrary to their claims, Behe & Snoke were not modeling "Darwinian processes" because they were examining intermediates that were neutral with respect to selection and created a nonfunctional product. (The initial "compatible" mutations inactivated one copy of a duplicated gene pair.) Lynch concludes that, "... there is no logical basis to the authors' claim that observations from a non-Darwinian model provide a test of the feasibility of Darwinian processes."

This is standard criticism of ID proponents. It will be familiar to most Sandwalk readers. It proves that even when Intelligent Design Creationists should know better—like publishing a paper about population genetics—they still get confused by their own rhetoric.

Lynch's model differs from the Behe & Snoke model in several ways. He does not assume, for example, that a gene duplication event is required before mutations can accumulate. That's because Lynch does not assume that the initial compatible mutations are deleterious. He assumes that the mutations leading to amino acid substitutions are neutral.

This gives rise to seven different alleles and allele combinations that form the initial features of his model. He also assumes that there can be more than one way to change the function via a combination of such different mutations.

The vertical lines in the figure indicate "compatible" mutations that, in combination, will change the function. As we see in allele #2, some of them can occur before the gene duplication because they are not disruptive. This is different than the Behe & Snoke assumption.

Lynch retains several other assumptions in the Behe & Snoke paper:
To simplify the presentation as much as possible, the focus here is on a nonrecombining haploid genome (as assumed by Behe and Snoke), with the origin of a new adaptive function involving a two-residue interaction, for example, the disulfide bond between two cysteines. As in Behe and Snoke (2004), this adaptation is assumed to be acquired at the expense of an essential function of the ancestral protein, so that the new function can only be permanently established via gene duplication, with one of the copies maintaining the original function. The Behe-Snoke assumption that a selective advantage only results after both participating residues are in place is also adhered to.
Lynch quotes several papers to justify his assumption that the initial mutations are neutral, not disruptive. He concludes, correctly, that "... most proteins in all organisms harbor tens to hundreds of amino acid sites available for evolutionary modification prior to gene duplication."

A new variable, n is introduced. This is the number of codons that can potentially mutate to produce a new function. In Behe & Snoke this number was 1. If three mutations were required then there were only three possible codons that could mutate. In the Lynch model this number could be 10 or even 50.

Here's how Michael Lynch describes the process:
Successful establishment of the new function (neofunctionalization of one of the copies) requires the founding pair of linked gene duplicates to (1) initially attain a high frequency; (2) acquire the mutations essential to the expression of the new function (allelic types 6 or 7) while en route or subsequent to fixation; and (3) be preserved by positive selection subsequent to the origin of the new function. All three processes occur in parallel with a background production of null alleles. The two central issues to be resolved are then: (1) How frequently will a duplication event lead to neofunctionalization; and (2) How long will this take? Answers to these questions can be acquired by recursively following the population through the sequential steps of mutation, selection, and random sampling.
The model includes parameters for the gene duplication event as well as the standard variables such as mutation rate and population size. The mutation rate for amino acid substitution is 10-8, the same as in Behe & Snoke. The mutation rate for production of a null allele (deleterious mutation rate) is 10-6 per gene per generation.

Simulations were run using the requirement that two mutations are required for neofunctionalization (formation of a new gene). As you might expect, the probability is very sensitive to the total number of codons that can potentially be mutated. If there are 50 such sites (n=50), then fixation is almost assured for a population size of 10,000 individuals. If there are only two such sites, then you need a population size of 100 million to reach a reasonable probability of success.

Combining equations leads to simulations that estimate the time to neofunctionalization in generations.

Behe & Snoke claimed that with two mutations you need a population size of 1012 in order to fix the new allele in one million generations. The Lynch simulation shows that the new allele can be fixed in one million generations with population size of only 106 provided there are 50 potential mutable sites (n=50). That's a difference of six orders of magnitude.

Even with n=2 (as in Behe & Snoke) a population of one million could lead to fixation in 108 generations in the Lynch simulation whereas it would take 100 times as long according to Behe & Snoke. Recall that most of these simulations apply to single-cell, haploid organisms such as bacteria. In those populations the generation times can be measured in days. There might be 100 generations per year so 108 generations could be only one million years—a short time in the history of life.

There are three important differences between the two approaches. First, as already noted, Lynch assumes that the initial mutations are neutral. Second, Lynch assumes that there are multiple pathways to the formation of a new functional protein.

The third difference is more subtle. Behe & Snoke failed to take into account the fact that a linked pair of functional genes has a selective advantage because it now takes two null mutations to silence both copies. This effect doesn't make much difference in small population but in large populations it makes fixation of the new allele more probable.

According to Lynch, there are several reasons for thinking that his estimates of time to fixation are too high. You should read the paper to see what they are. (They're a little too complicated for the average Sandwalk reader, e.g. me.)

The bottom line is:
In summary, the conclusions derived from the current study are based on a model that is quite restrictive with respect to the requirements for the establishment of new protein functions, and this very likely has led to order-of-magnitude underestimates of the rate of origin of new gene functions following duplication. Yet, the probabilities of neofunctionalization reported here are already much greater than those suggested by Behe and Snoke. Thus, it is clear that conventional population-genetic principles embedded within a Darwinian framework of descent with modification are fully adequate to explain the origin of complex protein functions.
Casey Luskin may be unaware of this criticism but Michael Behe was not.

Behe and Snoke were invited to comment on the Lynch paper when it was published and they did (Behe and Snoke, 2005). They quibble about some of the wording in Lynch's article but the important part of their letter is:
Our paper (Behe and Snoke 2004) contains one simple result. When reasonable parameters are used with our model to estimate actual time scales or population sizes for the evolution of multi-residue (MR) protein features, they are unrealistically large. This implies that the model we chose, which is restricted to point mutations and assumes intermediate states to be deleterious, isn't a plausible evolutionary pathway. One must therefore look about for a new model. We did not rule out such a possibility; in our original article, we explicitly stated, “we should look to more complicated pathways, perhaps involving insertion, deletion, recombination, selection of intermediate states, or other mechanisms, to account for most MR protein features.”

In his Editorial (this issue), Professor Hermodson reports that comments sent to him assume a consensus, “Thus, intermediate states must also be assumed to be selected.” Some significant previous work does not make this assumption (Kimura 1985; Ohta 1989), but our paper supports such a consensus. This is a strong requirement—that not only the end products, but steps along the way to a multi-residue function, must be either selected or at least neutral. Michael Lynch makes a similar assumption. Our model posited necessary intermediate mutations to be deleterious in the unduplicated gene; Lynch's model assumes them to be neutral: “all 20 amino acids are equally substitutable in the intermediate neutral state” (Lynch 2005, this issue). All of his objections to our work stem from this difference.
In other words, Behe and Snoke deliberately choose a model that leads to unrealistic results but they are perfectly willing to accept other valid evolutionary models that seem much more reasonable.

Recall that Casey Luskin touts this paper as one of the few results from ID proponents that produce a genuine "scientific discovery." He notes that there was criticism but fails to mention the Lynch paper or the concession published by Behe and Snoke.

Isn't that strange?


Behe, M.J., and Snoke, D.W. (2004) Simulating evolution by gene duplication of protein features that require multiple amino acid residues. Protein science, 13:2651-2664. [doi: 10.1110/ps.04802904]

Behe, M. and Snoke, D.W. (2005) A response to Michael Lynch. Protein Science 14:2226-2227. [doi: 10.1110/ps.051674105]

Durrett, R., and Schmidt, D. (2008) Waiting for two mutations: with applications to regulatory sequence evolution and the limits of Darwinian evolution. Genetics, 180:1501-1509. [doi: 10.1534/genetics.107.082610]

Lynch, M. (2005) Simple evolutionary pathways to complex proteins. Protein science, 14(9), 2217-2225. [doi: 10.1110/ps.041171805]

218 comments :

«Oldest   ‹Older   201 – 218 of 218
judmarc said...

But it would be a nice touch if someone actually analyzed mutations in the context of a big developmental picture.

I can appreciate, to some degree, why nobody does.


Actually, quite a lot of people have done this over recent decades, and Dr. Moran has referred to this type of work, popularly known as "evo-devo" (evolutionary developmental biology). It's all over your local university library. It's just that as with so much of evolutionary fact and theory, you personally are blissfully unaware of it, and so you assume it doesn't exist. Before you make this particular mistake again (which is apparent to just about everyone on this blog except you) go to your local library and read a popular book or two by Dr. Sean B. Carroll, or if you're up for it, go to the local university library and start reading his published peer-reviewed journal articles.

Unknown said...

'If someone actually analyzed mutations in the contex of a big developmental picture.'

This is exactly what irks me about creationists. You think you're the 'only' person to think about these questions. In fact there's an entire field dedicated to this endeavor, but apparently PubMed or, shit, Google is outside your realm.

I used to think we needed better creationists, but in reality there are none.

Unknown said...

Sorry judmarc for some reason your comment wasn't showing until I 'just' posted mine.

Jmac said...

No probleomo Ryan,
judmarc can reconstruct anything in his lab or so he claims.

After all he is more intelligent than the processes that brought him to life from the matter that is just dumb and dead.
He'd better have better arguments than Moses, because he needs scientific, experimental proof for his beliefs or he will be marked as a blind believer.

judmarc said...

We're still waiting in the other thread, Eric, for the proof positive you claimed to have that Coyne was wrong.

Here's some probability math for you: The chances I'll win the $700 million PowerBall lottery versus the chances we'll ever see that proof from you. I say chances of the former are much greater than chances of the latter. Go ahead and prove me wrong. (Crickets chirping....)

Bilbo said...

Lutesuite, in that case, Behe wasn't "quibbling," but making an unsupported assertion. And given that Professor Moran is an expert in this field, he would know whether or not Behe's assertion was true or false. Therefore, Moran should have been, "Behe either quibbles or makes false assertions." Since Moran has refused to reply, I'm left wondering whether Moran knows that Behe made a true assertion and doesn't want to acknowledge it.

Larry Moran said...

Behe assumes that the intermediate steps in his model are deleterious. Lynch assumes that they are neutral and he quotes from numerous papers showing that somewhere between 10% and 50% of amino acid substitutions are neutral.

The Lynch model only assumes that the intermediates in the specific pathway are neutral. It does not assume that all substitutions are neutral.

Behe and Snoke criticize Lynch's calculations by saying ...

1. Experimental studies contradict Lynch's assumption of complete neutrality as a rule; the majority of amino acid substitutions decrease protein function.

This is just quibbling. Nobody is saying that the majority of substitutions are neutral. Lynch is only saying that the pathway doesn't have to go through a deleterious allele as Behe and Snoke assumed in their calculations.

Bilbo, do you honestly believe that Michael Behe and David Snoke could know more about population genetics than Michael Lynch? If so, there's no point in discussing this with you any more.

Faizal Ali said...

Therefore, Moran should have been, "Behe either quibbles or makes false assertions."

Now that's a perfect example of "quibbling," courtesy of Bilbo.

Bilbo said...

Interesting: At first, I didn't see a new reply from Professor Moran, nor my more recent reply to Lutesuite, so I posted a new reply to Lutesuite. When I did that, suddenly a second reply from Professor Moran showed up, so I deleted my recent reply to Lutesuite. But now I do not see the second reply from Professor Moran.

I'll assume that Moran's reply is here somewhere. I'll point out what I remember that he said: Behe is correct that most mutations are deleterious. So all of you who said that Behe was incorrect are, according to Moran, incorrect yourselves. Moran went on to say that Behe's point was irrelevant to Lynch's point, since Lynch was merely pointing out that a new function could be reached by neutral mutations, not that most mutations are neutral. Or at least I think that's what Moran said. And perhaps that's enough to refute Behe's point. I wouldn't know. But now I wonder: if most mutations are deleterious, how probable is it that new functions can be found by avoiding deleterious mutations?

Faizal Ali said...

Behe is correct that most mutations are deleterious.

Nope. Not what Larry said, and not even what Behe said.

Faizal Ali said...

But now I wonder: if most mutations are deleterious, how probable is it that new functions can be found by avoiding deleterious mutations?

What you should be asking: If not all amino acid substitutions are deleterious, then why would Behe propose a model that assumed they all were?

Faizal Ali said...

In any event, Bilbo, the answer to your question is provided by Michael Lynch in the article discussed above. It's not easy reading, to be sure. But you'd be better of saying you don't understand the answer to the question, rather than lying and saying it hasn't been answered.

The whole truth said...

Bilbo, as of right now I see a total of 4 comments by Larry in this thread. Strange things sometimes happen to comments so I have copied Larry's most recent comment and pasted it here so that you can be sure to see it:

Laurence A. Moran Sunday, January 10, 2016 12:45:00 PM

Behe assumes that the intermediate steps in his model are deleterious. Lynch assumes that they are neutral and he quotes from numerous papers showing that somewhere between 10% and 50% of amino acid substitutions are neutral.

The Lynch model only assumes that the intermediates in the specific pathway are neutral. It does not assume that all substitutions are neutral.

Behe and Snoke criticize Lynch's calculations by saying ...

1. Experimental studies contradict Lynch's assumption of complete neutrality as a rule; the majority of amino acid substitutions decrease protein function.

This is just quibbling. Nobody is saying that the majority of substitutions are neutral. Lynch is only saying that the pathway doesn't have to go through a deleterious allele as Behe and Snoke assumed in their calculations.

Bilbo, do you honestly believe that Michael Behe and David Snoke could know more about population genetics than Michael Lynch? If so, there's no point in discussing this with you any more.

The whole truth said...

I have some comments and questions for those of you who are pushing ID (i.e. your religious beliefs). You apparently believe that most or all mutations are harmful. If most or all mutations are harmful, why isn't everything extinct? Apparently you believe that repair processes heal some or many mutations. You rely on claims of the repair processes being too complex to have evolved, but aren't the parts of biological entities in which mutations occur just as complex as the repair process parts?

What is it that you are actually claiming? Are you claiming that all mutations are natural and harmful but that your chosen, so-called god reaches into some living things and heals some mutations? Are you claiming that harmful mutations are designed-created and are punishment because of 'the fall' and that your chosen, so-called god is delaying the extinction of some living things by healing some mutations? Are you claiming that mutations are natural but that the repair processes were/are designed-created at the origin of life? If you're claiming that the repair processes were/are designed-created by your chosen, so-called god, why aren't the repair processes perfect? If your chosen, so-called god designed-created all life, why are there any mutations, especially harmful ones?

Faizal Ali said...

Bilbo, as of right now I see a total of 4 comments by Larry in this thread.

Maybe that just shows Bilbo's Bible goggles are working.

Unknown said...

Nothing that wasn't to be expected: creationists are intellectually dishonest. End of story.

Unknown said...

Even if lynch's choice of words wasn't the best or he misrepresented a couple of aspects, this doesn't invalidate the results or the fact that the creationist claims are wrong. The difference, as mentioned by Behe himself is "largely due to opposing starting presumptions about whether mutations are deleterious."
This Darwinian model works and even Behe accepts it.

sidharth chhabra said...

I apologize for bringing back this post. I recently started to follow this debate and have few questions. I would appreciate if any of you scientists can help.

1. In the above figure mentioned from the Lynch's 2005 paper, if we were to apply it to human's population (population size ~ 10^4), the waiting time is 10^9 generations. Does that mean on average a two step mutation would take 10^9 generations? Btw in Lynch and Abegg (2010) Fig 4, they report similarly 10^8 generations.

2. Has any complex adaptation (requiring 2 or more mutations) being observed/found yet in case of human evolution from their LCA?

Thank you for the valuable time.

«Oldest ‹Older   201 – 218 of 218   Newer› Newest»