Sandwalk: On the Evolution of New Enzymes: Completely Different Enzymes Can Catalyze Similar Reactions

Friday, August 03, 2012

On the Evolution of New Enzymes: Completely Different Enzymes Can Catalyze Similar Reactions

It's often quite difficult to imagine how a new enzyme activity could have evolved "from scratch." After all, aren't enzymes highly complex proteins with very specific folds? What's the probability of stringing together just the right amino acids by chance in order to get a new enzyme?

In many cases, new enzymes evolve from primitive enzymes that catalyzed similar reactions [see The Evolution of Enzymes from Promiscuous Precursors]. It's quite easy to see how this could happen by gene duplication and there are tons of examples.

But what about the first primitive enzymes themselves? Presumably, they evolved all on their own. When scientists think of this problem, they usually think in terms of evolving a specific modern enzyme. This looks like a long shot, similar to the probability that a specific person will win the lottery tomorrow. What they don't realize is that this is an unnecessarily restrictive scenario.

There are many possible ways of catalyzing a given metabolic reaction. What we see today is the "lucky winner"—the one enzyme that happened to win the lottery. There were many other possible enzymes that could have evolved and that makes the overall probabilities much more reasonable. There could be a million structurally distinct proteins capable of catalyzing a particular reaction. What you should be thinking about is the probability that any one of those possible enzymes will evolve and not the probability that a specific enzyme will evolve. It's like calculating the probability that anyone in a large city will win the lottery—a much more reasonable number.

Is there any evidence that many different enzymes can catalyze a similar reaction? Yes, there are plenty of examples of completely different enzymes, existing in different species, that catalyze the same reactions. Some of these examples are very well known, such as the two different aldolase enzymes of the glycolysis pathway [Aldolase in Gluconeogenesis & Glycolysis]. That example is covered in every biochemistry textbook.

There are also examples of parallel evolution in the citric acid cycle. At the beginning of the cycle, for example, there are two completely different enzymes catalyzing the formation of acetyl-CoA [Some Bacteria Don't Need Pyruvate Dehydrogenase]. Several other reaction in the pathway are catalyzed by different enzymes in different species of bacteria.

This suggests that early forms of life evolved several different enzymes for the same reaction although one of them might have taken over because it was more efficient (or lucky). Eugene Koonin calls this non-orthologous gene displacement (NOGD) and it's one of the reasons why the set of genes common to all species is surprisingly small [The Core Genome].

The reason I decided to write about this is the discovery of some new enzymes in cyanobacteria. Cyanobacteria are photosynthetic bacteria that have the same complex photosynthesis pathway as algae and plants. In fact, the chloroplasts of algae and plants are derived from cyanobacteria.

It's been known for a long time that cyanobacteria are missing one of the common enzymes of the citric acid cycle. It's enzyme #4 in the figure at the top of the post. The common name of this enzyme is α-ketoglutarate dehydrogenase but nowadays it's called by its more formal name in the scientific literature: 2-oxoglutarate dehydrogenase. Cyanobacteria also exhibit low levels of enzyme #5 in the pathway; succinly-CoA synthetase.

Together, these two enzymes catalyze these reactions.

A recent paper by Zhang and Bryant (2011) reports on the discovery of two new enzymes in cyanobacteria: α-ketoglutarate decarboxylase and succinic semialdehyde dehydrogenase. Together these two new enzyme catalyze the conversion of α-ketoglutarate to succinate. What this means is that cyanobacteria can complete the citric acid cycle using two enzymes that are completely different than the ones used in most other species. It's another example of parallel evolution.

This is one more example of the evolution of different enzymes catalyzing the same reaction. It's further evidence that the earliest forms of life may have evolved lots of different enzymes with similar functions and it suggests that the specific enzymes we see today are just the lucky ones that arose first. There were many other possible enzymes that could have done the job just as well.

[Photo Credit: Cyanobactreria]

Zhang, S., and Bryant, D.A. (2011) The tricarboxylic acid cycle in cyanobacteria. Science 334:1551-1553. [PubMed] [DOI: 10.1126/science.1210858]

76 comments:

BarbaraFriday, August 03, 2012 11:21:00 AM
Really cool!
ReplyDelete
Replies
AtheistoclastFriday, August 03, 2012 12:06:00 PM
Don't try and weasel out of this one, Larry.

The first paragraph still holds no matter how much you want to claim that different enzymes can end up performing similar reactions. Just to follow up, there 3.78*10^97 ways of forming a single protein domain consisting of 75 residues from all 20 amino acids. But only a tiny fraction are likely to prove functional in any way. That what the theory of evolution is up against when accounting for the elusive origins of these peptide sequences.
ReplyDelete
Replies
Richard EdwardsFriday, August 03, 2012 2:11:00 PM
That's really interesting. Do chaperones have any tendency to increase the likelihood of different protein sequences to converge on similar structures? (I know that they can buffer the affects of mutations.) I'm not suggesting that they somehow squash completely different sequences into the same structure but wondering whether they act to reduce the total structure space that the sequence space feeds into?
ReplyDelete
Replies
chemicalscumFriday, August 03, 2012 2:49:00 PM
Listen IDiot any random string of amino acids forming a peptide will have some enzymatic activity for a wide range of reactions. These activities will be very very low compared to modern functional enzymes. How for cells in the early biotic environment of 4M years ago (you know 4M-6K years before the creation and long before humans invented your god)such low level activities were metabolically valuable. If the enzyme is replicated by a nucleic acid template, then any point mutation in the template that produces a peptide with a small improvement for a specific metabolic function that is enough to enable the cells to reproduce faster than before, will give that cell a selective advantage. Natural selection will operate on this. Slowly bit by bit over deep time modern highly specific and highly active enzymes will evolve. Now go away.
ReplyDelete
Replies
DiogenesFriday, August 03, 2012 3:55:00 PM
But Atheistoclast is invoking what I call the "fallacy that fallacies don't matter." He's already been informed that it's a fallacy to compute the probability of a pre-specified outcome (this is sometimes called the Lottery Winner fallacy, or the bridge hand fallacy.)

What you have to do is, you have to compute the cumulative probability over all possible peptide or protein sequences that have any function, not this particular function.

And yet, while he knows this is a fallacy, still Atheistoclast responds with a typical bullprob calculation:

3.78*10^97 ways of forming a single protein domain consisting of 75 residues from all 20 amino acids

Which has so many problems, I don't know where to begin. That number isn't even right even if it was correct to pre-specify a highly specific enzymatic reaction. Even in the case of modern enzymes of specific functions, you can mutate maybe two-thirds of their amino acids without affecting function (this varies from protein to protein). So even for a pre-specified function, that's way, way off.

And of course it's Lottery Winner fallacy to pre-specify the enzyme function. To sum up, you need to compute cumulative probability over:

1. All functions that are beneficial
2. All sequences that produce each function from 1.

Which are huge corrections to Atheistoclast's bullprob.

Moreover, it's completely arbitrary that he pulled out 75 residues for the length of his protein. Protein domains can be functional with 35 residues.

Moreover, proteins are often built up from repeating motifs or elements, like secondary structure elements. A four-helix bundle fold or a beta barrel are all made out of repeating secondary structures. They could be assembled from many copies of short peptides. So even 35 is too high a threshold for a peptide of minimal function.

Atheistoclast knows that his bullprob is based on fallacious arguments. But he invokes the final fallacy, the "Fallacy that fallacies don't matter."

Sure, his probability is off by an astronomically huge number. But-- it's very small! So, a very small number must disprove evolution, right? Even if it's computed with astronomically large errors built in.

Very bad math doesn't matter when it gives big numbers!
ReplyDelete
Replies
Mikkel Rumraket RasmussenSaturday, August 04, 2012 3:24:00 AM
Hence, I am justified in using the example of conserved protein domains.
No you aren't because your conserved domain constitutes a "hilltop" selection originally had to either climb to from some nearby valley(meaning that the reason it's conserved is that it's best at the job, not that it's the ONLY possible choice), or there's an alternative, possibly entirely different and unrelated catalytic activity "nearby" from which the extant function could be reached by drift.

It is true that within protein motifs, there are sub-motifs. But they all have to synergistically function together. You can reduce the function of a protein only so much.
Yes and then some of them will become non-function in their extant environments, or as they lose their original activity they gain a new one.

The recurrent theme in ALL your posts is that you keep assuming an absolutist view of protein sequence space where if the current function is broken, then the protein is completely dead and useless. This is wrong and everyone here has been telling you how and why now several times. I've been telling you this for ages, they've been telling you this for even longer on talkrational and you've been ignoring it on pandasthump for at least as long. Stop being so fucking dishonest.
ReplyDelete
Replies
Allan MillerSaturday, August 04, 2012 6:12:00 AM
The recurrent theme in ALL your posts is that you keep assuming an absolutist view of protein sequence space where if the current function is broken, then the protein is completely dead and useless.

A common feature of such arguments, trotted out and refuted ad nauseam from Hoyle onwards, is that people seem to mislead themselves by their mental model of 'protein space'. Protein and nucleic acid monomers are given letters, we know how unlikely it would be to get long meaningful English sentences to mutate into others, or arise from scratch, ergo evolution impossible. Someone has to write the sentences, and they have to be spelt rite.

But there is no requirement that the notional spaces containing all possibilities of n-letters-from-v-variants have any relationship in terms of their density of 'well-formed strings', nor the clustering of such strings. All such spaces have v^n positions, and the probability of randomly hitting a particular one is 1 in v^n. Simply by increasing v or n, one can increase the post-hoc wonderment massively. The chance of hitting that sequence is so tiny that if you wrote all the zeroes out ... yadda yadda yadda. The real statistic of interest is the proportion of 'well-formed strings' in the whole space. And of relevance to evolution is the local clustering of other functional strings. Who gives a shit if most of long-string protein space gives an amorphous blob?

If there were one amino acid in the world, and you picked n 'randomly', the chance of getting a specific string n bits long would be 1. But what use is such a peptide? Well, as Atheistoclast inadvertently illustrated, polymeric acids have properties. They can perform structural roles. Add an amino acid to the library, and you have 2^n possibilities. Another makes 3^n, and so on. These spaces rapidly get bigger, but they are never explored by lottery. So how small was the first peptide with catalytic functional value to the organism that made it? How much catalytic function (of some kind) was there in its neighbourhood?

That is, of course, the $64,000 question(s). But determination of phase space based upon a 20-acid library - because modern proteins incorporate 20 acids - is misleading. How does adding acids to the library - making v^n bigger - make evolution less likely?
ReplyDelete
Replies
Allan MillerMonday, August 06, 2012 1:58:00 PM
On functional specificity and sequence space, this paper is very interesting:

http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0015364

The authors constructed a library of 1.5 million peptides, designed for structure but not for function. The peptides were 102-acids in length. 4 'natural' genes of considerably greater length and structural complexity in E. Coli were deleted, and the library assayed on the basis of ability to get these mutants growing again.

And of their library of 1.5 million, they found 18 artifical sequences, with no designed homology to the deleted proteins, that were nonetheless able to rescue function. Bearing in mind that the search was restricted to functional analogues of just 4 proteins, this is quite an impressive hit rate. It would be interesting to know how many of E.Coli's genes could be individually replaced from this tiny segment of overall protein space.

I think this demonstrates that 'random' protein space is much more function-rich than many believe. The proteins were restricted in their sequential composition only in terms of residue polarity, which is important for folding functional protein catalysts. This is weighting the game slightly - but only in the same way evolution does. It takes working structures and rejigs them; it does not shake random amino acids up in a bag.
ReplyDelete
Replies
The whole truthTuesday, August 07, 2012 4:30:00 AM
atheistoclast said:

"I don't want to play games. What I personally believe, whether I am an atheist or a fundamentalist, doesn't matter."

Wow.

When it comes to your 'side' (IDiot creationism), ALL that matters is your beliefs and agenda. You IDiots don't really care about doing science, you only want to destroy it and replace it with your creationist/dominionist agenda.

EVERYTHING you IDiots do is "play[ing] games".

Like I said, you IDiots (and yes, this goes for you) are fundamentally dishonest, and your statements above prove it.

I used to wonder if you just might have something worthwhile to offer in your arguments regarding biology, but your recent comments on this site have thoroughly convinced me that you are just like all the other IDiots. And no, that is not a compliment.
ReplyDelete
Replies

Add comment