Sandwalk: Are lncRNAs really mRNAs in waiting?

Monday, September 22, 2014

Are lncRNAs really mRNAs in waiting?

Biology News Net has become a joke. It's rare to see a paper that it hasn't mangled or a press release that it hasn't fallen for, hook line and sinker. I read it for amusement.

A recent report began with ... [Parts of genome without a known function may play a key role in the birth of new proteins]

Researchers in Biomedical Informatics at IMIM (Hospital del Mar Medical Research Institute) and at the Universitat Politècnica de Catalunya (UPC) have recently published a study in eLife showing that RNA called non-coding (lncRNA) plays an important role in the evolution of new proteins, some of which could have important cell functions yet to be discovered.

That sounds intriguing. Maybe I should read the paper even though it's in eLife.

It took a little more work than I expected, but eventually I found the paper (Ruiz-Orera et al., 2014). Here's the abstract.

Deep transcriptome sequencing has revealed the existence of many transcripts that lack long or conserved open reading frames (ORFs) and which have been termed long non-coding RNAs (lncRNAs). The vast majority of lncRNAs are lineage-specific and do not yet have a known function. In this study, we test the hypothesis that they may act as a repository for the synthesis of new peptides. We find that a large fraction of the lncRNAs expressed in cells from six different species is associated with ribosomes. The patterns of ribosome protection are consistent with the translation of short peptides. lncRNAs show similar coding potential and sequence constraints than evolutionary young protein coding sequences, indicating that they play an important role in de novo protein evolution.

The study suggests that a lot of "noncoding" RNAs are being translated. The products appear to be short polypeptides of less than 100 residues.

New protein encoding genes do arise from time to time although the number of proven examples is very small. Let's assume, for the sake of argument, that a new gene arises about once every million years in a given lineage. That would mean about five new genes in humans since they split from chimpanzees and that seems about right for an upper limit.

Now, if you make a lot of junk RNAs by randomly transcribing junk DNA, then some of them will undoubtedly make short polypeptides. There's a chance that random mutations will create a peptide that takes on a functional role of some kind. There's an even smaller chance that this function will confer a selective advantage on the individual carrying the mutation. That's one way new genes are born.

Is this a reason for carrying a huge amount of junk DNA in your genome and making thousands of lncRNAs? Is the potential to make a new gene one million years in the future sufficient explanation for the preservation of junk DNA? The answer is "no."

You don't have junk DNA because it might proven useful in the future. You have it because you can't get rid of it. You don't transcribe your junk DNA because it might be useful, you transcribe it because the general properties of RNA polymerase and transcription factors don't allow for perfect discrimination between real genes and junk DNA. Junk transcripts aren't translated because they contain potential coding regions, they are sometimes translated because they must, by chance, contain some open reading frames.

Sloppiness might, by accident, lead to new genes but that's not why things are sloppy. If having junk DNA were a clear advantage for future evolution then the genomes of all extant lineages should have lots of junk DNA and should make lots of lncRNAs.

Ruiz-Orera, J., Messeguer, X., Subirana, J.A., and Alba, M.M. (2014) Long non-coding RNAs as a source of new peptides. eLife 2014;3:e03523 [doi: 10.7554/eLife.03523]

20 comments :

Jonathan Badger said...: If sloppiness is never useful, then how did bacterial mutator alleles evolve? If accuracy is paramount, then these alleles would have been eliminated. Yes, evolution can't "think ahead", but it doesn't have to if during crises, the "sloppier" organisms simply preferentially survive. I'm as tired as you with ENCODE-style defenses of the utility of junk DNA, but arguing that sloppiness is always a mistake doesn't agree with the data.; Monday, September 22, 2014 3:10:00 PM
Joe Felsenstein said...: Note that if you translate random DNA, you get a stop codon with probability 3/64 each time, so the resulting proteins should have a mean length of (64/3)-1 = 20 amino acids. It changes a bit if the bases do not have equal frequency, but not by much. So any such proteins should be short.

Or do I misunderstand?; Monday, September 22, 2014 3:25:00 PM
Larry Moran said...: No, I think that's about right. Of course they are selecting for those lncRNAs that have "long" open reading frames because they are looking at ribosome protection. If you have 10,000 lncRNAs and each one is 1000 bp in length, then how many are going to encode a 100 residue protein?

That sounds like something you (but not me) could calculate.; Monday, September 22, 2014 5:14:00 PM
Larry Moran said...: Are their any species or wild-type strains of bacteria that have fixed mutator alleles?; Monday, September 22, 2014 5:17:00 PM
Jonathan Badger said...: They certainly exist in nature and aren't just a laboratory curiosity. A study back in the 1990s by a group at the FDA found high rates of mutation and the mutator form of mutS in strains related to food illness (LeClerc, et al, Science 274:1208-1211, 1996), and more recently I've been to a couple of microbiome talks that have shown that cystic fibrosis patients have a higher rate of mutator strains in their lungs as well.; Monday, September 22, 2014 7:19:00 PM
Joe Felsenstein said...: Let me try. If the start is at the beginning of the lncRNA, then we just have to calculate what the chance is that none of the first 100 codons is a start codon. If the bases are assumed equal in frequency (OK, an oversimplification) and we have 3 possible stop codons we just need to compute (61/64) raised to the 100th power. That is 0.008222163 so on average 8 of the 1000 lncRNAs would code for a protein of length 100 or more.; Monday, September 22, 2014 7:41:00 PM
Joe Felsenstein said...: Typo: " ... that none of the first 100 codons is a stop codon."; Monday, September 22, 2014 7:42:00 PM
Robert Byers said...: Off thread but a quickie.
If this bio mag is a joke, probably, then why is it nor a accurate sample for creationists to say there is a general problem in understanding and investigating science matters especially in origin matters?
Why just this mag? A lot of them miss creationists excellent points?; Monday, September 22, 2014 8:14:00 PM
SRM said...: I imagine it would probably come down to what is meant by "creationist's excellent points".; Monday, September 22, 2014 8:20:00 PM
Cale B.T. said...: Prof. Moran, in your previous post, you wrote "what if 90% of all 10,000 lncRNAs have no function" . Just to clarify, was that just a hypothetical figure, or do you think that something like Struhl's 2007 estimate of 90% of lncRNAs in Saccharomyces cerevisiae resulting from the inefficiency of RNA polymerase II is also going to be a close estimate for human lncRNAs?; Monday, September 22, 2014 9:50:00 PM
Joe Felsenstein said...: The problem with all these " ... in-waiting" arguments is, what protects the sequence from being deleted in the meantime? I suppose that if they are just random sequences waiting to be expressed as random polypeptides, point mutation wouldn't hurt them, but random deletion would not be opposed. Saying that the species would not survive unless sequences like that were around is invoking a group selection (or even species-selection) mechanism. Which doesn't mean that it is wrong, but means that we have to think about the strength of that selection.

There is the same issue with "front-loading" arguments, with the additional weakness that the genes that are set up in order to be expressed billions of years later would be eroded continually by point mutation as well as eliminated by deletion, and nothing would oppose that.; Tuesday, September 23, 2014 12:52:00 AM
Larry Moran said...: It's just my best guess. The important point is that too many scientists are assuming that the value is much closer to 0%. It's bad enough to make unjustified assumptions but it's even worse when you don't realize that you are making an assumption.; Tuesday, September 23, 2014 11:00:00 AM
Barbara said...: Ah, yes. There was a reason I eventually started most biology classes with a list of assumptions we would make. Most assumptions were from physics. One was "In a time series, causes happen before the events they cause." Students would look at me like I was crazy; of course causes come before effects!

And here we have what skates very close to "Future useful genes cause lncRNA's." Random, unavoidable variation and erroneous transcription occasionally throwing up useful products seems more probable to me. lncRNA's aren't useful because some small percent of them may in the future become useful; that's just a nice side effect of their existence.; Tuesday, September 23, 2014 12:48:00 PM
Tom Mueller said...: Hi Larry – forgive me for resorting to an argumentum ad absurdum…

If having junk DNA were a clear advantage for future evolution then the genomes of all extant lineages should have lots of junk DNA and should make lots of lncRNAs.

Hmmm… Does it then follow that; If having nerve cells underlie the retina resulting in sharper vision with no blind spot were a clear advantage, then the eyes of all metazoans should have eyes just like squids.

I thought we were calling Creationists "IDiots" because there in fact is no evident teleology and evolution is often jerry-rigged.

I must be missing something here. What am I not following?; Tuesday, September 23, 2014 4:14:00 PM
Larry Moran said...: Hmmm… Does it then follow that; ...

No; Tuesday, September 23, 2014 5:10:00 PM
AllanMiller said...: It seems substantially less likely that random intergenic sequence can provide novel functional peptides than existing translated genes, introns or untranslated pseudogenes. These already possess the upstream and downstream sites necessary for transcription, cleavage, capping, export to the ribosome, and translation initiation - plus the kinds of motif that appear to make for successful enzymes. All of this would somehow have to come together in an 'intergene' before it can even get out the starting gate.

Only .02% of a random genome would be the sextuplet (?) AAUAAA necessary for binding CPSF, for example, which would also need a randomly GU-rich region, and an initiator methionine triplet with a random STOP sufficiently far from it to actually make something, with a consistently foldable product. Beyond these passive barriers, one would expect that the production of random catalysts was suppressed if anything, rather than being an evolutionary force selecting for retention of the potential.; Wednesday, September 24, 2014 4:40:00 AM
Larry Moran said...: I agree with you that creation of a new gene by this mechanism is extremely unlikely. Furthermore, there are very few examples (none?) so this is mostly speculation. Some scientists, who should know better, are being influenced by reports of "orphan" genes. These are actually putative genes that almost always turn out to be false alarms. If you were to believe all those false positive claims then new genes are popping up from junk DNA all over the place.; Wednesday, September 24, 2014 7:15:00 AM
Tom Mueller said...: Hi Larry,

I agree with you that the squid eye statement is a non sequitor.

But

To my understanding, your statement:

If having junk DNA were a clear advantage for future evolution then the genomes of all extant lineages should have lots of junk DNA and should make lots of lncRNAs.

... represents exactly the identical category of non sequitor. That's the part I don't get and clearly I must be missing something.; Wednesday, September 24, 2014 7:21:00 AM
Tom Mueller said...: OK Reality check

http://users.rcn.com/jkimball.ma.ultranet/BiologyPages/T/Transcription.html#lncRNA

some lncRNAs have been found to participate in the regulation of such diverse activities as
• splicing,
• translation,
• imprinting, and
• transcription. Two examples:
o XIST. XIST RNA, which contains thousands of nucleotides, inactivates one of the two X chromosomes in female vertebrates. [Discussion]
o Some lncRNAs participate in bringing the enhancer and promoter regions of genes close together ("looping" — View) to regulate gene transcription. (More)

I am fascinated by XIST lncRNA-mediated Barr body formation and wonder out loud whether lncRNA is in general crucial for another level of gene control often not considered in introductory textbooks… namely chromatin architecture in the nucleus.

I realize I am rehashing – but I am going to float this balloon again with premeditation aforethought; in order to have my exuberant naiveté reined in.

I thank any and all in advance for their patience and indulgence.

Perhaps chromosomes have their equivalent to tertiary and quaternary structure. Otherwise how does one explain constancy of karyotypes across primate lineages unless invoking positive selection?

This makes intuitive sense to me - Check out this link: http://phys.org/news/2013-09-x-shape-true-picture-chromosome-imaging.html; Wednesday, September 24, 2014 7:31:00 AM
AllanMiller said...: Otherwise how does one explain constancy of karyotypes across primate lineages unless invoking positive selection?

Is it that constant (compared to other equivalent groups)? Genuine question; I don't know either way and would be interested in your data.

One of the primary drivers of gross karyotype rearrangement in mammals appears to be the polarity of female meiosis, which appears to be stronger than conservative selection in those groups subject to frequent reversal.; Wednesday, September 24, 2014 8:37:00 AM

Quotations

The old argument of design in nature, as given by Paley, which formerly seemed to me to be so conclusive, fails, now that the law of natural selection has been discovered. We can no longer argue that, for instance, the beautiful hinge of a bivalve shell must have been made by an intelligent being, like the hinge of a door by man. There seems to be no more design in the variability of organic beings and in the action of natural selection, than in the course which the wind blows.Charles Darwin (c1880)

Although I am fully convinced of the truth of the views given in this volume, I by no means expect to convince experienced naturalists whose minds are stocked with a multitude of facts all viewed, during a long course of years, from a point of view directly opposite to mine. It is so easy to hide our ignorance under such expressions as "plan of creation," "unity of design," etc., and to think that we give an explanation when we only restate a fact. Any one whose disposition leads him to attach more weight to unexplained difficulties than to the explanation of a certain number of facts will certainly reject the theory.

Charles Darwin (1859)

Science reveals where religion conceals. Where religion purports to explain, it actually resorts to tautology. To assert that "God did it" is no more than an admission of ignorance dressed deceitfully as an explanation...

Peter Atkins

Quotations

The world is not inhabited exclusively by fools, and when a subject arouses intense interest, as this one has, something other than semantics is usually at stake. Stephen Jay Gould (1982)
I have championed contingency, and will continue to do so, because its large realm and legitimate claims have been so poorly attended by evolutionary scientists who cannot discern the beat of this different drummer while their brains and ears remain tuned to only the sounds of general theory. Stephen Jay Gould (2002) p.1339
The essence of Darwinism lies in its claim that natural selection creates the fit. Variation is ubiquitous and random in direction. It supplies raw material only. Natural selection directs the course of evolutionary change. Stephen Jay Gould (1977)
Rudyard Kipling asked how the leopard got its spots, the rhino its wrinkled skin. He called his answers "just-so stories." When evolutionists try to explain form and behavior, they also tell just-so stories—and the agent is natural selection. Virtuosity in invention replaces testability as the criterion for acceptance. Stephen Jay Gould (1980)
Since 'change of gene frequencies in populations' is the 'official' definition of evolution, randomness has transgressed Darwin's border and asserted itself as an agent of evolutionary change. Stephen Jay Gould (1983) p.335
The first commandment for all versions of NOMA might be summarized by stating: "Thou shalt not mix the magisteria by claiming that God directly ordains important events in the history of nature by special interference knowable only through revelation and not accessible to science." In common parlance, we refer to such special interference as "miracle"—operationally defined as a unique and temporary suspension of natural law to reorder the facts of nature by divine fiat. Stephen Jay Gould (1999) p.84

Quotations

My own view is that conclusions about the evolution of human behavior should be based on research at least as rigorous as that used in studying nonhuman animals. And if you read the animal behavior journals, you'll see that this requirement sets the bar pretty high, so that many assertions about evolutionary psychology sink without a trace.

Jerry Coyne
Why Evolution Is True

I once made the remark that two things disappeared in 1990: one was communism, the other was biochemistry and that only one of them should be allowed to come back.

Sydney Brenner
TIBS Dec. 2000

It is naïve to think that if a species' environment changes the species must adapt or else become extinct.... Just as a changed environment need not set in motion selection for new adaptations, new adaptations may evolve in an unchanging environment if new mutations arise that are superior to any pre-existing variations

Douglas Futuyma

One of the most frightening things in the Western world, and in this country in particular, is the number of people who believe in things that are scientifically false. If someone tells me that the earth is less than 10,000 years old, in my opinion he should see a psychiatrist.

Francis Crick

There will be no difficulty in computers being adapted to biology. There will be luddites. But they will be buried.

Sydney Brenner

An atheist before Darwin could have said, following Hume: 'I have no explanation for complex biological design. All I know is that God isn't a good explanation, so we must wait and hope that somebody comes up with a better one.' I can't help feeling that such a position, though logically sound, would have left one feeling pretty unsatisfied, and that although atheism might have been logically tenable before Darwin, Darwin made it possible to be an intellectually fulfilled atheist

Richard Dawkins

Another curious aspect of the theory of evolution is that everybody thinks he understand it. I mean philosophers, social scientists, and so on. While in fact very few people understand it, actually as it stands, even as it stood when Darwin expressed it, and even less as we now may be able to understand it in biology.

Jacques Monod

The false view of evolution as a process of global optimizing has been applied literally by engineers who, taken in by a mistaken metaphor, have attempted to find globally optimal solutions to design problems by writing programs that model evolution by natural selection.

Richard Lewontin

More Recent Comments

Monday, September 22, 2014

Are lncRNAs really mRNAs in waiting?

20 comments :