More Recent Comments

Showing posts with label Genes. Show all posts
Showing posts with label Genes. Show all posts

Sunday, January 27, 2019

Yeast loses its introns

Baker’s yeast (Saccharomyces cerevisiae) is one of the best studied eukaryotes. Its genome is just slightly larger than the largest bacterial genome and it was the first eukaryotic genome to be sequenced (Mewes at al., 1997). It has about 7000 genes in total and 6,604 of these genes are protein-coding genes but only 280 of these genes contain introns.1 The rest have lost their introns over the course of several hundred million years of evolution (Hooks et al., 2014).

We know that introns have been lost in yeast because the genes of related species have lots of introns. The common ancestor of all fungi undoubtedly had genes with multiple introns because the available evidence indicates that introns invaded eukarotic genes very early in the evolution of eukaryotes. The fact that most introns have been purged from the yeast genome suggests that introns are not essential for gene function. In other words, introns are mostly junk.2

Wednesday, January 23, 2019

What happens when twins get their DNA tested?

The Canadian Broadcastng Company (CBC) has a TV show called Marketplace that promotes itself as an advocate of consumers' rights. It has a history of testing the claims of advertisers and usually shows that these claims are misleading or false. Here's what they say on their website.
On air since 1972, Marketplace is Canada’s consumer watchdog. We get the goods to help you shop smarter and protect yourself from slick scams and misleading marketing claims. We investigate the products and services we all use every day and push companies and government for answers. And we expose the truth on stories that matter to you and your family.

Saturday, December 08, 2018

The persistent myth of alternative splicing

I'm convinced that widespread alternative splicing does not occur in humans or in any other species. It's true that the phenomenon exists but it's restricted to a small number of genes—probably fewer than 1000 genes in humans. Most of the unusual transcripts detected by modern technology are rare and unstable, which is consistent with the idea that they are due to splicing errors. Genome annotators have rejected almost all of those transcripts.

You can see links to my numerous posts on this topic at: Alternative splicing and the gene concept and Are splice variants functional or noise?.

Wednesday, December 05, 2018

The textbook view of alternative splicing

As most of you know, I'm interested in the problem of alternative splicing. I believe that the number of splice variants that have been detected is perfectly consistent with the known rate of splicing errors and that there's no significant evidence to support the claim that alternative splicing leading to the production of biologically relevant protein variants is widespread. In fact, there's plenty of evidence for the opposite view; namely, splicing errors (lack of conservation, low abundance, improbable protein predictions, inability to detect the predicted proteins).

My preferred explanation is definitely the minority view. What puzzles me is not the fact that the majority is wrong () but the fact that they completely ignore any other explanation of the data and consider the case for abundant alternative splicing to be settled.

Monday, November 19, 2018

Latest Tango in Halifax

I've known Yana Eglit for many years. She frequently posts comments on this blog but you won't recognize her name because she uses a pseudonym.1 Yana is a graduate student in the lab of Alastair Simpson at Dalhousie University in Halifax, Nova Scotia, Canada. A few years ago she saw some strange organisms dancing in a Petri dish.2

The microorganisms belong to the group Hemimastigophora. Yana found them in a clump of dirt she picked up while hiking near Halifax. They named the species Hemimastix kukwesjijk. The group only contains a few other species.

Hemimastigophora is one of those protist groups that have been difficult to classify and difficult to place relative to other protists. It's traditionally been given the status of a phylum but its position in the eukaryotic tree was ambiguous.

The Simpson lab, in a collaboration with Andrew Roger's group, sequenced a number of genes (transcripts) from H. kukwesjijk and another species that they recently identified (Spirenema). The datasets contained samples of about 300 genes of each species. Trees constructed with this dataset place the Hemimastigophora near the base of the eukrayotic tree as a sister group to Diaphoretices. The work was published in a recent issue of Nature (Lax, Egrit, et al., 2018).

The details of eukaryotic taxonomy and the various subdivisions needn't concern us but the important take-home lesson is that there are a huge number of protists forming diverse groups that separated more than a billion years ago. The authors claim that Hemimastigophora deserves supra-kingdom status equivalent to the other supra-kingdoms shown in the figure (modified from Figure 4 of the paper).

The root of the eukaryotic tree is controversial. It could be at positions a, b, or c, shown in the figure. According to the authors, position a is the most favored these days. Regardless of where the root is actually placed, the new positioning of Hemimastigophora adds a lot of information to the deepest parts of the eukaryotic tree and brings us closer to identifying the most primitive features of the eukaryotic cell.

I wonder how many other strange species can be found in Canadian dirt?


Photo Credit: The photo of Yana Eglit at her microscope is from the Dalhousie University press office [Hidden in plain sight: Dal evolutionary biologists uncover a new branch on the Tree of Life]

1. Which she might accidentally reveal if she responds to this post!

2. The fact they were "dancing" gave me an excuse to use a corny title that refers to one of our favorite TV shows, "Last Tango in Halifax."

Lax, G., Eglit, Y., Eme, L., Bertrand, E. M., Roger, A. J., and Simpson, A. G. B. (2018) Hemimastigophora is a novel supra-kingdom-level lineage of eukaryotes. Nature. (in press) [doi: 10.1038/s41586-018-0708-8]

Sunday, November 18, 2018

Revisiting the deflated ego problem

Humans are just another animal. All animals share a core set of several thousand genes and all mammals have about the same number of homologous genes (~25,000). The differences between species such as gorillas, bats and whales are due almost exclusively to differences in the timing of expression of these common genes.

This concept is not new. It was the major theme of Stephen Jay Gould's book, Ontogeny and Phylogeny, back in 1977 [Learning About EVO-Devo]. Over the next twenty years or so, the concept was confirmed repeatedly by the work of hundreds of developmental biology labs working mostly with model organisms such as Drosophila (fruit flies). The field is evolutionary developmental biology or "evo-devo" and that work has been nicely summarized in several popular books appearing in the 21st century.

Thursday, November 08, 2018

DNA Is Not Destiny by Steven J. Heine

DNA Is Not Destiny: The Remarkable Completely Misunderstood Relationship between You and Your Genes
by Steven J. Heine
W.W. Norton & Company, New York/London (2017)
ISBN: 978-0-393-24408-3

Steven Heine is a Professor in the Department of Psychology at the University of British Columbia (Vancouver, B.C., Canada). He has written a book about the perils of DNA testing and his main thesis is that the results of such tests are bound to make you fell unhappy because you will learn that you have a higher risk of several nasty diseases. He warns us that the science behind these predictions is not nearly as solid as the testing companies would have you believe but his main point is the psychological impact of the test results. He claims that we are not conditioned to put the results into the proper perspective because we are pre-conditioned to adopt a very fatalist view of our genetic makeup.

He had his DNA analyzed by a number of companies. Here's some of what he learned from 23 and Me.
23andMe provided me with a gripping set of predictions about my health with real concrete numbers—I learned that I have a 2.1 percent chance of developing Parkinson's disease, and this is 32 percent higher than the average person. The 23andMe experience "felt" satisfying because it provided a wealth of highly specific and personal information about my health. But then, so would the fortune-teller down the street, and at least she isn't claiming any scientific foundation to her predictions.

Tuesday, October 09, 2018

Alternative splicing and the gene concept

I just learned about a workshop scheduled for the end of this month. The topic is: Evolutionary Roles of Transposable Elements and Non-coding DNA: The Science and the Philosophy.

I'd love to attend but it's a just small workshop designed to encourage dialogue between scientists and philosophers who are interested in the topic. Here's a list of the speakers ...
  • Ryan Gregory: Junk DNA, genome size, and the onion test.
  • Stefan Linquist: Four decades debating junk DNA and the Phenotype Paradigm is (somehow) alive and well.
  • Chris Ponting: 92.9% of the human genome evolved neutrally.
  • Paul Griffiths: Both adaptation and adaptivity are relevant to diagnosing function.
  • Ford Doolittle: Selfish genes and selfish DNA: is there a difference?
  • Justin Garson: Biological functions, the liberality problem, and transposable elements.
  • Joyce Havstad: Evolutionary Thinking about Critique of Function Talk.
  • Guillame Bourque: Impact of transposable elements on human gene regulatory networks.
  • Ulrich Stegman: On parity, genetic causation and coding.
  • Steven Downes: Understanding non-coding variants as disease risk alleles.
  • Alexander Palazzo: How nuclear retention and cytoplasmic export of RNAs reduces the deleteriousness of junk DNA.
  • David Haig: Pax somatica
  • Cedric Feschotte: Transposable elements as catalysts of genome evolution.

Friday, July 13, 2018

How many protein-coding genes in the human genome?

The three main human databases (GENCODE/Ensembl, RefSeq, UniProtKB) contain a total of 22,210 protein-coding genes but only 19,446 of these genes are found in all three databases. That leaves 2764 potential genes that may or may not be real. A recent publication suggests that most of them are not real genes (Abascal et al., 2018). The issue is the same problem that I discussed in recent posts [Disappearing genes: a paper is refuted before it is even published] [Nature falls (again) for gene hype].

Sunday, July 08, 2018

Nature falls (again) for gene hype

Nature is arguably the most prestigious science journal. Articles published in Nature are widely perceived to be correct, unbiased, and factual. This perception is certainly true of articles that appear in the News section of the journal since these article are presumably written by expert science writers who have evaluated the new study and decided that it's worth reporting.

Sandwalk readers know that this perception is false (fake news). It turns out that science writers who publish in Nature are not very much better than science writers in general and that's not good.

I recently published a post about an extraordinary claim concerning the number of human genes [Disappearing genes: a paper is refuted before it is even published ]. It concerns a paper posted on an archive site claiming to have found 4,998 new genes of which 1,178 are new protein-coding genes (Pertea et. al., 2018). About five weeks later another paper was posted that effectively refuted the claim of new protein-coding genes (Jungreis et al., 2018). In between publication of those two papers, a freelance science writer, Cassandra Willyard, wrote an article for Nature News that covered the original claim of 4,998 new genes [New human gene tally reignites debate].

Let's see how she handled the controversy.

Disappearing genes: a paper is refuted before it is even published

Several readers alerted me to a paper that was posted on bioRxiv a few weeks ago (May 28, 2018). The paper claimed that the human genome contains 43,162 genes consisting of 21,306 protein-coding genes and 21,856 noncoding genes. The authors reported that they had discovered 3,819 new noncoding genes and 1,178 new protein-coding genes. In addition, they claim to have discovered 97,511 new splice variants raising the total number of splice variants to 12.5 per protein-coding gene although they seem to suggest that almost one-third of these splice variants are non-functional splicing errors. The most striking result, according to the authors, is that 95% of all transcripts are just transcriptional noise.

Here's the paper ...

Thursday, May 10, 2018

Philosophers talking about genes

It's important to define what you mean when you use the word "gene." I use the molecular definition since most of what I write refers to DNA sequences. There's no perfect definition but, for most purposes, a good working definition is: A gene is a DNA sequence that is transcribed to produce a functional product. [What Is a Gene?].

There are two types of genes: protein-coding genes and those that specify a functional noncoding RNA (i.e ribosomal RNA, lincRNA). The gene is the part of the DNA that's transcribed so it includes introns. Transcription is controlled by regulatory sequences such as promoters, operators, and enhancers but these are not part of the gene.

In addition to genes, there are many other functional parts of the genome. In the case of eukaryotic genomes, these include centromeres, telomeres, origins of replication, SARs, and some other bits. None of this is new ... these functions have been known for decades and the working definition I use has been common among knowledgeable experts for half-a-century. Scientists know what they are talking about when they say that the human genome contains about 20,000 protein-coding genes and at least 5,000 genes for non-coding RNAs. They are comfortable with the idea that our genome has lots of other functional regions that lie outside of the genes.

Non-experts may not be familiar with the topic and they may have many misconceptions about genes and DNA sequences but we don't base our science on the views of non-experts.

Because of my interest in this topic, I was intrigued by the title of a new book, The Gene: from Genetics to Postgenomics. I ordered it a soon as I heard about it and I've just finished reading it. The version I read has been translated from German by Adam Bostanci.

Tuesday, March 13, 2018

Making Sense of Genes by Kostas Kampourakis

Kostas Kampourakis is a specialist in science education at the University of Geneva, Geneva (Switzerland). Most of his book is an argument against genetic determinism in the style of Richard Lewontin. You should read this book if you are interested in that argument. The best way to describe the main thesis is to quote from the last chapter.

Here is the take-home message of this book: Genes were initially conceived as immaterial factors with heuristic values for research, but along the way they acquired a parallel identity as DNA segments. The two identities never converged completely, and therefore the best we can do so far is to think of genes as DNA segments that encode functional products. There are neither 'genes for' characters nor 'genes for' diseases. Genes do nothing on their own, but are important resources for our self-regulated organism. If we insist in asking what genes do, we can accept that they are implicated in the development of characters and disease, and that they account for variation in characters in particular populations. Beyond that, we should remember that genes are part of an interactive genome that we have just begun to understand, the study of which has various limitations. Genes are not our essences, they do not determine who we are, and they are not the explanation of who we are and what we do. Therefore we are not the prisoners of any genetic fate. This is what the present book has aimed to explain.

Sunday, February 18, 2018

Human genome books

Theme
Genomes
& Junk DNA

I'm trying to read all the recent books on the human genome and anything related. There are a lot of them. Here's a list with some brief comments. You should buy some of these books. There are others you should not buy under any circumstances.

Friday, February 09, 2018

Are splice variants functional or noise?

This is a post about alternative splicing. I've avoided using that term in the title because it's very misleading. Alternative splicing produces a number of different products (RNA or protein) from a single intron-containing gene. The phenomenon has been known for 35 years and there are quite a few very well-studied examples, including several where all of the splice regulatory factors have been characterized.

Monday, February 05, 2018

ENCODE's false claims about the number of regulatory sites per gene

Some beating of dead horses may be ethical, where here and there they display unexpected twitches that look like life.

Zuckerkandl and Pauling (1965)

I realize that most of you are tired of seeing criticisms of ENCODE but it's important to realize that most scientists fell hook-line-and-sinker for the ENCODE publicity campaign and they still don't know that most of the claims were ridiculous.

I was reminded of this when I re-read Brendan Maher's summary of the ENCODE results that were published in Nature on Sept. 6, 2012 (Maher, 2012). Maher's article appeared in the front section of the ENCODE issue.1 With respect to regulatory sequences he said ...
The consortium has assigned some sort of function to roughly 80% of the genome, including more than 70,000 ‘promoter’ regions — the sites, just upstream of genes, where proteins bind to control gene expression — and nearly 400,000 ‘enhancer’ regions that regulate expression of distant genes ... But the job is far from done, says [Ewan] Birney, a computational biologist at the European Molecular Biology Laboratory’s European Bioinformatics Institute in Hinxton, UK, who coordinated the data analysis for ENCODE. He says that some of the mapping efforts are about halfway to completion, and that deeper characterization of everything the genome is doing is probably only 10% finished.

Wednesday, January 31, 2018

Herding Hemingway's Cats by Kat Arney

Kat Arney has written a very good book on genes and gene expression. She covers all the important controversies in a thorough and thoughtful manner.

Kat Arney is a science writer based in the UK. She has a Ph.D. from the University of Cambridge where she worked on epigenetics and regulation in mice. She also did postdoc work at Imperial College in London. Her experience in the field of molecular biology and gene expression shows up clearly in her book where she demonstrates the appropriate skepticism and critical thinking in her coverage of the major advances in the field.

Tuesday, November 07, 2017

Lateral gene transfer in eukaryotes - where's the evidence?

Lateral gene transfer (LGT), or horizontal gene transfer (HGT), is widespread in bacteria. It leads to the creation of pangenomes for many bacterial species where different subpopulations contain different subsets of genes that have been incorporated from other species. It also leads to confusing phylogenetic trees such that the history of bacterial evolution looks more like a web of life than a tree [The Web of Life].

Bacterial-like genes are also found in eukaryotes. Many of them are related to genes found in the ancestors of modern mitochondria and chloroplasts and their presence is easily explained by transfer from the organelle to the nucleus. Eukaryotic genomes also contain examples of transposons that have been acquired from bacteria. That's also easy to understand because we know how transposons jump between species.

Saturday, October 28, 2017

Creationists questioning pseudogenes: the GULO pseudogene

This is the second post discussing creationist1 papers on pseudogenes. The first post addressed a paper by Jeffrey Tomkins on the β-globin pseudogene [Creationists questioning pseudogenes: the beta-globin pseudogene]. This post covers another paper by Tomkins claiming that the GULO pseudogenes in various primate species are not derived from a common ancestor but instead have been deactivated independently in each lineage.

The Tomkins' article was published in 2014 in Answers Research Journal, a publication that describes itself like this:
ARJ is a professional, peer-reviewed technical journal for the publication of interdisciplinary scientific and other relevant research from the perspective of the recent Creation and the global Flood within a biblical framework.

Saturday, October 14, 2017

Creationists questioning pseudogenes: the beta-globin pseudogene

Jonathan Kane recently (Oct. 6, 2017) posted an article on The Panda's Thumb where he claimed that Young Earth Creationists often don't get enough credit for raising serious issues about evolution [Five principles for arguing against creationism].

He mentioned some articles about pseudogenes as prime examples. I asked him for references and he responded with two articles by Jeffrey Tomkins that were published on the Answers in Genesis website. The first was on the β-globin pseudogene and the second was on the GULO pseudogene. Both articles claim that these DNA sequences aren't really pseudogenes because they have functions.

I'll deal with the β-globin pseudogene in this post and the GULO pseudogene in a subsequent post.