Saturday, June 13, 2020

What's in Your Genome? Chapter 3: Repetitive DNA and Mobile Genetic Elements

By the end of chapter 3, readers will be familiar with two main lines of evidence for junk DNA: the C-Value Paradox, and the fact that most of our genome is full of bits and pieces of dead transposons and viruses. They will also understand that this is perfectly consistent with modern evolutionary theory.

Chapter 3: Repetitive DNA and Mobile Genetic Elements
  • Centromeres
  • Telomeres
  • Mobile genetic elements
  • Hidden viruses in your genome
  • What the heck is a transposon?
  • LINES and SINES
  • How much of our genome is composed of transposon-related sequences?
  • BOX 3-1: What does the humped bladderwort tell us about junk DNA?
  • Selfish genes and selfish DNA
  • Mitochondria are invading your genome!
  • Selection hypotheses
  • Exaptation and the post hoc fallacy
  • Box 3-2: Natural genetic engineering?
  • If it walks like a duck ...


What's in Your Genome? Chapter 2: The Evolution of Sloppy Genomes

I had to completely reorganize chapter 2 in order to move population genetics closer to the beginning of the book and reduce the number of words.

Chapter 2: The Evolution of Sloppy Genomes
  • Fugu sashimi
  • Variation in genome size
  • The Onion Test
  • Instantaneous genome doubling
  • Modern evolutionary theory
  • Random genetic drift
  • Neutral Theory
  • Nearly-Neutral Theory
  • Box 2-1: Are humans are still evolving?
  • Population size and the Drift-Barrier Hypothesis
  • Bacteria have small genomes
  • On the evolution of sloppy genomes



What's in Your Genome? Chapter 1: Introducing Genomes

My book is progressing slowly. The main task is to reduce it to about 120,000 words and that's proving to be a lot more difficult that I imagined.

Here's what's now in Chapter 1: Introducing Genomes
  • The genome war
  • Finishing the human genome sequence
  • What is DNA?
  • The double helix
  • The sequence of all the base pairs was the goal of the human genome project
  • How big is your genome?
  • Packaging DNA: chromatin
  • Transcription
  • Translation
  • The genetic code
  • Introns and exons
  • The history of junk DNA



Thursday, June 11, 2020

Dan Graur proposes a new definition of "gene"

I've thought a lot about how to define the word "gene." It's clear that no definition will capture all the possibilities but that doesn't mean we should abandon the term. Traditionally, the biochemical definition attempts to describe the part of the genome that produces a functional product. Most scientists seem to think that the only possible product is a protein so it's common to see the word "gene" defined as a DNA sequence that produces a protein.

But from the very beginning of molecular biology the textbooks also talked about genes for ribosomal RNAs and tRNAs so there was never a time when knowledgeable scientists restricted their definition of a gene to protein-coding regions. My best molecular definition is described in What Is a Gene?.

A gene is a DNA sequence that is transcribed to produce a functional product.

Dan Graur has also thought about the issue and he comes up with a different definition in a recent blog post: What Is a Gene? A Very Short Answer with a Very Long Footnote

A gene is a sequence of genomic material (DNA or RNA) that has a selected effect function.

This is obviously an attempt to equate "function" with "gene" so that all functional parts of the genome are genes, by definition. You might think this is rather silly because it excludes some obvious functional regions but Dan really does want to count them as genes.
Performance of the function may or may not require the gene to be translated or even transcribed.

Genes can, therefore, be classified into three categories:

(1) protein-coding genes, which are transcribed into RNA and subsequently translated into proteins.

(2) RNA-specifying genes, which are transcribed but not translated

(3) nontranscribed genes.
Really? Is it useful to think of centromeres and telomeres as genes? Is it useful to define an origin of replication as a gene? And what about regulatory sequences? Should each functional binding site for a transcription factor be called a gene?

The definition also leads to some other problems. Genes (my definition) occupy about 30% of the human genome but most of this is introns, which are mostly junk (i.e. no selected effect function). How does that make sense using Dan's definition?