Pseudogenes are non-functional DNA sequences that resemble genes. Much of the DNA related to transposable elements falls into this category. There are ribosomal RNA and tRNA pseudogenes but the term usually refers to sequences that resemble protein-encoding genes.
THEME
Genomes & Junk DNA
Total Junk so far
46%
There are two kinds of pseudogenes derived from protein-encoding genes. Those derived from reverse transcription of mRNA and the re-integration of double-stranded DNA into the genome are called "processed" pseudogenes because the mRNA precursor was processed to give mature mRNA before being copied. Consequently, processed pseudogenes do not have introns. They also don't have promoters so they cannot be transcribed.
The other kind of pseudogene arises following a gene duplication event. One of the copies acquires a mutation that inactivates it. This is usually not harmful because the other copy remains intact. It is the fate of most duplicated genes to become a pseudogene by inactivation.
The original meaning of "junk" DNA referred to pseudogenes (reviewed in Gregory 2005) but the term is now used frequently to mean any non-functional DNA. That's the definition I use here.
Ensembl lists 2,081 pseudogenes in the human genome but that's very low compared to other studies [Human Genome]. The number of processed pseudogenes range from several thousand up to 17 thousand (Drouin 2006). The ENCODE project found 118 pseudogenes in their detailed analysis of 1% of the genome (Solovyev et al. 2006). This suggest that there are 11,800 pseudogenes in the entire genome.
A number of studies suggest that the number of processed pseudogenes is approximately the same as the number of inactivated duplicated genes (reviewed in Taylor and Raes 2005). In the case of processed pseudogenes, there are many copies of a relatively small subset of the total number of genes. In other words, lots of genes do not spawn pseudogenes and those that do have many offspring. This is because there is a bias in favor of genes that are highly expressed n the germ line.
The total number of pseudogenes in the genome is likely to be close to the number of genes based on extrapolations from detailed analyses of small segments of the genome or single chromosomes.
If we assume that there are 10,000 processed pseudogenes averaging 2 kb each then this represents 20 Mb or 0.06% of the genome. If there are an equal number of other pseudogenes then this is 10,000 × 60 kb = 600 Mb or 18% of the genome. This is all junk DNA but it overlaps extensively with the junk DNA from transposable elements. It is further evidence that substantial parts of the genome are non- functional but since most of that sequence would be introns in an active gene, it would count as junk DNA even if the gene were active. It's best to just count the inactive exons in order to avoid double counting.
Thus, pseudogenes are about 1.2% of the genome and all of it is junk.1,2
1. A small number of former pseudogenes have been reactivated. They are no longer pseudogenes so they don't count as junk. A small number of pseudogenes have acquired a separate function so they don't count as junk. There do not appear to be very many examples.
2. There are many scientists who have tried to make the case for pseudogenes having some sort of function. The most common speculation is that they serve as an important reservoir of sequence information that can be accessed by recombination and/or re-activation (e.g., Balakirev and Ayala 2003).
Balakirev, E.S. and Ayala, F.J. (2003) PSEUDOGENES: Are They “Junk” or Functional DNA? Ann. Rev. Genet. 37:123-151. [doi:10.1146/annurev.genet.37.040103.103949]
Drouin, G. (2006)Processed pseudogenes are more abundant in human and mouse X chromosomes than in autosomes. Mol. Biol. Evol. 23:1652-1655 [PubMed]
Gregory, T.R. (2005) "Genome Size Evolution in Animals" in The Evolution of the Genome. Elsevier Academic Press, New York (USA).
Solovyev, V., Kosarev, P., Seledsov, I. and Vorobyev, D. (2006) Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 7 Suppl 1:S10.1-12 [ PubMed
Taylor, J.S. and Raes, J. (2005) "Small-Scale Gene Duplications" in The Evolution of the Genome. Elsevier Academic Press, New York (USA).