More Recent Comments

Monday, April 27, 2026

Ask Gemini: "What is the difference between junk DNA and non-coding DNA?"

This is weird. I was a bit bored so I asked Gemini the following question: "What is the difference between junk DNA and non-coding DNA?" I thought the first answer was so wrong that I decided to ask it again to see if I got the same answer.

The second answer was quite different because Gemini noticed that I had bookmarked Sandwalk, a blog written by Laurence Moran, a champion of the 'junk DNA' concept. Is it trying to give me the answer it thinks I want or the best possible scientific answer?

Note: Here is the correct answer.

Non-coding DNA refers to the part of the genome that doesn't code for proteins. It's one way to partition the genome - you could also refer to regulatory sequences and non-regulatory sequences.

By the late 1960s scientists knew of lots of functional non-coding DNA such as regulatory sequences and non-coding genes such as those for ribosomal RNA and tRNA. (There are other non-coding functional elements.) It became apparent that most of the human genome consisted of non-functional DNA or junk DNA. The original model back then was that 10% is functional and 90% is junk. The 10% that is functional consisted of 1-2% coding DNA and about 8% of functional non-coding DNA.

No knowledgeable scientist ever said that all non-coding DNA was junk; that's a lie that continues to be perpetuated in scientific publications and the popular media even though it has been repeatedly debunked.

Most of the data that has accumulated over the past 50+ years has supported the idea that 90% of the human genome is junk and only 10% is functional.

The Gemini answers relate to the debate concerning whether AI is really intelligent and, more importantly, whether the popular (free) algorithms are spreading misinformation.

Gemini Answer #1

The difference between non-coding DNA and "junk DNA" is a matter of biology versus history. In the scientific community, "non-coding DNA" is the accurate, technical term, while "junk DNA" is a largely outdated, often misleading label that has fallen out of favor as our understanding of genetics has deepened.

Here is how they differ:

1. Non-Coding DNA (The Scientific Fact) Non-coding DNA is a factual, descriptive term. It refers to any segment of DNA within an organism's genome that does not code for protein sequences.

In the "central dogma" of biology, genes are transcribed into RNA and then translated into proteins. Non-coding DNA refers to everything else that isn't part of those protein-coding instructions. Crucially, "non-coding" does not mean "non-functional."

The vast majority of the human genome (roughly 98–99%) is non-coding. This includes:

Regulatory elements: Promoters, enhancers, and silencers that act like "volume knobs" or "on/off switches" for protein-coding genes.

Non-coding RNA genes: DNA that codes for functional RNA molecules (like tRNA, rRNA, and miRNA) rather than proteins. These RNAs are essential for cell survival and regulation.

Structural DNA: Telomeres (the protective caps on chromosomes) and centromeres (which help chromosomes pull apart during cell division).

Introns: Non-coding sections within a gene that are removed during the RNA processing step.

2. "Junk DNA" (The Outdated Label) "Junk DNA" is a historical term used in the 1960s and 70s to describe the large stretches of DNA that did not seem to have a clear function. At the time, scientists believed that if a segment of DNA didn't code for a protein, it was essentially biological "garbage"—leftover debris from evolution.

Why the term is now considered inaccurate:

We discovered function: As research progressed (particularly with large-scale efforts like the ENCODE project), we realized that much of what was once dismissed as "junk" is actually involved in complex gene regulation, structural maintenance, and genomic stability.

The "Dark Matter" Analogy: Geneticists now prefer terms like the "dark matter" of the genome or the "regulatory landscape" to describe these regions. It implies that we haven't discovered the function yet, rather than asserting that no function exists.

Evolutionary "Garbage": Some portions of the genome are likely remnants of ancient viral infections (endogenous retroviruses) or duplicated genes that have lost their function (pseudogenes). While these might be considered "junk" because they no longer perform their original purpose, they can sometimes be "exapted"—or co-opted—by the cell for entirely new, useful purposes over evolutionary time.

Summary

If you are reading a textbook or scientific paper, you will almost always see "non-coding DNA" used. It is the precise way to describe the landscape of the genome.

"Junk DNA" should generally be avoided because it incorrectly implies that the genome is filled with useless clutter. Instead, think of non-coding DNA as the sophisticated control center of the genome; if protein-coding genes are the "products" being built in a factory, non-coding DNA is the complex, intricate system of blueprints, scheduling software, and logistics management that tells the factory when, where, and how much to produce.

Does the distinction between regulatory non-coding DNA and protein-coding genes make sense, or would you like to explore how non-coding elements like "enhancers" work?

Gemini Answer #2

It looks like you are digging into this topic, and I noticed you have Sandwalk bookmarked. Since that blog (written by evolutionary biologist Laurence Moran) is a well-known champion of the "junk DNA" concept, you are likely encountering a very specific, nuanced scientific debate that goes beyond just semantics.

To clarify the difference, it helps to distinguish between a descriptive term and an evolutionary claim.

1. Non-coding DNA: The Descriptive Term

"Non-coding DNA" is a neutral, descriptive statement. It simply tells you what the DNA does not do:

  • It does not contain the instructions to create proteins (the amino acid sequence).
  • This is a factual category. We can sequence DNA and definitively state, "This segment does not translate into protein."

2. "Junk DNA": The Evolutionary Claim

When researchers like those who write for Sandwalk use the term "junk DNA," they are not saying the DNA is "trash." They are making a specific evolutionary hypothesis:

  • The Claim: This DNA has no biological function that provides a selective advantage to the organism.
  • The Mechanism: It is likely the result of "neutral evolution"—accumulated mutations that the organism has not purged because they don't harm it.
  • The Distinction: In this view, calling it "junk" is a way of saying, "This DNA is not under natural selection; it is selectively neutral.

"Why the distinction matters

The confusion often arises because two different groups of scientists are arguing from different perspectives:

Summary

If you are looking for a precise answer:

  • Non-coding DNA is a classification based on function (it doesn't make protein).
  • Junk DNA is an evolutionary classification based on selection (it provides no fitness benefit to the organism).

The debate isn't really about whether the DNA is "useless." The debate is about whether the genome is packed with "highly optimized regulatory code" (Functionalists) or if it is largely a collection of "evolutionary baggage" that doesn't actually do anything useful for the organism (Selectionists).

Are you exploring this because you are interested in the "C-value paradox," or are you more interested in the debate over how much of the genome is actually functional?


5 comments :

gert korthof said...

Larry, the most remarkable part of the second answer is: "I noticed you have Sandwalk bookmarked." That's amazing! Gemini looks at your bookmarks and browsing history on your computer. And it uses that to give an answer!? Really? It did not use your bookmarks for the first question. Why use bookmarks at all? Are the answers not supposed to be objective, independent of who is asking? And didn't you make bookmarks of ID websites or are they not in your browsing history?

When I asked google AI-modus "What is the difference between junk DNA and non-coding DNA?" I got:
"An older term (popularized in 1972) that is largely being retired."
and:
"Why the Term "Junk" is Fading.
In the past, because researchers did not understand the 98% of DNA that didn't make proteins, they labeled it "junk". However, modern research (such as the ENCODE Project) has revealed that much of this DNA is essential for."

terminus est said...

Thanks, Larry. I asked Gemini the same question and received a nearly identical response. So, I replied with this correction: 1.7% is coding, a significant portion is non-coding and regulatory, but most of our genome is junk (transposons, ERV's, pseudogenes, repeats, etc.) that have accumulated over time. Gemini admitted this was true, but that it was merely providing a more simplified explanation that most people would understand. AI seems completely unreliable, at this point.

Brooklynette said...

Maybe slightly off-topic question, but does anyone on the blog have an opinion about the recent best seller book by Roxanne Khamsi, "Beyond Inheritance: Our Ever-Mutating Cells and a New Understanding of Health" ? I'm an educated layman and healthcare worker, but certainly no expert on this topic. Eric Topol among other weighty authorities in clinical medicine is promoting this author's book and thesis, and to me it has the whiff of the Encode hype. But I don't really know. Here's an example of the overwhelming positive publicity it's receiving from a Harvard University Science Center presentation on April 23rd, "In this groundbreaking book, science writer Roxanne Khamsi describes our bodies as active landscapes of mutation. She reveals how the forces of Darwinian evolution operate within our own tissues. The effects can be devastating, such as when mutant blood cells outcompete their normal counterparts and increase the risk of heart attacks. But mutations can also make our bodies more resilient: Liver cells with genetic changes seem to cope better with excess calories. And immune cells with remixed DNA can make more effective antibodies against the microbes that threaten us.

By letting go of the antiquated idea that every cell in a body has the same exact DNA, we can usher in a whole new era of medicine, including better vaccines and treatments that outsmart cancer. Beyond Inheritance will open your eyes to the immense genetic diversity that exists within you and its incredible potential to shape your well-being."

gert korthof said...

The facts about cells could be right, although the importance exaggerated, but the real trouble starts with claims about better vaccines and cancer treatments...

Brooklynette said...

yes, that's the part that really raised my skeptical antenna