Sandwalk: What does a person's genome reveal about their ethnicity and their appearance?

Wednesday, June 15, 2016

If you knew the complete genome sequence of someone, could you tell where they came from and their ethnic background (race)? The answer is confusing according to Siddhartha Mukherjee writing in his latest book "The Gene: an intimate history." The answer appears to be "yes" but then Mukherjee denies that knowing where someone came from tells us anything about their genome or their phenotype. He writes the following on page 342.

... the genetic diversity within any racial group dominates the diversity between racial groups. This degree of intraracial variability makes "race" a poor surrogate for nearly any feature: in a genetic sense, an African man from Nigeria is so "different" from another man from Namibia that it makes little sense to lump them into the same category.

For race and genetics, then, the genome is strictly a one-way street. You can use the genome to predict where X or Y came from. But knowing where A or B came from, you can predict little about the person's genome. Or: every genome carries a signature of an individual's ancestry—but an individual's racial ancestry predicts little about the person's genome. You can sequence DNA from an African-American man and conclude that his ancestors came from Sierra Leone or Nigeria. But if you encounter a man whose great-grandparents came from Nigeria or Sierra Leone, you can say little about the features of this particular man. The geneticist goes home happy; the racist returns empty-handed.

I find this view very strange. Imagine that you were an anthropologist who was an expert on humans and human evolution. Imagine you were told that there's a woman in the next room whose eight great-grandparents all came from Japan. According to Mukherjee, such a scientist could not predict anything about the features of that woman. Does that make any sense?

I suspect this is just a convoluted way of reconciling science with political correctness.

Steven Monroe Lipkin has a different view. He's a medical geneticist who recently published a book with Jon R. Luoma titled "The Age of Genomes: tales from the front lines of genetic medicine." Here's how they explain it on page 6.

Many ethnic groups carry distinct signatures. For example, from a genome sequence you can usually tell if an individual is African-American, Caucasian, Asian, Satnami, or Ashkenazi Jew, even if you've never laid eyes on the patient. A well-regarded research scientist whom I had never met made his genome sequence publicly available as part of a research study. I remember scrolling through his genetic variant files and trying, more successfully than I had expected, to guess what he would look like before I peeked at his webpage photo. The personal genome is more than skin deep.

This makes more sense to me. If you know what to look for (and Steven Monroe Lipkin certainly does) then many of the features of a particular person can be deduced from their genome sequence. And if you know which variants are more common in certain ethnic groups, then you can certainly predict what a person might look like just by knowing where their ancestors came from.

What's wrong with that?

79 comments:

Jonathan Badger, Wednesday, June 15, 2016 10:19:00 AM
There is also an increasing realization that ethnicity is associated with genetic variation that is in turn associated with disease outcome, which really isn't congruent with the "race is just a social construct" idea. For example, some collaborators of mine have found that variations in the LSAMP locus predominate in prostate tumors from African American men but not in other groups, and this probably contributes to the greater incidence of prostate cancer among African Americans.
unknowing, Wednesday, June 15, 2016 10:45:00 AM
Honestly, it sounds more like Mukherjee is confused here, rather than trying to be "politically correct." It seems he's struggling with the interpretation of genetic diversity within and between racial categories. That is, because of the (comparatively high) genetic diversity within some of these categories, category may be poorly predictive at the level of "all known variants." Of course, the fact that broad categories such as white, black, and Asian are poorly predictive at this level does not mean that one cannot predict with reasonable accuracy the presence or absence of a subset of genetic variants, those common to members of that racial category (assuming that these categories do indeed reflect shared ancestry on some level). Unsurprisingly, category becomes a more useful predictor the more it defines a specific population.
Anonymous, Wednesday, June 15, 2016 11:27:00 AM
From Mukherjee:
"the genetic diversity within any racial group dominates the diversity between racial groups. This degree of intraracial variability makes "race" a poor surrogate for nearly any feature: in a genetic sense"

It seems to me this idea that genetic diversity indicates that race is an illusion was started by Lewontin back in the 1970s. I think the idea was that misrepresenting the science can be justified for the greater good of mitigating racism, but I doubt anyone in the KKK gave a fig for any of this.
Here is why I think this is wrong, but if my interpretation is off I of course expect a vigorous correction.

Consider several loci involved in skin pigmentation. It could easily be the case that if one looks at genetic diversity within a population from Nigeria and within a population from Iceland, one would see greater diversity within those populations than between the consensus sequences for both populations. But skin pigmentation is not an illusion: those two populations fall at extreme opposite ends of the phenotypic spectrum. Most of the 'diversity' within the population isn't doing anything, and a few key differences between the populations explain most of the differences.
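
The point about a few loci can be made concrete with a small calculation. The sketch below is illustrative only: the locus counts and allele frequencies are made-up numbers, not data from Nigeria or Iceland, and it uses Wright's F_ST computed from expected heterozygosity as a stand-in for "diversity between versus within populations", which is not necessarily the measure the comment or Mukherjee had in mind. All names in the code are hypothetical.

```python
# Illustrative only: every allele frequency and locus count here is hypothetical.


def heterozygosity(p):
    """Expected heterozygosity at a biallelic locus with allele frequency p."""
    return 2 * p * (1 - p)


def fst(freqs_a, freqs_b):
    """F_ST for two equally weighted populations, pooled over loci.

    Computed as (H_T - H_S) / H_T, where H_S is the mean within-population
    heterozygosity and H_T is the heterozygosity of the pooled population.
    """
    h_within = 0.0
    h_total = 0.0
    for pa, pb in zip(freqs_a, freqs_b):
        h_within += (heterozygosity(pa) + heterozygosity(pb)) / 2
        h_total += heterozygosity((pa + pb) / 2)
    return (h_total - h_within) / h_total


N_BACKGROUND = 9990  # hypothetical loci with similar frequencies in both groups
N_PIGMENT = 10       # hypothetical loci that differ sharply between groups

pop_a = [0.45] * N_BACKGROUND + [0.05] * N_PIGMENT
pop_b = [0.55] * N_BACKGROUND + [0.95] * N_PIGMENT

fst_all = fst(pop_a, pop_b)
fst_pigment = fst(pop_a[N_BACKGROUND:], pop_b[N_BACKGROUND:])

# Expected number of "dark" alleles per diploid person at the pigment loci.
dark_alleles_a = 2 * sum(pop_a[N_BACKGROUND:])
dark_alleles_b = 2 * sum(pop_b[N_BACKGROUND:])

print(f"F_ST over all loci:     {fst_all:.4f}")
print(f"F_ST over pigment loci: {fst_pigment:.4f}")
print(f"Expected dark alleles:  {dark_alleles_a:.1f} vs {dark_alleles_b:.1f} "
      f"out of {2 * N_PIGMENT}")
```

With these made-up inputs, only about 1% of the diversity is between the two groups genome-wide, yet the ten pigment loci on their own give an F_ST of about 0.81 and the groups sit at opposite ends of the expected allele count.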
CherryBombSim, Wednesday, June 15, 2016 11:45:00 AM
This is a great example of what can go wrong when you try to explain science to 5-year-olds. I understand the point Mukherjee is making about pairwise variance in his first sentence, but honestly, most people don't have the math. They read this, get some fuzzy general impression of racial equality, and pass it around in a game of Telephone, where it gets totally garbled. I've had people tell me, with a straight face, that I (an American of European descent) am more closely related to Africans than I am to other Europeans.

What's worse, racists hear this garbled end result and figure "Scientists are so dumb they can't tell races apart. I can, so I must be a lot smarter than them."
Ian Bosdet, Wednesday, June 15, 2016 2:02:00 PM
Couldn't agree more. When interpreting a clinical genetic test, knowing the patient's ethnicity can be very helpful. Mukherjee should look at the Exome Aggregation Consortium (ExAC) data: variants are separated into broad ethnic groups and you can see obvious and sometimes dramatic differences in frequencies between them.
Joe Felsenstein, Wednesday, June 15, 2016 4:51:00 PM
We can use genetic markers, if we have lots of them, to tell apart almost any two populations (say, North Swedes and South Swedes). Does that make them different races?

I think that we are jumping from acknowledging that there are real genetic differences between populations to making assertions about "race" being real and not being a social construct.

CrocodileChuck, Wednesday, June 15, 2016 5:14:00 PM
'Nambia'? Doesn't exist.

Does he mean Namibia?
Jmac, Wednesday, June 15, 2016 6:17:00 PM
It must be a miracle when one considers how some monkeys have evolved into different skin colored monkeys now are called intelligent monkeys....
I guess evolution is not only random. It is also confusing but not to the devout believers...
Unknown, Wednesday, June 15, 2016 8:05:00 PM
Isn't an 80,000 year separation long enough to get significant differences in the DNA of two populations, or would it take much longer than that? Aren't there many characteristics that would differentiate the population in Japan from the population in Kenya?
Robert Byers, Wednesday, June 15, 2016 11:05:00 PM
Yes, anyone who talks about people groups is under the burden of dealing with the historical issues of RACE. So yes, PC is operative here while not corrupting the science.
There is indeed no such thing as race.
What there is IS segregated populations that gain particular details in their bodies.
This is a flaw in evolutionist thinking.
For example, a YEC would say all European population groups were first brown skinned and only later, upon migration to Europe, became white. YET this was in already segregated populations with different languages.
An evolutionist says there was a brown skin group that migrated to Europe, became white, then segregated into groups, with different languages coming soon after.
The creationist sees traits as from influences from the environment and so our bodies adapt instantly to them.
So no races exist. The evolutionist must see races as existing. They needed the original tribe to evolve its traits before separation.
Race is not a social construction in evolutionary terms. It's real populations that evolved separately in the past.
I think a flaw in evolutionary concepts is shown in the problem of how to group mankind.
It works fine for Genesis believing creationists.
YEC should jump on this.

ealloc, Thursday, June 16, 2016 1:29:00 PM
I'm not sure the quotes necessarily contradict each other. Mukherjee has two points: 1. It's easy to predict a person's ancestry from their genome, 2. It's hard to predict genetic features in a person's genome from their ancestry.

Lipkin agrees with the 1st point, but says nothing about the second.

I know little about human genetics, but here's how I might believe the 2nd point: If a particular "race" had a slight increase in probability of a certain set of SNPs (say, a 1% increase in frequency of 100000 SNPs), you would expect a genome of that race to have 1000 (+/-200) more of those SNPs than someone else: probably a pretty statistically reliable test for ancestry. On the other hand, knowing a person was of a particular race tells you little about a SNP. If the SNPs have a 50% probability in race 1 and a 51% probability in race 2, then if you try to predict the SNP based on race you're going to be wrong about half of the time. I base my impression on a rough skim of http://dx.doi.org/10.1038%2Fncomms1104 and hopefully did not butcher the idea.
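
The arithmetic in this paragraph can be checked with a short sketch. It assumes things the comment does not state: the 100000 SNPs are independent, each is simply present or absent in a genome with probability 50% in group 1 and 51% in group 2, both groups are equally common, and the binomial SNP count is approximated by a normal distribution. All names are hypothetical.

```python
import math

# Hypothetical inputs taken from the comment's example.
N_SNPS = 100_000
P_GROUP1 = 0.50
P_GROUP2 = 0.51


def binomial_sd(n, p):
    """Standard deviation of a Binomial(n, p) count."""
    return math.sqrt(n * p * (1 - p))


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


def ancestry_accuracy(n, p1, p2):
    """Accuracy of guessing the group from a genome's SNP count alone.

    Rule: guess group 2 when the count exceeds the midpoint of the two
    expected counts. Assumes p2 > p1, equal priors, and a normal
    approximation to the binomial count.
    """
    mean1, mean2 = n * p1, n * p2
    midpoint = (mean1 + mean2) / 2
    correct_if_1 = normal_cdf((midpoint - mean1) / binomial_sd(n, p1))
    correct_if_2 = normal_cdf((mean2 - midpoint) / binomial_sd(n, p2))
    return (correct_if_1 + correct_if_2) / 2


def snp_accuracy(p1, p2):
    """Accuracy of guessing one SNP from the group alone.

    The best available guess is the more likely state within each group.
    """
    return (max(p1, 1 - p1) + max(p2, 1 - p2)) / 2


# Expected difference in SNP count between one genome from each group,
# and the standard deviation of that difference.
expected_gap = N_SNPS * (P_GROUP2 - P_GROUP1)
gap_sd = math.sqrt(
    binomial_sd(N_SNPS, P_GROUP1) ** 2 + binomial_sd(N_SNPS, P_GROUP2) ** 2
)

print(f"expected gap: {expected_gap:.0f} SNPs, sd of gap: {gap_sd:.0f}")
print(f"ancestry from genome: {ancestry_accuracy(N_SNPS, P_GROUP1, P_GROUP2):.4f}")
print(f"one SNP from ancestry: {snp_accuracy(P_GROUP1, P_GROUP2):.3f}")
```

Under these assumptions the expected gap is 1000 SNPs with a standard deviation of roughly 224 for the difference between two genomes, close to the "+/-200" above. Guessing ancestry from the count is right more than 99.9% of the time, while guessing any single SNP from ancestry is right only 50.5% of the time.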

If that's Mukherjee's point, I still agree that he is muddying the issue a bit. He is careful to add the caveat "in a genetic sense", but it would probably be clearer to say "genetic feature" instead of just "feature". I'm also not sure I agree that "you can predict little about the person's genome", or that "it makes little sense to lump them into the same category". It depends what he means by "little". The KKK could still argue that while you might not be able to reliably predict particular genetic features given ancestry, you could still predict global genome properties like # of SNPs.
NickM, Monday, June 27, 2016 10:06:00 AM
I'm late to the party, but:

========================
Continuous geographic structure is real, “discrete races” aren’t

By Nick Matzke on February 29, 2012 2:38 PM
http://www.pandasthumb.org/archives/2012/02/continuous-geog.html
========================
Rolf Aalberg, Tuesday, June 28, 2016 4:20:00 AM
Larry says: Isn't it fair to say that there are differing opinions among scientists about whether there are genetically isolated human populations?
If we look at dog genetics, we don't have much of a problem with identifying specific races - from Chihuahua to Mastiff? Yet they are all dogs. Maybe we need a new word because 'race' is a loaded word when used with respect to the human race?