tag:blogger.com,1999:blog-37148773.post2293696540639202487..comments2024-03-27T14:50:47.345-04:00Comments on <center>Sandwalk</center>: Don't misuse the word "homology"Larry Moranhttp://www.blogger.com/profile/05756598746605455848noreply@blogger.comBlogger37125tag:blogger.com,1999:blog-37148773.post-71726866019737416832015-03-08T06:10:17.684-04:002015-03-08T06:10:17.684-04:00You do, yes, but RecA itself is frequently describ...You do, yes, but RecA <i>itself</i> is frequently described as doing a 'homology search'!AllanMillerhttps://www.blogger.com/profile/05955231828424156641noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-60525065588251354652015-03-07T17:28:07.330-05:002015-03-07T17:28:07.330-05:00You do a search for possible HOMOLOGY based on seq...You do a search for possible HOMOLOGY based on sequence similarity or structural similarity. Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-64573611193091449902015-03-07T16:54:10.598-05:002015-03-07T16:54:10.598-05:00So, when RecA (or one of its ... er ... homologs s...So, when RecA (or one of its ... er ... homologs such as RAD51) does its ... er ... homology search, we should say something different? This usage has become pretty much embedded in certain areas. Which may grate, but language evolves, even scientific language. Personally, I deplore beginning sentences with 'so', and pointless use of ellipses ...AllanMillerhttps://www.blogger.com/profile/05955231828424156641noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-44700877250853779902015-03-06T22:32:05.388-05:002015-03-06T22:32:05.388-05:00Oh, those silly semantics warriors - they sure get...Oh, those silly semantics warriors - they sure get tiring after a while. <br /><br />And the funny thing is, in most cases they are dead wrong when they object to the use of "homology" because high similarity almost invariably means homology. <br />DKhttps://www.blogger.com/profile/04761138604438222762noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-46605224704300811222015-03-06T18:33:59.949-05:002015-03-06T18:33:59.949-05:00bwilson: These are not multiple definitions of &qu...bwilson: These are not multiple definitions of "homologous"; it's all the same definition, applied to different features. And this is how the term is used by systematists; I didn't invent it.<br /><br />Larry: Identity and homology are not the same thing. Some of that identity is homoplasy, as in my example that started this little argument.<br /><br />Roger: The analogy was intended to be crude just to make it understandable. "Wingness" is indeed shared, it just isn't homologous. Now in the example we can easily tell that bats and birds do not have homologous wings. But in thousands of real cases in morphology (and always in molecular sequences) the non-homologous states look similar enough -- or in fact identical -- that the only reason they're known as homoplasious is after the fact, i.e. because that character doesn't match the tree. This is a routine statement in morphological systematics, and the molecular case differs in no significant way.John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-53664636411889176532015-03-06T16:34:10.488-05:002015-03-06T16:34:10.488-05:00John Harshman, normally I either agree with you or...John Harshman, normally I either agree with you or wish I had agreed with you because I was either wrong or ignorant when I didn't. Here, however, I disagree with you and I think you're wrong. (Yes, I do understand your distinction between characters and character states -- I just don't think it is useful for communication.)<br /><br />In a writing class long ago, I learned that if a writer wants to be understood, he has a responsibility to write clearly, not to bitch about readers who fail to understand. There is a sense in which an entire DNA sequence can reasonably be treated as homologous even if some of the bases have mutated and are no longer identical. When discussing sequences in that sense, saying that two non-identical bases are not homologous would just be wrong. Of course, if you switch to a base-centered frame of reference and somewhat redefine "homologous" to mean identical by descent rather than similar because they are descended from a common ancestor, you can reasonably say that non-identical bases are not homologous. However, you can't expect that your readers to come along with you on this little mental side track, unless you explain a lot.<br /><br />We'd be stuck with these multiple definitions of homologous if that were all we have (think of the chromosome / chromatid mess in meiosis that is guaranteed to confuse students), but we have a way to express what you mean much more clearly (for your audience) if you use percent identity, rather than percent homology. <br /><br />Of course, percent homology is a common phrase, but it's a confusing phrase and should be discouraged simply. Forget arguments on fine shades of meanings; percent homology simply does not communicate well.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-37148773.post-53121026630835739842015-03-06T16:10:38.740-05:002015-03-06T16:10:38.740-05:00That doesn't address my question. I get "...That doesn't address my question. I get "homologous as limbs" "Not homologous as wings". Wingness was not shared. <br />But does anything consider the Aness or Tness of a specific site? It seems severely contrived outside of anything other than artificial algorithmic accountancy. roger shrubberhttps://www.blogger.com/profile/06920052094289132399noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-63769121370198009392015-03-06T16:04:24.840-05:002015-03-06T16:04:24.840-05:00Would you consider it odd to say that 42% of the b...<i>Would you consider it odd to say that 42% of the bases in a given sequence are homologous between two taxa? </i><br /><br />Yes, because we have a far better word for it. We can say that the sequences are 42% identical. That's the raw data that leads us to the conclusion that the genes/sequences are homologous. Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-3688887939468992562015-03-06T15:51:47.343-05:002015-03-06T15:51:47.343-05:00Do you really use homologous to describe character...<i>Do you really use homologous to describe character identity?</i><br /><br />Of course you do. To take a gross example, let's consider a character we might call "tetrapod forelimb". Now of course that character is homologous throughout tetrapods. Now consider a few character states, and let's naively code it as "leg" or "wing". "Leg" is of course the ancestral state, and both birds and bats have the derived state "wing". But those states are not homologous.<br /><br />It's the same with the bases at any given site, except that we have no hope of telling whether two A's are homologous or homoplasious just by examining them.John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-59363206915695094412015-03-06T14:39:33.302-05:002015-03-06T14:39:33.302-05:00I'm confused about the non-homologous parts. I...I'm confused about the non-homologous parts. I get sequence identity. I get reversals. But for a SNP, it still shares an ancestor, your 'site homology'. <br />Now indels are different. If a gene or protein were to be called 70% homologous, I would want that to mean that you can align 70% of the sites with the rest being recent indels (since LCA) but I'm still thinking about bits that were in the LCA and deleted in one. <br />Do you really use homologous to describe character identity? Should you? I'm scratching my head.roger shrubberhttps://www.blogger.com/profile/06920052094289132399noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-1819570069583848622015-03-06T14:17:47.418-05:002015-03-06T14:17:47.418-05:00The reason it's silly to say that wings and fl...The reason it's silly to say that wings and flippers are 42% homologous is that we have no objective measure of percent homology, since character scoring is a subjective process. Our judgments would depend on what particular characters we had abstracted from the anatomy. For molecular sequences, on the other hand, scoring (once you've aligned them, that is) is simple and objective.<br /><br />Would you consider it odd to say that 42% of the bases in a given sequence are homologous between two taxa? John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-76040406767471467142015-03-06T13:32:45.364-05:002015-03-06T13:32:45.364-05:00At any rate, the homologies of individual characte...<i>At any rate, the homologies of individual characters and character states are not invalid questions in morphological studies. Why are molecules to be considered different?</i><br /><br />It would be pretty silly to say that the wings of a bird and the flippers of a seal are 42% homologous.<br /><br /> Why are molecules to be considered different?Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-80010579374002709252015-03-06T13:18:37.011-05:002015-03-06T13:18:37.011-05:00Are you sure of that?
No, of course not. Everythi...<i>Are you sure of that?</i><br /><br />No, of course not. Everything you say is true and the problems even apply to amino acid sequences. (Although I would argue that you need to correct identify calculations by subtracting gap penalties.)<br /><br />I didn't think it was important to quibble in order to make the point. Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-88268357662299828362015-03-06T13:10:05.638-05:002015-03-06T13:10:05.638-05:00When you align any two DNA sequences you'll fi...<i>When you align any two DNA sequences you'll find that roughly 25% of the base pairs are identical.</i><br /><br />Are you sure of that? If you have two random sequences of length l, sure we expect 25% identity, with an SD of 43%/l^.5. But even the most simple alignment algorithm will tend to produce greater sequence identity. How much would depend on the precise method used and the length of sequences involved, but aligning a 100BP sequence to a 10k BP sequence assuming no indels gives me about 43% sequence identity for instance. That's quite a bit higher than 25% and would go further up if indels were allowed.<br />25% identity is not a very useful baseline. We've been over this when you were arguing that micro RNAs were not generally highly conserved. But for a sequence of 30BP aligning it to a sequence of 100kBP we get about 60% as a baseline, rising to 70% for 1MBP. Detecting homologs requires a high degree of sequence conservation for these short sequences.Anonymoushttps://www.blogger.com/profile/04521153536420798640noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-71510230991484262362015-03-06T12:57:09.641-05:002015-03-06T12:57:09.641-05:00Allen Miller: I misunderstood. I didn't realiz...Allen Miller: I misunderstood. I didn't realize they were being matched one at a time. But I'm glad I asked the question.Petrushkahttps://www.blogger.com/profile/02343702725399620404noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-56263066770054141492015-03-06T12:41:23.415-05:002015-03-06T12:41:23.415-05:00Why do you think we have to use the word "hom...<i>Why do you think we have to use the word "homology" in this context?</i><br /><br />Well, of course we don't have to do anything. A better question is this: Why are we forbidden to use it? At any rate, the homologies of individual characters and character states are not invalid questions in morphological studies. Why are molecules to be considered different?John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-68708967841460700032015-03-06T10:19:36.736-05:002015-03-06T10:19:36.736-05:00Not on those grounds alone, no - it's just tha...Not on those grounds alone, no - it's just that there are 4 bases, so any two drawn at random will be the same 25% of the time. AllanMillerhttps://www.blogger.com/profile/05955231828424156641noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-81487263147486555532015-03-06T10:13:23.180-05:002015-03-06T10:13:23.180-05:00Dumb question. If "random" DNA sequences...Dumb question. If "random" DNA sequences are 25% identical, are they all likely to share a common ancestor?Petrushkahttps://www.blogger.com/profile/02343702725399620404noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-91057978469065431552015-03-06T10:08:38.684-05:002015-03-06T10:08:38.684-05:00Why not?
Because we have perfectly good ways of ...<i>Why not? </i><br /><br />Because we have perfectly good ways of saying the same thing without abusing the word "homology." We can say that the genes in two species are homologous and certain segments are more highly conserved than others. We can even say that there's a short segment in the two genes where the sequences are 90% identical in divergent species. <br /><br />Why do you think we have to use the word "homology" in this context?<br /><br />Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-14656972292963075412015-03-06T09:03:41.605-05:002015-03-06T09:03:41.605-05:00It makes no sense to examine small regions of that...<i>It makes no sense to examine small regions of that stretch and say that this 10 bp stretch is 90% homologous while that 10 bp stretch is only 60% homologous.</i><br /><br />Why not? Would it make sense to say that 90% of the bases in a 10bp stretch are homologous? (Of course, mere identity doesn't equal homology, given that there is homoplasy too.)John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-91111768862725507072015-03-06T08:41:51.461-05:002015-03-06T08:41:51.461-05:00OMG, how did I not know about this web site? Well,...OMG, how did I not know about this web site? Well, looks like I have another reason to get less stuff done!Marcolihttps://www.blogger.com/profile/00542204027681831657noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-13013228348604782052015-03-06T07:35:40.207-05:002015-03-06T07:35:40.207-05:00Gun held side ways?! Oh that's a kill shot rig...Gun held side ways?! Oh that's a kill shot right there! (RIP Dr. Dan Deline)Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-37148773.post-39269085988806493002015-03-06T07:16:30.895-05:002015-03-06T07:16:30.895-05:00If 80% of the bases in a sequence are identical by...<i>If 80% of the bases in a sequence are identical by descent, it makes sense to call the sequence 80% homologous.</i><br /><br />No, that makes no sense at all.<br /><br />If you have decided that the stretches of nucleotides share a common ancestor then they are homologous. You describe their relatedness by saying that the sequences are 80% identical. In most cases, that's the evidence that you used to reach the conclusion in the first place. <br /><br />When you align any two DNA sequences you'll find that roughly 25% of the base pairs are identical. In that case, it makes no sense to say that each of those "characters" is homologous and the sequences are 25% homologous.<br /><br />Once you've decided that the two sequences are homologous that's the end of the story. The sequences are usually genes but if they're not then it has to be a significant stretch of DNA. It makes no sense to examine small regions of that stretch and say that this 10 bp stretch is 90% homologous while that 10 bp stretch is only 60% homologous. Larry Moranhttps://www.blogger.com/profile/05756598746605455848noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-80476214403229330962015-03-05T22:14:51.641-05:002015-03-05T22:14:51.641-05:00You may not like it, but the commonly understood m...You may not like it, but the commonly understood meaning of homology applies both to characters and to character states, separately. Homology is defined as similarity due to common ancestry. Different states of one character are homologous characters if that character was found in the ancestor; but identical states are homologous states if that state was found in the ancestor. You may want to apply a special, molecular meaning to the term, but I don't see why. There are level of homology; always have been.<br /><br />Tryptophan at position 64 is not homologous to phenylalanine at position 64, but position 64 is (or may be) a homologous site in two species even if occupied by non homologous residues.John Harshmanhttps://www.blogger.com/profile/06705501480675917237noreply@blogger.comtag:blogger.com,1999:blog-37148773.post-38845545338524463192015-03-05T21:20:38.208-05:002015-03-05T21:20:38.208-05:00If two taxa have the same character state, that...<i>If two taxa have the same character state, that's homologous if they got it by inheritance of that state from their common ancestor.</i><br /><br />Agreed. But if you define percent homology from that, then you're ignoring the homologous characters that did not remain in the same state.Anonymousnoreply@blogger.com