The scientists decided to return to the human genome and search for K111. They isolated DNA from their HIV patients, as well as from healthy people. Remarkably, the scientists didn’t find just 1 copy of K111 in each of their subject’s genomes, as is the case in chimps. The more the scientists looked, the more variants they found. Some K111 viruses were fairly intact, while others were vestiges. The scientists found over 100 copies of the virus in the human genome, scattered across 15 chromosomes.
This finding suggests that between 6 ma and 800 ka ago, K111 was duplicated a few times at a fairly slow pace. It’s possible that Markowitz and his colleagues missed some other copies because the reconstruction of those ancient genomes wasn’t quite accurate enough for their search. But even if we generously assumed that Neanderthals and Denisovans had 20 K111 viruses apiece, that’s still a small fraction of the 100 or more copies of K111 the scientists found in the human genome. It was only later, in the past 800 ka, that K111 started proliferating at a faster pace.1 reason that K111 has gone overlooked till now is that it found a good place to hide–the center of chromosomes. This region, called the centromere, is a genomic Bermuda Triangle. It’s loaded with lots of short, repetitive stretches of DNA. When scientists reconstruct the sequence of a genome, they break DNA down into many overlapping segments, which they then try to rebuild based on overlapping similarities. Centromere DNA is so similar to itself that it’s easy to line up fragments in many different arrangements. As a result, centromeres make up much of the last 5% of the human genome that has yet to be mapped.