We have identified a novel family of about 10-50 human endogenous retrovirus elements (HERVs) and have characterized one family member, HERV-K(C4). This retrovirus element is integrated within intron 9 of complement C4A genes and also of some C4B genes, and is a principal contributor to the interlocus and interallelic length heterogeneity of C4 genes. The HERV-K(C4) sequence has a typical retrovirus structure, with elements of gag, pol and env domains flanked by two long terminal repeats (LTRs), and is similar to type A, B and D retroviruses. Multiple termination codons preclude the existence of long open reading frames, suggesting that the HERV-K(C4) sequence is no longer functional. Zoo blot hybridization suggests that New World monkeys lack sequences similar to HERV-K(C4), implying that integration occurred after the divergence of Old and New World monkeys. Retrotransposition of prototype viruses is presumed to have led to the amplification and integration of the family members at different loci; in humans, these appear to be dispersed over several chromosomes. The absence of the HERV-K(C4) element from some C4B genes in both humans and orangutans indicates that the retrovirus inserted into the C4A gene after the duplication of the cluster. Subsequent spread of the HERV-K(C4) sequence to C4B genes presumably occurred by interlocus sequence exchange, such as unequal crossover and gene conversion-like mechanisms.