At its core, a paralog is a gene that has evolved from a common ancestral gene through a process known as gene duplication. This fundamental mechanism creates redundancy in the genome, providing the raw material for evolutionary innovation. Unlike orthologs, which arise from speciation events and typically retain the same function across different species, paralogs originate within a single lineage. The duplicated copy is free to accumulate mutations, potentially leading to new functions, subfunctions, or regulatory changes that were not present in the original gene.
The Mechanism of Gene Duplication
The creation of a paralog begins with gene duplication, a relatively common event in genomic evolution. This duplication can occur through several distinct molecular mechanisms, each contributing to the genetic complexity of an organism. The primary methods include unequal crossing over during meiosis, retrotransposition via messenger RNA intermediates, and whole-genome or segmental duplication events. Once the duplication event occurs, the organism possesses two copies of the same gene, establishing the initial condition for paralogous relationship.
Molecular Pathways to Duplication
Unequal crossing over happens when homologous chromosomes misalign during recombination, resulting in one chromosome with a duplication and another with a deletion.
Retrotransposition involves the reverse transcription of mRNA into DNA, which is then inserted into a new genomic location, creating a processed pseudogene or a functional retrogene.
Whole-genome duplication, common in plants and ancient vertebrates, provides a massive surge of genetic material that fuels evolutionary diversification.
Functional Divergence and Subfunctionalization
Following gene duplication, the two paralogous copies often face different selective pressures. One prevailing model is the "neofunctionalization" hypothesis, where one copy retains the original function while the other accumulates mutations that lead to a novel biochemical activity. This process is a primary driver of genetic innovation, allowing organisms to adapt to new environments or develop new physiological traits without losing the essential function of the original gene.
Alternatively, "subfunctionalization" describes a scenario where the original function of the ancestral gene is partitioned between the two paralogs. Each copy assumes a subset of the original gene's roles, such as expression in different tissues, at different developmental stages, or in response to distinct signaling pathways. This division of labor stabilizes the duplicated genes, as both copies are required to maintain the full functionality of the ancestral state.
The Contrast with Orthologs
To fully grasp the concept of a paralog, it is essential to distinguish it from its close relative, the ortholog. While both terms describe homologous genes, they originate through different evolutionary paths. Orthologs are found in different species and are separated by speciation events; they generally maintain the same function throughout evolution. Paralogs, however, are found within the same species and are the result of gene duplication; they are more likely to diverge in function.