3
DNA replication and gene expression, which occurs when the nucleotide sequence of a gene is copied into
ribonucleic acid (RNA) during a process called transcription. For genes that encode proteins, DNA is
copied into messenger RNA (mRNA), which then directs the synthesis of proteins during the process
known as translation. Gene expression is highly regulated in order to control a cell’s activities; thus, the
timing and amount of RNA and protein generated from a given gene varies depending on the cell’s
activities. Disruption of gene expression regulation leads to diseases such as cancer.
Inside cells, DNA is packaged around proteins called histones; this DNA–protein (nucleoprotein)
complex is called chromatin. Histones act like “spools” around which DNA is wrapped. In humans, each
cell contains approximately 2 meters of DNA; however, because of the wrapping of DNA around histones,
the condensed DNA is approximately 120 micrometers long!
This DNA “packaging” in the form of chromatin plays a key role in the regulation of gene expression. The
nucleoprotein inside cells serves as a docking site for the different proteins and enzymes and their
interactions required for DNA replication, transcription, recombination, and repair.
This lesson introduces students to the emerging field of epigenetics. Epigenetics literally means “on top
of or in addition to genetics,” and is the study of changes in gene expression not accompanied by alterations
in DNA sequence. In parallel to the term genome, which defines the complete set of genetic information
contained in the DNA of an organism, epigenome refers to the complete set of epigenetic pathways in an
organism. Epigenetic modifications to DNA exert profound influences on gene activity. For example,
studies suggest that epigenetic variation may be responsible for subtle differences in appearance and
behavior of identical twins. Identical twins are more epigenetically similar early in life but show remarkable
divergence with age.
Epigenetic pathways such as DNA methylation and histone modifications interact with each other to
regulate expression of genes. One of the most common and well-characterized epigenetic pathways is DNA
methylation. DNA methylation occurs when an enzyme called a methyltransferase covalently attaches a
methyl (-CH3) group to a cytosine base that is adjacent to a guanine base
(see Figure 1). Such sites where a cytosine is adjacent to guanine via a
phosphodiester bond are called CpG sites. Scientists have observed that
DNA methylation occurs predominately along places on the DNA
strand that are rich in CpG pairs. One type of CpG-rich region is a CpG
island. CpG islands are associated with approximately 60-70% of
mammalian genes, and most CpG islands are unmethylated in normal
mammalian cells. Thus, changes in methylation patterns at CpG islands
can interfere with normal gene expression by altering the transcriptional
competency of a gene’s promoter. Genes that are essential for a cell’s function are not methylated. In
contrast, inactive genes are usually methylated to suppress their expression.
While DNA methylation is involved in normal control of gene expression, changes in the extent of DNA
methylation can contribute to cancer or disease by silencing genes that that should otherwise be active or
expressed or by causing expression of genes that are usually inactive. Methylation is one mechanism for
suppressing (or silencing) gene transcription by preventing one or more transcription factors (TF) and
thus RNA polymerase from accessing a gene’s promoter which is required for transcribing DNA into
RNA (see Figure 2).
Figure 1: This representative DNA helix
depicts two methyl groups (M)
covalently attached to two cyotosine
bases in DNA.
C G T A C A C G A C A C G A T
G C A T G T G C T G T G C T A