Skip to main content
Back

The Genetic Code and Transcription: Deciphering the Flow of Genetic Information

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

The Flow of Genetic Information

DNA, RNA, and Protein Synthesis

The central dogma of molecular genetics describes the flow of genetic information from DNA to RNA to protein. DNA stores genetic information, which is transcribed into messenger RNA (mRNA), an unstable intermediate that transfers information from the nucleus to the cytosol. Protein synthesis uses the genetic code to convert nucleotide sequence information into proteins.

Flow from RNA to protein DNA to mRNA to protein diagram

The Genetic Code

Properties of the Genetic Code

The genetic code is a set of rules used to translate the nucleotide sequence of mRNA into the amino acid sequence of proteins. It is composed of codons, each consisting of three ribonucleotide 'letters'. The code is unambiguous (each codon specifies only one amino acid or stop signal), degenerate (multiple codons can specify the same amino acid), continuous (codons are read sequentially), nonoverlapping (each nucleotide is part of only one codon), and nearly universal (with minor exceptions in mitochondria and some organisms).

  • Start codon: AUG (methionine)

  • Stop codons: UAA, UAG, UGA

  • Number of codons: 64 possible codons for 20 amino acids

Genetic code table

Triplet Nature of the Code

Experimental evidence from virus frameshift mutations showed that the genetic code is read in triplets. Insertions or deletions of one or two nucleotides cause frameshift mutations, while changes of three nucleotides do not alter the reading frame.

Deciphering the Genetic Code

Producing Synthetic mRNAs

Polynucleotide phosphorylase is an enzyme used to create synthetic mRNAs in vitro. It can synthesize RNA without a template, and the probability of incorporating a specific ribonucleotide depends on its concentration.

Polynucleotide phosphorylase reaction

Cell-Free Protein Synthesis

Proteins were synthesized in a cell-free system using E. coli extracts containing ribosomes, tRNAs, amino acids, and other molecules necessary for translation. Synthetic mRNAs and radiolabeled amino acids were added to track incorporation into proteins.

Homopolymer Experiments

Homopolymers (polyU, polyA, polyC, polyG) were used to decipher the first codons. Each homopolymer encoded only one type of codon, resulting in the incorporation of a single amino acid type into the protein.

Artificial mRNA

Radioactivity (counts/min)

None

44

Poly U

39,800

Poly A

50

Poly C

38

Table of radioactivity incorporation

Mixed Heteropolymer Experiments

Mixed heteropolymers were created using two or more ribonucleotides. The frequency of a particular codon was predicted based on the relative concentrations of ribonucleotides. This allowed manipulation of codon composition and analysis of amino acid incorporation.

Possible compositions

Possible triplets

Probability of occurrence

Final %

3A

AAA

(1/6)^3 = 0.5%

0.5

2A:1C

AAC, ACA, CAA

(1/6)^2(5/6) = 2.3%

6.9

1A:2C

ACC, CAC, CCA

(1/6)(5/6)^2 = 11.6%

34.8

3C

CCC

(5/6)^3 = 57.9%

57.9

Mixed heteropolymer table

Triplet Binding Assay

The triplet binding assay involved synthesizing triplet mRNAs of known sequence, labeling a single amino acid, and mixing with ribosomes and charged tRNAs. The ribosome complex containing the triplet mRNA and appropriate tRNA was retained on a filter, allowing identification of codon assignments.

Triplet binding assay diagram

Trinucleotides

Amino Acid

AAA, AAG

Lysine

AUG

Methionine

AUU, AUG, AUA

Isoleucine

CCG, CCA, CCU, CCC

Proline

CUC, CUA, CUG, CUU

Leucine

GAA, GAG

Glutamic acid

UCA, UCG, UCU, UCC

Serine

UGU, UGC

Cysteine

UUA, UUG

Leucine

UUU, UUC

Phenylalanine

Triplet binding assay table

Repeating Copolymer Experiments

Khorana synthesized long mRNA molecules consisting of short repeating sequences (di-, tri-, and tetra-ribonucleotides). These repeating copolymers allowed further deciphering of codon assignments.

Repeating copolymer diagram

The Complete Genetic Code

Codon Assignments and Degeneracy

Most amino acids are represented by two to six codons, except tryptophan (Trp) and methionine (Met), which are specified by only one codon. AUG encodes methionine and initiates all eukaryotic polypeptide chains. UAA, UAG, and UGA are termination signals that do not encode an amino acid or bind a tRNA.

Genetic code table

The Wobble Hypothesis

Flexibility in Codon-Anticodon Pairing

The wobble hypothesis states that the first two ribonucleotides of a codon are more critical than the third, which is less spatially constrained. This allows the anticodon of one tRNA to pair with more than one codon sequence, enabling a minimum of 30 tRNA species to accommodate the 61 amino acid codons.

Base at First Position (5′-end) of tRNA

Base at Third Position (3′-end) of mRNA

A

U

C

G

G

C or U

U

A or G

I

A, U, or C

Universality and Ordered Nature of the Genetic Code

Conservation and Colinearity

The genetic code is conserved in almost all organisms and viruses, with exceptions for a small number of codons in mitochondria, ciliate protozoans, and one bacterial species. The linear sequence of codons in a gene corresponds precisely with the linear amino acid sequence of the protein (colinearity). The 5’ end of a gene corresponds with the N-terminus of the protein.

Ordered Nature and Mutation Buffering

Chemically similar amino acids share the middle base in the different triplets encoding them. For example, U is present in the second position of codons specifying hydrophobic amino acids (Leu, Ile, Val), while C is present in the second position of codons specifying hydrophilic amino acids (Ser, Thr). This ordered code buffers the potential effect of mutation on protein function, as mutations in the first base often result in a change to an amino acid with similar chemical properties.

Genetic code table

Summary Table: Deciphering the Genetic Code

Method

Main Purpose

Key Findings

Homopolymer Synthesis

Assign codons to amino acids

UUU = phenylalanine, AAA = lysine, CCC = proline

Mixed Heteropolymer Synthesis

Analyze codon composition

Codon frequency based on nucleotide ratios

Triplet Binding Assay

Direct codon assignment

Unambiguous and degenerate code

Repeating Copolymer Synthesis

Assign codons using repeating sequences

Further codon assignments

Pearson Logo

Study Prep