The Genetic Code and Transcription: Deciphering the Flow of Genetic Information

Study Guide - Smart Notes

Tailored notes based on your materials, expanded with key definitions, examples, and context.

The Flow of Genetic Information

DNA, RNA, and Protein Synthesis

The central dogma of molecular genetics describes the flow of genetic information from DNA to RNA to protein. DNA stores genetic information, which is transcribed into messenger RNA (mRNA), an unstable intermediate that transfers information from the nucleus to the cytosol. Protein synthesis uses the genetic code to convert nucleotide sequence information into proteins.

Flow from RNA to protein DNA to mRNA to protein diagram

The Genetic Code

Properties of the Genetic Code

The genetic code is a set of rules used to translate the nucleotide sequence of mRNA into the amino acid sequence of proteins. It is composed of codons, each consisting of three ribonucleotide 'letters'. The code is unambiguous (each codon specifies only one amino acid or stop signal), degenerate (multiple codons can specify the same amino acid), continuous (codons are read sequentially), nonoverlapping (each nucleotide is part of only one codon), and nearly universal (with minor exceptions in mitochondria and some organisms).

Start codon: AUG (methionine)
Stop codons: UAA, UAG, UGA
Number of codons: 64 possible codons for 20 amino acids

Genetic code table

Triplet Nature of the Code

Experimental evidence from virus frameshift mutations showed that the genetic code is read in triplets. Insertions or deletions of one or two nucleotides cause frameshift mutations, while changes of three nucleotides do not alter the reading frame.

Deciphering the Genetic Code

Producing Synthetic mRNAs

Polynucleotide phosphorylase is an enzyme used to create synthetic mRNAs in vitro. It can synthesize RNA without a template, and the probability of incorporating a specific ribonucleotide depends on its concentration.

Polynucleotide phosphorylase reaction

Cell-Free Protein Synthesis

Proteins were synthesized in a cell-free system using E. coli extracts containing ribosomes, tRNAs, amino acids, and other molecules necessary for translation. Synthetic mRNAs and radiolabeled amino acids were added to track incorporation into proteins.

Homopolymer Experiments

Homopolymers (polyU, polyA, polyC, polyG) were used to decipher the first codons. Each homopolymer encoded only one type of codon, resulting in the incorporation of a single amino acid type into the protein.

Artificial mRNA	Radioactivity (counts/min)
None	44
Poly U	39,800
Poly A	50
Poly C	38

Table of radioactivity incorporation

Mixed Heteropolymer Experiments

Mixed heteropolymers were created using two or more ribonucleotides. The frequency of a particular codon was predicted based on the relative concentrations of ribonucleotides. This allowed manipulation of codon composition and analysis of amino acid incorporation.

Possible compositions	Possible triplets	Probability of occurrence	Final %
3A	AAA	(1/6)^3 = 0.5%	0.5
2A:1C	AAC, ACA, CAA	(1/6)^2(5/6) = 2.3%	6.9
1A:2C	ACC, CAC, CCA	(1/6)(5/6)^2 = 11.6%	34.8
3C	CCC	(5/6)^3 = 57.9%	57.9

Mixed heteropolymer table

Triplet Binding Assay

The triplet binding assay involved synthesizing triplet mRNAs of known sequence, labeling a single amino acid, and mixing with ribosomes and charged tRNAs. The ribosome complex containing the triplet mRNA and appropriate tRNA was retained on a filter, allowing identification of codon assignments.

Triplet binding assay diagram

Trinucleotides	Amino Acid
AAA, AAG	Lysine
AUG	Methionine
AUU, AUG, AUA	Isoleucine
CCG, CCA, CCU, CCC	Proline
CUC, CUA, CUG, CUU	Leucine
GAA, GAG	Glutamic acid
UCA, UCG, UCU, UCC	Serine
UGU, UGC	Cysteine
UUA, UUG	Leucine
UUU, UUC	Phenylalanine

Triplet binding assay table

Repeating Copolymer Experiments

Khorana synthesized long mRNA molecules consisting of short repeating sequences (di-, tri-, and tetra-ribonucleotides). These repeating copolymers allowed further deciphering of codon assignments.

Repeating copolymer diagram

The Complete Genetic Code

Codon Assignments and Degeneracy

Most amino acids are represented by two to six codons, except tryptophan (Trp) and methionine (Met), which are specified by only one codon. AUG encodes methionine and initiates all eukaryotic polypeptide chains. UAA, UAG, and UGA are termination signals that do not encode an amino acid or bind a tRNA.

Genetic code table

The Wobble Hypothesis

Flexibility in Codon-Anticodon Pairing

The wobble hypothesis states that the first two ribonucleotides of a codon are more critical than the third, which is less spatially constrained. This allows the anticodon of one tRNA to pair with more than one codon sequence, enabling a minimum of 30 tRNA species to accommodate the 61 amino acid codons.

Base at First Position (5′-end) of tRNA	Base at Third Position (3′-end) of mRNA
A	U
C	G
G	C or U
U	A or G
I	A, U, or C

Universality and Ordered Nature of the Genetic Code

Conservation and Colinearity

The genetic code is conserved in almost all organisms and viruses, with exceptions for a small number of codons in mitochondria, ciliate protozoans, and one bacterial species. The linear sequence of codons in a gene corresponds precisely with the linear amino acid sequence of the protein (colinearity). The 5’ end of a gene corresponds with the N-terminus of the protein.

Ordered Nature and Mutation Buffering

Chemically similar amino acids share the middle base in the different triplets encoding them. For example, U is present in the second position of codons specifying hydrophobic amino acids (Leu, Ile, Val), while C is present in the second position of codons specifying hydrophilic amino acids (Ser, Thr). This ordered code buffers the potential effect of mutation on protein function, as mutations in the first base often result in a change to an amino acid with similar chemical properties.

Genetic code table

Summary Table: Deciphering the Genetic Code

Method	Main Purpose	Key Findings
Homopolymer Synthesis	Assign codons to amino acids	UUU = phenylalanine, AAA = lysine, CCC = proline
Mixed Heteropolymer Synthesis	Analyze codon composition	Codon frequency based on nucleotide ratios
Triplet Binding Assay	Direct codon assignment	Unambiguous and degenerate code
Repeating Copolymer Synthesis	Assign codons using repeating sequences	Further codon assignments