Main content
High school biology
Course: High school biology > Unit 6
Lesson 2: RNA and protein synthesis- Molecular structure of RNA
- DNA replication and RNA transcription and translation
- Intro to gene expression (central dogma)
- The genetic code
- Impact of mutations on translation into amino acids
- RNA and protein synthesis review
- Transcription and translation
- Codons and mutations
© 2023 Khan AcademyTerms of usePrivacy PolicyCookie Notice
Intro to gene expression (central dogma)
How genes in DNA can provide instructions for proteins. The central dogma of molecular biology: DNA → RNA → protein.
Overview: Gene expression
DNA is the genetic material of all organisms on Earth. When DNA is transmitted from parents to children, it can determine some of the children's characteristics (such as their eye color or hair color). But how does the sequence of a DNA molecule actually affect a human or other organism's features? For example, how did the sequence of nucleotides (As, Ts, Cs, and Gs) in the DNA of Mendel's pea plants determine the color of their flowers?
Genes specify functional products (such as proteins)
A DNA molecule isn't just a long, boring string of nucleotides. Instead, it's divided up into functional units called genes. Each gene provides instructions for a functional product, that is, a molecule needed to perform a job in the cell. In many cases, the functional product of a gene is a protein. For example, Mendel's flower color gene provides instructions for a protein that helps make colored molecules (pigments) in flower petals.
The functional products of most known genes are proteins, or, more accurately, polypeptides. Polypeptide is just another word for a chain of amino acids. Although many proteins consist of a single polypeptide, some are made up of multiple polypeptides. Genes that specify polypeptides are called protein-coding genes.
Not all genes specify polypeptides. Instead, some provide instructions to build functional RNA molecules, such as the transfer RNAs and ribosomal RNAs that play roles in translation.
How does the DNA sequence of a gene specify a particular protein?
Many genes provide instructions for building polypeptides. How, exactly, does DNA direct the construction of a polypeptide? This process involves two major steps: transcription and translation.
- In transcription, the DNA sequence of a gene is copied to make an RNA molecule. This step is called transcription because it involves rewriting, or transcribing, the DNA sequence in a similar RNA "alphabet." In eukaryotes, the RNA molecule must undergo processing to become a mature messenger RNA (mRNA).
- In translation, the sequence of the mRNA is decoded to specify the amino acid sequence of a polypeptide. The name translation reflects that the nucleotide sequence of the mRNA sequence must be translated into the completely different "language" of amino acids.
Thus, during expression of a protein-coding gene, information flows from DNA right arrow RNA right arrow protein. This directional flow of information is known as the central dogma of molecular biology. Non-protein-coding genes (genes that specify functional RNAs) are still transcribed to produce an RNA, but this RNA is not translated into a polypeptide. For either type of gene, the process of going from DNA to a functional product is known as gene expression.
Transcription
In transcription, one strand of the DNA that makes up a gene, called the non-coding strand, acts as a template for the synthesis of a matching (complementary) RNA strand by an enzyme called RNA polymerase. This RNA strand is the primary transcript.
The primary transcript carries the same sequence information as the non-transcribed strand of DNA, sometimes called the coding strand. However, the primary transcript and the coding strand of DNA are not identical, thanks to some biochemical differences between DNA and RNA. One important difference is that RNA molecules do not include the base thymine (T). Instead, they have the similar base uracil (U). Like thymine, uracil pairs with adenine.
Transcription and RNA processing: Eukaryotes vs. bacteria
In bacteria, the primary RNA transcript can directly serve as a messenger RNA, or mRNA. Messenger RNAs get their name because they act as messengers between DNA and ribosomes. Ribosomes are RNA-and-protein structures in the cytosol where proteins are actually made.
In eukaryotes (such as humans), a primary transcript has to go through some extra processing steps in order to become a mature mRNA. During processing, caps are added to the ends of the RNA, and some pieces of it may be carefully removed in a process called splicing. These steps do not happen in bacteria.
The location of transcription is also different between prokaryotes and eukaryotes. Eukaryotic transcription takes place in the nucleus, where the DNA is stored, while protein synthesis takes place in the cytosol. Because of this, a eukaryotic mRNA must be exported from the nucleus before it can be translated into a polypeptide. Prokaryotic cells, on the other hand, don't have a nucleus, so they carry out both transcription and translation in the cytosol.
Translation
After transcription (and, in eukaryotes, after processing), an mRNA molecule is ready to direct protein synthesis. The process of using information in an mRNA to build a polypeptide is called translation.
The genetic code
During translation, the nucleotide sequence of an mRNA is translated into the amino acid sequence of a polypeptide. Specifically, the nucleotides of the mRNA are read in triplets (groups of three) called codons. There are 61 codons that specify amino acids. One codon is a "start" codon that indicates where to start translation. The start codon specifies the amino acid methionine, so most polypeptides begin with this amino acid. Three other “stop” codons signal the end of a polypeptide. These relationships between codons and amino acids are called the genetic code.
Steps of translation
Translation takes place inside of structures known as ribosomes. Ribosomes are molecular machines whose job is to build polypeptides. Once a ribosome latches on to an mRNA and finds the "start" codon, it will travel rapidly down the mRNA, one codon at a time. As it goes, it will gradually build a chain of amino acids that exactly mirrors the sequence of codons in the mRNA.
How does the ribosome "know" which amino acid to add for each codon? As it turns out, this matching is not done by the ribosome itself. Instead, it depends on a group of specialized RNA molecules called transfer RNAS (tRNAs). Each tRNA has a three nucleotides sticking out at one end, which can recognize (base-pair with) just one or a few particular codons. At the other end, the tRNA carries an amino acid – specifically, the amino acid that matches those codons.
There are many tRNAs floating around in a cell, but only a tRNA that matches (base-pairs with) the codon that's currently being read can bind and deliver its amino acid cargo. Once a tRNA is snugly bound to its matching codon in the ribosome, its amino acid will be added to the end of the polypeptide chain.
This process repeats many times, with the ribosome moving down the mRNA one codon at a time. A chain of amino acids is built up one by one, with an amino acid sequence that matches the sequence of codons found in the mRNA. Translation ends when the ribosome reaches a stop codon and releases the polypeptide.
What happens next?
Once the polypeptide is finished, it may be processed or modified, combine with other polypeptides, or be shipped to a specific destination inside or outside the cell. Ultimately, it will perform a specific job needed by the cell or organism – perhaps as a signaling molecule, structural element, or enzyme!
Summary:
- DNA is divided up into functional units called genes, which may specify polypeptides (proteins and protein subunits) or functional RNAs (such as tRNAs and rRNAs).
- Information from a gene is used to build a functional product in a process called gene expression.
- A gene that encodes a polypeptide is expressed in two steps. In this process, information flows from DNA right arrow RNA right arrow protein, a directional relationship known as the central dogma of molecular biology.
- Transcription: One strand of the gene's DNA is copied into RNA. In eukaryotes, the RNA transcript must undergo additional processing steps in order to become a mature messenger RNA (mRNA).
- Translation: The nucleotide sequence of the mRNA is decoded to specify the amino acid sequence of a polypeptide. This process occurs inside a ribosome and requires adapter molecules called tRNAs.
- During translation, the nucleotides of the mRNA are read in groups of three called codons. Each codon specifies a particular amino acid or a stop signal. This set of relationships is known as the genetic code.
Explore outside of Khan Academy
Do you want to learn more about transcription? Check out this scrollable interactive from LabXchange.
Do you want to learn more about translation? Check out this scrollable interactive from LabXchange.
LabXchange is a free online science education platform created at Harvard’s Faculty of Arts and Sciences and supported by the Amgen Foundation.
Want to join the conversation?
- Why there are 61 codons? Why not 64??(36 votes)
- 3 of them are stop-codons which terminates translation (61+3=64).(88 votes)
- what happens to the mRNA after translation process i.e after proteins are produced?(30 votes)
- Hi Srinidhi,
After mRNA is translated, is either stored for later translation or is degraded. The eventual fate for every mRNA molecule is to be degraded. The process of degrading mRNA molecules happens at a relatively fixed rate.
Hope this helps!
Jonathan Myung(4 votes)
- I'm still confused on two things. One, what is a TATA box? ANd two, what are the poly-a tails and 5' caps?(11 votes)
- The TATA box tells where a gene begins so that it can be transcribed. The Poly-A tail is a string of (mostly) adenines on the 3' end of the mRNA that gets eaten away by hydrolytic enzymes. It is there so that the coding section of the mRNA doesn't get eaten. (The hydrolytic enzymes themselves are there to protect from viruses.) It is also recognized by the nuclear pore and allows the mRNA to leave the nucleus. The 5' cap tells the ribosome where to begin translating.(23 votes)
- What happens if a mRNA breaks? Will part of the protein be produced from the broken piece?(5 votes)
- Yes, most likely. If the context of the mRNA fits with the translational machinery (applicable for the part of mRNA with the initiation codon only. The part without the initiation codon would not be translated), it might produce a truncated protein where the N-terminal part would be present but the C-terminal part (wrt to the original full length protein) would not be there. However, most of these truncated proteins are recognized by the cellular repair machinery as abnormal and they are recycled. Sometimes though, such proteins can linger and may even participate in cellular functions (in a positive or detrimental way). Most likely source of truncated proteins is DNA rearrangement though, and mRNA breakage would not likely have a major effect (it might, depending upon the need of the original protein) as there would be other full-length mRNAs that would be translated into the protein of interest. Hope this helps.(13 votes)
- Can a DNA end in 3' and the last molecule in this end is a phosphate? Why not??(5 votes)
- Phosphate is always attached to 5' end, and OH group to 3' end, because of the chemical structure of DNA.(2 votes)
- Why do the number of A's on the poly-A tail vary?(3 votes)
- Each time a mRNA is read, an ''A'' of the poly-A tail is cut off, when there's no more ''A'' in the tail, the mRNA can be degraded. A mRNA (let's call it mRNA 1) can have more ''A'' in its tail than another mRNA (mRNA 2) depending on how much the cell needs that product (1 instead of product 2).(5 votes)
- It is essential that we need RNApolymerase for transcription. But to synthesize this enzyme we need transcription to happen . So either RNApolymerase has to be present from the start or there should be another mechanism by which polymerisation of RNA happen. So how is it done for the First Time ?? How is it possible ?(4 votes)
- Some proteins and other molecules come from the previous cell, so the new cell can start its own production. Another thing is that some RNA can act as an enzyme and catalyzes reactions, though it occurs much less often.(2 votes)
- It is mentioned in The Genetic Code, that,
One codon is a "start" codon that indicates where to start translation. The start codon specifies the amino acid methionine, so most polypeptides begin with this amino acid.
AUG codes for methionine, which contains sulfur. In the Hershey-Chase experiment, they made use of the fact that all proteins contain sulfur (because of the presence of methionine, I guess)
Are there proteins which do not begin with methionine?(3 votes)- There are, but this is (usually) due to removal or modification of the amino-terminal (start) methionine. For example enzymes called "methionine amino-peptidases" cut off this amino acid from the beginning of some proteins — this is an example of what is known as a "post-translational modification".
It is also quite common for the first part of a protein (including the starting methionine) to be removed during processing — an example is secreted proteins that have their signal sequences removed during secretion or membrane insertion.
Methionines can also be oxidized to form chemically related residues.(3 votes)
- Why is amino acid sequence not directly read from a template DNA strand? Why do we need an intermediary mRNA ?(2 votes)
- This is an interesting question to think about, but "why" questions in biology are often very difficult to answer in a meaningful way.
Below I've listed one possible evolutionary reason for the use of mRNA as an intermediary and then some advantages to this system.
One strongly supported hypothesis is that RNA actually came first — this known as the "RNA world hypothesis"§.
This means that we use RNA because "life" has always used RNA and getting rid of it by evolutionary processes would be essentially impossible. (You'd have to redesign life from scratch!)
Making mRNA also allows efficient production of proteins from a single gene.
This is because many mRNAs can be transcribed from a gene and then each mRNA can be translated independently (and multiple times).
There are also multiple levels of regulation that can control how much mRNA is present, what parts of the mRNA get kept‡, and how frequently the mRNAs get translated.
This allows cells to be much more responsive to changing conditions.
In addition, in eukaryotes DNA is kept in the nucleus, while translation happens on ribosomes (found in the cytoplasm).
Having an RNA intermediate allows the information in DNA to travel from the nucleus to the cytoplasm.
There are probably more advantages that I haven't thought of — I encourage you to keep thinking about your question as you learn more about how cells work!
Does that help?
§Note: For more information see:
https://en.wikipedia.org/wiki/RNA_world
‡Note: For example alternative splicing in eukaryotes — to learn more see:
https://www.khanacademy.org/science/biology/gene-expression-central-dogma/transcription-of-dna-into-rna/a/eukaryotic-pre-mrna-processing(3 votes)
- What happens in a mutation where the Stop Codon is removed/altered? What does the cell do then? does it perform apoptosis?(2 votes)
- There are repair mechanisms. That one is called Non stop Decay that mechanism is able to detect mRNA which cannot be degraded because there is no STOP codon. It has to detach mRNA from the ribosome so it can translate the next mRNA sequence.
Nonstop decay is the mechanism of identifying and disposing aberrant transcripts that lack in-frame stop codons. It is hypothesized that these transcripts are identified during translation when the ribosome arrives at the 3′ end of the mRNA and stalls. Presumably the ribosome stalling recruits additional cofactors, Ski7 and the exosome complex. The exosome degrades the transcript using either one of is ribonucleolytic activities and the ribosome and the peptide are both released. Additional precautionary measures by the nonstop decay pathway may include translational repression of the nonstop transcript after translation, and proteolysis of the released peptide by the proteasome.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3638749/(2 votes)