Exon in the context of Intron

⭐ Core Definition: Exon

An exon is any part of a gene that will form a part of the final mature RNA produced by that gene after introns have been removed by RNA splicing. The term exon refers to both the DNA sequence within a gene and to the corresponding sequence in RNA transcripts. In RNA splicing, introns are removed and exons are covalently joined to one another as part of generating the mature RNA. Just as the entire set of genes for a species constitutes the genome, the entire set of exons constitutes the exome.

👉 Exon in the context of Intron

An intron is any nucleotide sequence within a gene that is not expressed or operative in the final RNA product. The word intron is derived from the term intragenic region, i.e., a region inside a gene. The term intron refers to both the DNA sequence within a gene and the corresponding RNA sequence in RNA transcripts. The non-intron sequences that become joined by this RNA processing to form the mature RNA are called exons.

Introns are found in the genes of most eukaryotes and many eukaryotic viruses, and they can be located in both protein-coding genes and genes that function as RNA (noncoding genes). There are four main types of introns: tRNA introns, group I introns, group II introns, and spliceosomal introns. Introns are rare in Bacteria and Archaea (prokaryotes).

Exon in the context of RNA processing

Post-transcriptional modification or co-transcriptional modification is a set of biological processes common to most eukaryotic cells by which an RNA primary transcript is chemically altered following transcription from a gene to produce a mature, functional RNA molecule that can then leave the nucleus and perform any of a variety of different functions in the cell. There are many types of post-transcriptional modifications achieved through a diverse class of molecular mechanisms.

One example is the conversion of precursor messenger RNA transcripts into mature messenger RNA that is subsequently capable of being translated into protein. This process includes three major steps that significantly modify the chemical structure of the RNA molecule: the addition of a 5' cap, the addition of a 3' polyadenylated tail, and RNA splicing. Such processing is vital for the correct translation of eukaryotic genomes because the initial precursor mRNA produced by transcription often contains both exons (coding sequences) and introns (non-coding sequences); splicing removes the introns and links the exons directly, while the cap and tail facilitate the transport of the mRNA to a ribosome and protect it from molecular degradation.

View the full Wikipedia page for RNA processing

Exon in the context of Mature messenger RNA

Mature messenger RNA, often abbreviated as mature mRNA, is a eukaryotic RNA transcript that has been spliced and processed and is ready for translation in the course of protein synthesis. Unlike the eukaryotic RNA present immediately after transcription, known as precursor messenger RNA, mature mRNA consists exclusively of exons and has all introns removed.

Mature mRNA is also called "mature transcript", "mature RNA" or "mRNA".

View the full Wikipedia page for Mature messenger RNA

Exon in the context of Open reading frame

In molecular biology, open reading frames (ORFs) are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible reading frames will be "open" (the "reading", however, refers to the RNA produced by transcription of the DNA and its subsequent interaction with the ribosome in translation). Such an ORF may contain a start codon (usually AUG in terms of RNA) and by definition cannot extend beyond a stop codon (usually UAA, UAG or UGA in RNA). That start codon (not necessarily the first) indicates where translation may start. The transcription termination site is located after the ORF, beyond the translation stop codon. If transcription were to cease before the stop codon, an incomplete protein would be made during translation.

In eukaryotic genes with multiple exons, introns are removed and exons are then joined together after transcription to yield the final mRNA for protein translation. In the context of gene finding, the start-stop definition of an ORF therefore only applies to spliced mRNAs, not genomic DNA, since introns may contain stop codons and/or cause shifts between reading frames. An alternative definition says that an ORF is a sequence that has a length divisible by three and is bounded by stop codons. This more general definition can be useful in the context of transcriptomics and metagenomics, where a start or stop codon may not be present in the obtained sequences. Such an ORF corresponds to parts of a gene rather than the complete gene.
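The start-stop definition above can be illustrated with a minimal sketch that scans all six reading frames of a DNA string. It assumes the standard start codon ATG and stop codons TAA, TAG and TGA in DNA terms; the function names and the toy sequence are hypothetical, and real gene finders handle many more cases (alternative start codons, nested ORFs, minimum lengths).

```python
from collections.abc import Iterator

START_CODON = "ATG"                    # AUG in RNA terms
STOP_CODONS = {"TAA", "TAG", "TGA"}    # UAA, UAG, UGA in RNA terms


def reverse_complement(dna: str) -> str:
    """Return the sequence of the opposite strand, read 5' to 3'."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def orfs_in_frame(dna: str, frame: int) -> Iterator[tuple[int, int]]:
    """Yield (start, end) half-open spans running from an ATG to the next in-frame stop."""
    start = None
    for i in range(frame, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if start is None and codon == START_CODON:
            start = i                  # first start codon opens the ORF
        elif start is not None and codon in STOP_CODONS:
            yield (start, i + 3)       # the stop codon closes it
            start = None


def find_orfs(dna: str) -> list[tuple[str, int, str]]:
    """Scan three frames on each strand; return (strand, frame, ORF sequence)."""
    results = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            for start, end in orfs_in_frame(seq, frame):
                results.append((strand, frame, seq[start:end]))
    return results


dna = "CCATGAAATGGTAACC"               # toy sequence, not a real gene
orfs = find_orfs(dna)
print(orfs)                            # [('+', 2, 'ATGAAATGGTAA')]
```

Only one of the six frames is open in this toy sequence. Applied to eukaryotic genomic DNA rather than spliced mRNA, the same scan could be cut short by a stop codon inside an intron, which is why the start-stop definition is applied to spliced mRNAs.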

View the full Wikipedia page for Open reading frame

Exon in the context of Long noncoding RNA

Long non-coding RNAs (long ncRNAs, lncRNA) are a type of RNA, generally defined as transcripts longer than 200 nucleotides that are not translated into protein. This arbitrary limit distinguishes long ncRNAs from small non-coding RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), and other short RNAs. Given that some lncRNAs have been reported to have the potential to encode small proteins or micro-peptides, the latest definition of lncRNA is a class of transcripts of over 200 nucleotides that have no or limited coding capacity. However, John S. Mattick and colleagues have suggested changing the definition of long non-coding RNAs to transcripts longer than 500 nt, which are mostly generated by Pol II. The exact definition of lncRNA is therefore still under discussion in the field. Long intervening/intergenic noncoding RNAs (lincRNAs) are sequences of transcripts that do not overlap protein-coding genes.

Long non-coding RNAs include intergenic lincRNAs, intronic ncRNAs, and sense and antisense lncRNAs, each type showing different genomic positions in relation to genes and exons.

View the full Wikipedia page for Long noncoding RNA

Exon in the context of Protein isoform

A protein isoform, or "protein variant", is a member of a set of highly similar proteins that originate from a single gene and are the result of genetic differences. While many perform the same or similar biological roles, some isoforms have unique functions. A set of protein isoforms may be formed from alternative splicing, variable promoter usage, or other post-transcriptional modifications of a single gene; post-translational modifications are generally not considered. (For that, see Proteoforms.) Through RNA splicing mechanisms, different protein-coding segments (exons) of a gene, or even different parts of exons, can be selected to form different mRNA sequences. Each unique sequence produces a specific form of a protein.
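How exon selection yields distinct mRNA sequences can be shown with a small sketch. The exon names and sequences below are invented for illustration, and the model makes a simplifying assumption that the chosen exons always keep their order in the gene; it covers only whole-exon skipping, not the use of partial exons.

```python
# Hypothetical exons of a toy gene, listed in their order along the gene.
EXONS = {
    "E1": "AUGGCU",
    "E2": "GAACUG",
    "E3": "AAAGGC",
    "E4": "UGGUAA",
}


def assemble(exon_ids: list[str]) -> str:
    """Join the selected exons into one mRNA, requiring gene order to be kept."""
    gene_order = list(EXONS)                       # dicts preserve insertion order
    positions = [gene_order.index(e) for e in exon_ids]
    if positions != sorted(positions):
        raise ValueError("exons must be joined in their order along the gene")
    return "".join(EXONS[e] for e in exon_ids)


# Three alternative splicing outcomes from the same gene.
isoforms = {
    "full_length": assemble(["E1", "E2", "E3", "E4"]),
    "skip_E2": assemble(["E1", "E3", "E4"]),
    "skip_E3": assemble(["E1", "E2", "E4"]),
}

for name, mrna in isoforms.items():
    print(f"{name:12s} {mrna} ({len(mrna)} nt)")
```

Each entry is a different mRNA sequence derived from the same four exons, which is the sense in which one gene can give rise to several protein isoforms.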

The discovery of isoforms could explain the discrepancy between the small number of protein coding regions of genes revealed by the Human Genome Project and the large diversity of proteins seen in an organism: different proteins encoded by the same gene could increase the diversity of the proteome. Isoforms at the RNA level are readily characterized by cDNA transcript studies. Many human genes possess confirmed alternative splicing isoforms. It has been estimated that ~100,000 expressed sequence tags (ESTs) can be identified in humans. Isoforms at the protein level can manifest in the deletion of whole domains or shorter loops, usually located on the surface of the protein.

View the full Wikipedia page for Protein isoform

Exon in the context of Transgene

A transgene is a gene that has been transferred naturally, or by any of a number of genetic engineering techniques, from one organism to another. The introduction of a transgene, in a process known as transgenesis, has the potential to change the phenotype of an organism. Transgene describes a segment of DNA containing a gene sequence that has been isolated from one organism and is introduced into a different organism. This non-native segment of DNA may either retain the ability to produce RNA or protein in the transgenic organism or alter the normal function of the transgenic organism's genetic code. In general, the DNA is incorporated into the organism's germ line. For example, in higher vertebrates this can be accomplished by injecting the foreign DNA into the nucleus of a fertilized ovum. This technique is routinely used to introduce human disease genes or other genes of interest into strains of laboratory mice to study the function or pathology involved with that particular gene.

The construction of a transgene requires the assembly of a few main parts. The transgene must contain a promoter, which is a regulatory sequence that will determine where and when the transgene is active, an exon, a protein coding sequence (usually derived from the cDNA for the protein of interest), and a stop sequence. These are typically combined in a bacterial plasmid and the coding sequences are typically chosen from transgenes with previously known functions.

View the full Wikipedia page for Transgene

Exon in the context of Duchenne muscular dystrophy

Duchenne muscular dystrophy (DMD) is a severe type of muscular dystrophy predominantly affecting boys. The onset of muscle weakness typically begins around age four, with rapid progression. Initially, muscle loss occurs in the thighs and pelvis, extending to the arms, which can lead to difficulties in standing up. By the age of 12, most individuals with Duchenne muscular dystrophy are unable to walk. Affected muscles may appear larger due to an increase in fat content, and scoliosis is common. Some individuals may experience intellectual disability, and females carrying a single copy of the mutated gene may show mild symptoms.

Duchenne muscular dystrophy is caused by mutations or deletions in any of the 79 exons encoding the large dystrophin protein, which is essential for maintaining the muscle fibers' cell membrane integrity. The disorder follows an X-linked recessive inheritance pattern, with approximately two-thirds of cases inherited from the mother and one-third resulting from a new mutation. Diagnosis can frequently be made at birth through genetic testing, and elevated creatine kinase levels in the blood are indicative of the condition.

View the full Wikipedia page for Duchenne muscular dystrophy

Exon in the context of RNA splicing

RNA splicing is a process in molecular biology where a newly made precursor messenger RNA (pre-mRNA) transcript is transformed into a mature messenger RNA (mRNA). It works by removing all the introns (non-coding regions of RNA) and splicing back together exons (coding regions). For nuclear-encoded genes, splicing occurs in the nucleus either during or immediately after transcription. For those eukaryotic genes that contain introns, splicing is usually needed to create an mRNA molecule that can be translated into protein. For many eukaryotic introns, splicing occurs in a series of reactions which are catalyzed by the spliceosome, a complex of small nuclear ribonucleoproteins (snRNPs). There are also self-splicing introns, that is, ribozymes that can catalyze their own excision from their parent RNA molecule. The process of transcription, splicing and translation is called gene expression, the central dogma of molecular biology.
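The removal of introns and joining of exons can be sketched as a simple string operation. This is only a conceptual model under stated assumptions: the segment sequences are invented, exon positions are taken as already known, and nothing here represents the spliceosome's chemistry or how splice sites are actually recognized.

```python
# A toy pre-mRNA described as labelled segments (all sequences are hypothetical).
SEGMENTS = [
    ("exon", "AUGGCC"),
    ("intron", "GUAAGUCCUAG"),
    ("exon", "UUUGGA"),
    ("intron", "GUACAUUUAG"),
    ("exon", "CCCUAA"),
]

pre_mrna = "".join(seq for _, seq in SEGMENTS)

# Record each exon as a 0-based, half-open (start, end) interval on the pre-mRNA.
exon_intervals = []
position = 0
for kind, seq in SEGMENTS:
    if kind == "exon":
        exon_intervals.append((position, position + len(seq)))
    position += len(seq)


def splice(transcript: str, exons: list[tuple[int, int]]) -> str:
    """Keep only the exon intervals, joined in order; everything else is dropped."""
    ordered = sorted(exons)
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start < prev_end:
            raise ValueError("exon intervals must not overlap")
    return "".join(transcript[start:end] for start, end in ordered)


mature_mrna = splice(pre_mrna, exon_intervals)
print(exon_intervals)   # [(0, 6), (17, 23), (33, 39)]
print(mature_mrna)      # AUGGCCUUUGGACCCUAA
```

The mature transcript is shorter than the pre-mRNA by exactly the combined length of the two introns, and the exons appear in their original order.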

View the full Wikipedia page for RNA splicing

Exon in the context of Exome sequencing

Exome sequencing, also known as whole exome sequencing (WES), is a genomic technique for sequencing all of the protein-coding regions of genes in a genome (known as the exome). It consists of two steps: the first step is to select only the subset of DNA that encodes proteins. These regions are known as exons—humans have about 180,000 exons, constituting about 1% of the human genome, or approximately 30 million base pairs. The second step is to sequence the exonic DNA using any high-throughput DNA sequencing technology.
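The figures quoted above can be checked with a short calculation. The exon count and exome size come from the text; the total human genome size of roughly 3.1 billion base pairs is an assumption added here for the arithmetic and is not stated in the source.

```python
EXON_COUNT = 180_000          # from the text
EXOME_BP = 30_000_000         # from the text
GENOME_BP = 3_100_000_000     # assumption: approximate haploid human genome size

exome_fraction = EXOME_BP / GENOME_BP
mean_exon_length = EXOME_BP / EXON_COUNT

print(f"Exome share of genome: {exome_fraction:.2%}")
print(f"Mean exon length: {mean_exon_length:.0f} bp")
```

Under that assumption the exome comes to just under 1% of the genome, consistent with the text, and the implied average exon is on the order of 170 base pairs.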

The goal of this approach is to identify genetic variants that alter protein sequences, and to do this at a much lower cost than whole-genome sequencing. Since these variants can be responsible for both Mendelian and common polygenic diseases, such as Alzheimer's disease, whole exome sequencing has been applied both in academic research and as a clinical diagnostic.

View the full Wikipedia page for Exome sequencing

Exon in the context of Gene family

A gene family is a set of several similar genes, formed by duplication of a single original gene, and generally with similar biochemical functions. One such family is the set of genes for human hemoglobin subunits; the ten genes are in two clusters on different chromosomes, called the α-globin and β-globin loci. These two gene clusters are thought to have arisen as a result of a precursor gene being duplicated approximately 500 million years ago.

Genes are categorized into families based on shared nucleotide or protein sequences. Phylogenetic techniques can be used as a more rigorous test. The positions of exons within the coding sequence can be used to infer common ancestry. Knowing the sequence of the protein encoded by a gene can allow researchers to apply methods that find similarities among protein sequences that provide more information than similarities or differences among DNA sequences.

View the full Wikipedia page for Gene family

Exon in the context of RNA-Seq

RNA-Seq (short for RNA sequencing) is a next-generation sequencing (NGS) technique used to quantify and identify RNA molecules in a biological sample, providing a snapshot of the transcriptome at a specific time. It enables transcriptome-wide analysis by sequencing cDNA derived from RNA. Modern workflows often incorporate pseudoalignment tools (such as Kallisto and Salmon) and cloud-based processing pipelines, improving speed, scalability, and reproducibility.

RNA-Seq makes it possible to examine alternatively spliced gene transcripts, post-transcriptional modifications, gene fusions, mutations/SNPs and changes in gene expression over time, or differences in gene expression between groups or treatments. In addition to mRNA transcripts, RNA-Seq can examine different populations of RNA, including total RNA and small RNA such as miRNA and tRNA, and can be used for ribosomal profiling. RNA-Seq can also be used to determine exon/intron boundaries and verify or amend previously annotated 5' and 3' gene boundaries. Recent advances in RNA-Seq include single cell sequencing, bulk RNA sequencing, 3' mRNA-sequencing, in situ sequencing of fixed tissue, and native RNA molecule sequencing with single-molecule real-time sequencing. Other emerging RNA-Seq applications, made possible by advances in bioinformatics algorithms, include the analysis of copy number alteration, microbial contamination, transposable elements, cell type (deconvolution) and the presence of neoantigens.

View the full Wikipedia page for RNA-Seq

Exon in the context of Terminal deoxynucleotidyl transferase

Terminal deoxynucleotidyl transferase (TdT), also known as DNA nucleotidylexotransferase (DNTT) or terminal transferase, is a specialized DNA polymerase expressed in immature, pre-B, and pre-T lymphoid cells and in acute lymphoblastic leukemia/lymphoma cells. TdT adds N-nucleotides to the V, D, and J exons of the TCR and BCR genes during antibody gene recombination, enabling the phenomenon of junctional diversity. In humans, terminal transferase is encoded by the DNTT gene. As a member of the X family of DNA polymerase enzymes, it works in conjunction with polymerase λ and polymerase μ, both of which belong to the same family. The diversity introduced by TdT has played an important role in the evolution of the vertebrate immune system, significantly increasing the variety of antigen receptors that a cell is equipped with to fight pathogens. Studies using TdT knockout mice have found drastic (10-fold) reductions in T-cell receptor (TCR) diversity compared with that of normal, or wild-type, systems. The greater diversity of TCRs that an organism is equipped with leads to greater resistance to infection. Although TdT was one of the first DNA polymerases identified in mammals, in 1960, it remains one of the least understood of all DNA polymerases. In 2016–18, TdT was found to exhibit in trans template-dependent behaviour in addition to its more widely known template-independent behaviour.

TdT is absent in fetal liver HSCs, significantly impairing junctional diversity in B-cells during the fetal period.

View the full Wikipedia page for Terminal deoxynucleotidyl transferase
