Protein Data Bank in the context of Biocuration


Protein Data Bank in the context of Biocuration

Protein Data Bank Study page number 1 of 3

Play TriviaQuestions Online!

or

Skip to study material about Protein Data Bank in the context of "Biocuration"


⭐ Core Definition: Protein Data Bank

The Protein Data Bank (PDB) is a database for the three-dimensional structural data of large biological molecules such as proteins and nucleic acids, which is overseen by the Worldwide Protein Data Bank (wwPDB). This structural data is obtained and deposited by biologists and biochemists worldwide through the use of experimental methodologies such as X-ray crystallography, NMR spectroscopy, and, increasingly, cryogenic electron microscopy. All submitted data are reviewed by expert biocurators and, once approved, are made freely available on the Internet under the CC0 Public Domain Dedication. Global access to the data is provided by the websites of the wwPDB member organizations (PDBe, PDBj, RCSB PDB, BMRB and the EMDB).

The PDB is a key in areas of structural biology, such as structural genomics. Most major scientific journals and some funding agencies now require scientists to submit their structure data to the PDB. Many other databases use protein structures deposited in the PDB. For example, SCOP and CATH classify protein structures, while PDBsum provides a graphic overview of PDB entries using information from other sources, such as Gene Ontology.

↓ Menu
HINT:

In this Dossier

Protein Data Bank in the context of Genetic information

A nucleic acid sequence is a succession of bases within the nucleotides forming alleles within a DNA (using GACT) or RNA (GACU) molecule. This succession is denoted by a series of a set of five different letters that indicate the order of the nucleotides. By convention, sequences are usually presented from the 5' end to the 3' end. For DNA, with its double helix, there are two possible directions for the notated sequence; of these two, the sense strand is used. Because nucleic acids are normally linear (unbranched) polymers, specifying the sequence is equivalent to defining the covalent structure of the entire molecule. For this reason, the nucleic acid sequence is also termed the primary structure.

The sequence represents genetic information. Biological deoxyribonucleic acid represents the information which directs the functions of an organism.

View the full Wikipedia page for Genetic information
↑ Return to Menu

Protein Data Bank in the context of Nucleic acid secondary structure


Nucleic acid secondary structure is the basepairing interactions within a single nucleic acid polymer or between two polymers. It can be represented as a list of bases which are paired in a nucleic acid molecule.The secondary structures of biological DNAs and RNAs tend to be different: biological DNA mostly exists as fully base paired double helices, while biological RNA is single stranded and often forms complex and intricate base-pairing interactions due to its increased ability to form hydrogen bonds stemming from the extra hydroxyl group in the ribose sugar.

In a non-biological context, secondary structure is a vital consideration in the nucleic acid design of nucleic acid structures for DNA nanotechnology and DNA computing, since the pattern of basepairing ultimately determines the overall structure of the molecules.

View the full Wikipedia page for Nucleic acid secondary structure
↑ Return to Menu

Protein Data Bank in the context of Nucleic acid tertiary structure

Nucleic acid tertiary structure is the three-dimensional shape of a nucleic acid polymer. RNA and DNA molecules are capable of diverse functions ranging from molecular recognition to catalysis. Such functions require a precise three-dimensional structure. While such structures are diverse and seemingly complex, they are composed of recurring, easily recognizable tertiary structural motifs that serve as molecular building blocks. Some of the most common motifs for RNA and DNA tertiary structure are described below, but this information is based on a limited number of solved structures. Many more tertiary structural motifs will be revealed as new RNA and DNA molecules are structurally characterized.

View the full Wikipedia page for Nucleic acid tertiary structure
↑ Return to Menu

Protein Data Bank in the context of Enzyme

An enzyme is a biological macromolecule, usually a protein, that acts as a biological catalyst, accelerating chemical reactions without being consumed in the process. The molecules on which enzymes act are called substrates, which are converted into products. Nearly all metabolic processes within a cell depend on enzyme catalysis to occur at biologically relevant rates. Metabolic pathways are typically composed of a series of enzyme-catalyzed steps. The study of enzymes is known as enzymology, and a related field focuses on pseudoenzymes—proteins that have lost catalytic activity but may retain regulatory or scaffolding functions, often indicated by alterations in their amino acid sequences or unusual 'pseudocatalytic' behavior.

Enzymes are known to catalyze over 5,000 types of biochemical reactions. Other biological catalysts include catalytic RNA molecules, or ribozymes, which are sometimes classified as enzymes despite being composed of RNA rather than protein. More recently, biomolecular condensates have been recognized as a third category of biocatalysts, capable of catalyzing reactions by creating interfaces and gradients—such as ionic gradients—that drive biochemical processes, even when their component proteins are not intrinsically catalytic.

View the full Wikipedia page for Enzyme
↑ Return to Menu

Protein Data Bank in the context of Ligand (biochemistry)

In biochemistry and pharmacology, a ligand is a substance that forms a complex with a biomolecule to serve a biological purpose. The etymology stems from Latin ligare, which means 'to bind'. In protein-ligand binding, the ligand is usually a molecule which produces a signal by binding to a site on a target protein. The binding typically results in a change of conformational isomerism (conformation) of the target protein. In DNA-ligand binding studies, the ligand can be a small molecule, ion, or protein which binds to the DNA double helix. The relationship between ligand and binding partner is a function of charge, hydrophobicity, and molecular structure.

Binding occurs by intermolecular forces, such as ionic bonds, hydrogen bonds and Van der Waals forces. The association or docking is actually reversible through dissociation. Measurably irreversible covalent bonding between a ligand and target molecule is atypical in biological systems. In contrast to the definition of ligand in metalorganic and inorganic chemistry, in biochemistry it is ambiguous whether the ligand generally binds at a metal site, as is the case in hemoglobin. In general, the interpretation of ligand is contextual with regard to what sort of binding has been observed.

View the full Wikipedia page for Ligand (biochemistry)
↑ Return to Menu

Protein Data Bank in the context of Polymerase

In biochemistry, a polymerase is an enzyme (EC 2.7.7.6/7/19/48/49) that synthesizes long chains of polymers or nucleic acids. DNA polymerase and RNA polymerase are used to assemble DNA and RNA molecules, respectively, by copying a DNA template strand using base-pairing interactions or half ladder replication.

A DNA polymerase from the thermophilic bacterium, Thermus aquaticus (Taq) (PDB 1BGX, EC 2.7.7.7), is used in the polymerase chain reaction, an important technique of molecular biology.

View the full Wikipedia page for Polymerase
↑ Return to Menu

Protein Data Bank in the context of Nucleic acid structure

Nucleic acid structure refers to the structure of nucleic acids such as DNA and RNA. Chemically speaking, DNA and RNA are very similar. Nucleic acid structure is often divided into four different levels: primary, secondary, tertiary, and quaternary.

View the full Wikipedia page for Nucleic acid structure
↑ Return to Menu

Protein Data Bank in the context of VS ribozyme

The Varkud satellite (VS) ribozyme is an RNA enzyme that carries out the cleavage of a phosphodiester bond.

View the full Wikipedia page for VS ribozyme
↑ Return to Menu

Protein Data Bank in the context of Protein structure

Protein structure is the three-dimensional arrangement of atoms in an amino acid-chain molecule. Proteins are polymers – specifically polypeptides – formed from sequences of amino acids, which are the monomers of the polymer. A single amino acid monomer may also be called a residue, which indicates a repeating unit of a polymer. Proteins form by amino acids undergoing condensation reactions, in which the amino acids lose one water molecule per reaction in order to attach to one another with a peptide bond. By convention, a chain under 30 amino acids is often identified as a peptide, rather than a protein. To be able to perform their biological function, proteins fold into one or more specific spatial conformations driven by a number of non-covalent interactions, such as hydrogen bonding, ionic interactions, Van der Waals forces, and hydrophobic packing. To understand the functions of proteins at a molecular level, it is often necessary to determine their three-dimensional structure. This is the topic of the scientific field of structural biology, which employs techniques such as X-ray crystallography, NMR spectroscopy, cryo-electron microscopy (cryo-EM) and dual polarisation interferometry, to determine the structure of proteins.

Protein structures range in size from tens to several thousand amino acids. By physical size, proteins are classified as nanoparticles, between 1–100 nm. Very large protein complexes can be formed from protein subunits. For example, many thousands of actin molecules assemble into a microfilament.

View the full Wikipedia page for Protein structure
↑ Return to Menu

Protein Data Bank in the context of Protein primary structure

Protein primary structure is the linear sequence of amino acids in a peptide or protein. By convention, the primary structure of a protein is reported starting from the amino-terminal (N) end to the carboxyl-terminal (C) end. Protein biosynthesis is most commonly performed by ribosomes in cells. Peptides can also be synthesized in the laboratory. Protein primary structures can be directly sequenced, or inferred from DNA sequences.

View the full Wikipedia page for Protein primary structure
↑ Return to Menu

Protein Data Bank in the context of Active site

In biology and biochemistry, the active site is the region of an enzyme where substrate molecules bind and undergo a chemical reaction. The active site consists of amino acid residues that form temporary bonds with the substrate, the binding site, and residues that catalyse a reaction of that substrate, the catalytic site. Although the active site occupies only ~10–20% of the volume of an enzyme, it is the most important part as it directly catalyzes the chemical reaction. It usually consists of three to four amino acids, while other amino acids within the protein are required to maintain the tertiary structure of the enzymes.

Each active site is evolved to be optimised to bind a particular substrate and catalyse a particular reaction, resulting in high specificity. This specificity is determined by the arrangement of amino acids within the active site and the structure of the substrates. Sometimes enzymes also need to bind with some cofactors to fulfil their function. The active site is usually a groove or pocket of the enzyme which can be located in a deep tunnel within the enzyme, or between the interfaces of multimeric enzymes. An active site can catalyse a reaction repeatedly as residues are not altered at the end of the reaction (they may change during the reaction, but are regenerated by the end). This process is achieved by lowering the activation energy of the reaction, so more substrates have enough energy to undergo reaction.

View the full Wikipedia page for Active site
↑ Return to Menu

Protein Data Bank in the context of Protease

A protease (also called a peptidase, proteinase, or proteolytic enzyme) is an enzyme that catalyzes proteolysis, breaking down proteins into smaller polypeptides or single amino acids, and spurring the formation of new protein products. They do this by cleaving the peptide bonds within proteins by hydrolysis, a reaction where water breaks bonds. Proteases are involved in numerous biological pathways, including digestion of ingested proteins, protein catabolism (breakdown of old proteins), and cell signaling.

In the absence of functional accelerants, proteolysis would be very slow, taking hundreds of years. Proteases can be found in all forms of life and viruses. They have independently evolved multiple times, and different classes of protease can perform the same reaction by completely different catalytic mechanisms.

View the full Wikipedia page for Protease
↑ Return to Menu