Currently, there is a revolution in ideas about the etiology, pathogenesis and therapy of human diseases, which is associated with advances in the field of molecular biology and genetics, molecular medicine and pharmacology.

Significant advances have been made in understanding the structure and function of DNA, RNA and proteins; genome replication and functioning; reverse transcription; DNA modification, repair and recombination; and the transcription and translation of mRNA in pro- and eukaryotic cells. Numerous studies based on new bioanalytical methods have clarified the main pathways of gene expression regulation. Recombinant DNA technologies have been studied in detail. The study of the physicochemical basis of hereditary and socially significant human diseases (atherosclerosis, oncopathologies, diabetes mellitus, intracellular infections, neurodegenerative diseases, etc.) is now developing rapidly.

In the post-genomic era, the question of the practical implementation of fundamental developments in the field of molecular biology, medicine and pharmacology arises. At the same time, the functioning of the genome is reflected in postgenomic events associated with the synthesis of numerous proteins, the study of which is now receiving special attention within the framework of a separate scientific field - proteomics. The development of proteomic research is impossible without the construction of algorithms and analysis methods, the creation of a database that makes it possible to elucidate the functioning mechanism of biological texts and develop targeted pharmacological effects (biotransformation).

Related problems of genomics and proteomics, pharmacogenomics and biotransformatics are implemented on the basis of unique methodological solutions and technological platforms.

Currently, academic centers and research institutes in Russia, the CIS countries, Western Europe, the USA and Canada are developing technological platforms for biomedical and pharmaceutical research and introducing their results into clinical practice.

The purpose of the practice is to consolidate and deepen the theoretical knowledge acquired during the learning process; master methods of working with specialized literature; collect specific materials in accordance with the recommended questions; formalize the results obtained during the internship.


The beginning of the 21st century marks the start of the era of proteomics. The term derives from two well-known concepts in biochemistry, “PROTEins” and “genOMe”, and was first used in 1995.

Of course, genomics will not disappear; it will develop at the same or perhaps an even faster pace. It is clear, however, that the center of post-genomic research will shift to the inventory and elucidation of the human proteomic map. At first glance, the problem seems completely unsolvable. The human genomic map is essentially the same for all human cells (23 pairs of chromosomes with the same set of genes; the exception is the sex cells, which carry a single set of 23 chromosomes). For the human proteomic map, by contrast, it is meaningless to speak of generality: every cell, every tissue and every biological fluid must have its own proteomic map. Although each cell was estimated to have about 100,000 functioning genes (later estimates put the number of human protein-coding genes closer to 20,000), numerous modification reactions can increase the number of proteins in a cell to 10 to 20 million.

In this regard, there are currently two definitions of proteomics: a narrow one, which can be called structural proteomics, and a broader one, which includes both the structural and functional parts of proteomics. In the narrow sense, proteomics is the science that inventories proteins through the combined use of two-dimensional electrophoresis (2D electrophoresis), mass spectrometric (MS) analysis of the molecular weight and sequence of the proteins separated by electrophoresis, and subsequent analysis of the results using bioinformatics methods. Essentially, structural proteomics is a combination of 2D electrophoresis, mass spectrometry and bioinformatics. The resolving power of two-dimensional electrophoresis has been known for a long time, since the first work of O'Farrell in 1975, whereas the ability of MS analysis to determine the molecular weight and sequence of polypeptide chains very quickly became clear only recently. MS methods developed so quickly that some companies have already created fully automated systems for determining the molecular mass and sequence of proteins, operating at femtomolar and attomolar concentration levels. Using a combination of these methods, it is possible to create a proteomic map of any biological material, which represents a phenotypic manifestation of the genome of a cell, tissue, or even an entire organ. In a broad sense, the terms proteomic analysis, or proteomics, cover not only the inventory of the proteins of a biological object, but also the monitoring of reversible post-translational modification (PTM) of proteins by specific enzymes, such as phosphorylation, glycosylation, acylation, prenylation, etc.

Currently, more than 300 different types of post-translational modification have been characterized using proteomics.

The intensive development of MS analysis has contributed to the emergence, over the past 5-7 years, of a whole group of areas of proteomic research (Fig. 1). Most of them have a biomedical focus; the fundamental basis, however, still rests on structural and functional proteomics.

The policies of most countries of the European Union, Russia and the CIS countries are, to one degree or another, connected with the natural desire of the population to live in accordance with international quality standards. Terms such as “ecologically clean area” or “ecologically friendly product”, as well as all kinds of words with the prefix “euro-”, which have become firmly established in everyday life, unfortunately, in most cases, do not have any actual content. At the same time, the desired standards of quality of life established in many countries are the result of complex processes affecting the cultural, social and legal aspects of the development of these states.

Figure 1 - Modern directions of proteomic analysis.


S.V. Suchkov1,2, D.A. Gnatenko1, D.S. Kostyushev1, S.A. Krynsky1, M.A. Paltsev3

1 I.M. Sechenov First Moscow State Medical University, Russian Federation

2 A.I. Evdokimov Moscow State Medical and Dental University, Russian Federation

3 RRC “Kurchatov Institute”, Moscow, Russian Federation

It is known that the vast majority of pathological changes in the functioning of cells, tissues and organs are accompanied by a deviation from the physiological protein profile of a normal healthy organism. The analysis and prediction of such changes now come to the fore when creating preclinical screening protocols (i.e., identifying hidden and latent protein “precursors” of the disease, as well as assessing the effectiveness of the therapy applied). The main tasks of proteomics are the search for, identification, separation, and quantitative and qualitative determination of protein molecules that play a role in susceptibility to a disease or directly in its development.

Proteomics is a science that studies the protein composition of biological objects, as well as the structural and functional properties of protein molecules. Its task is to identify and quantify the individual proteins contained in biological samples (blood serum, cerebrospinal fluid, urine, biopsies) at different stages of disease development, as well as during therapy. The totality of all the proteins of the body, i.e., its protein profile, is called the “proteome”.

Modern technological arsenal of proteomics

Fractionation and separation of proteins contained in a specific biological sample is carried out by electrophoresis in polyacrylamide gel. To identify isolated proteins, a wide range of methods are used, among which the following should be highlighted:

Protein microsequencing;

High-pressure, high-resolution liquid chromatography (HPLC);

Methods of immunochemical testing using monoclonal antibodies to individual antigenic determinants;

Mass spectrometry.

In recent years, the procedure for detecting protein molecules has been significantly optimized by developing for this purpose a wide panel of microbiochips with different types of detection, for example SELDI (surface-enhanced laser desorption/ionization) and/or MALDI (matrix-assisted laser desorption/ionization). Approaches of this kind made it possible to simultaneously analyze up to 10,000 individual proteins in one sample, while recording minute shifts in their concentrations under the influence of various factors. As a result, if proteins differ in at least one of their inherent parameters (total charge of the molecule or molecular weight), the above approach makes it possible to achieve their separation with subsequent identification and characterization.

One of the most promising methods for identifying proteins is mass spectrometry, based on the formation of ionized particles of the analyte in a vacuum space, followed by analysis of the ratio of the mass of ions to their charge. There are various modifications of mass spectrometry, which are divided depending on the ionization and particle detection methods used. A time-of-flight mass spectrometer records individual ions, indicating the ion's mass-to-charge ratio (m/z), the number of ions, and the time of flight of the ions from the source to the ion detector.
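The link between flight time and m/z in a time-of-flight analyzer can be illustrated with a short sketch. It assumes an idealized linear instrument in which ions start at rest, are accelerated through a potential difference and then drift through a field-free tube; the 20 kV voltage, the 1 m tube length and the function names are illustrative assumptions, not values from the text.

```python
import math

ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs
DALTON = 1.66053906660e-27           # kilograms


def flight_time(mz, voltage=20_000.0, length=1.0):
    """Drift time in seconds for an ion with mass-to-charge ratio mz (Da per charge).

    Idealized linear TOF: z*e*U = m*v**2 / 2, hence t = L * sqrt(m / (2*z*e*U)).
    The default voltage (V) and tube length (m) are illustrative assumptions.
    """
    mass_per_charge = mz * DALTON / ELEMENTARY_CHARGE  # kg per coulomb
    return length * math.sqrt(mass_per_charge / (2.0 * voltage))


def mz_from_time(t, voltage=20_000.0, length=1.0):
    """Invert flight_time: recover m/z (Da per charge) from a drift time in seconds."""
    mass_per_charge = 2.0 * voltage * (t / length) ** 2
    return mass_per_charge * ELEMENTARY_CHARGE / DALTON


if __name__ == "__main__":
    for mz in (1_000.0, 4_000.0, 16_000.0):
        print(f"m/z {mz:>7.0f}: {flight_time(mz) * 1e6:.2f} microseconds")
```

Because the drift time grows with the square root of m/z, a fourfold increase in m/z only doubles the flight time, which is why timing precision determines the mass resolution of such an instrument.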

Chromatographic methods have a lower resolution and separate proteins according to the physical properties of the molecules: charge (ion-exchange chromatography), hydrophobicity (hydrophobic interaction chromatography), size (gel filtration), and ability to bind various ligands, for example antibodies (affinity chromatography). All of these are variants of liquid chromatography, because protein molecules do not exist in the gas phase. In proteomic analysis, a combination of liquid chromatography and mass spectrometry (chromatography-mass spectrometry) is often used; in fact, it was the creation and implementation of mass spectrometry that led to a leap in the development of proteomics.

Finally, proteomics methods include immunochemical analysis using monoclonal antibodies to individual antigenic determinants, linear and conformation-dependent, including a number of cryptic epitopes.

An important role when working with tissue sections is played by immunohistochemical research methods based on specific antigen-antibody interactions. Immunohistochemical methods are highly sensitive and specific, allowing the determination of almost any antigen of interest (the scope of application of the method is limited only by the antibody library available).

Detection of bound antibodies is carried out using enzyme or fluorescent labels. In clinical practice, enzyme labels are more common, since the immunofluorescence method, although more sensitive and specific, requires expensive equipment. In addition, fluorescent dyes have a short shelf life. Some techniques involve the use of polymer carriers for antibodies, which increases the sensitivity of the reaction.

The final stage of such a labor-intensive and multi-stage research is protein identification using databases (bioinformatics).

Bioinformatics, as an applied science, makes it possible not only to store, analyze and process the enormous amounts of data needed for scientific and diagnostic procedures, but also to infer the functional properties of protein molecules from data on the structure of the genome. Thus, even with practically no information on how groups of molecules interact, or on their functions and properties, it is in some cases possible to determine the characteristics of the object being studied with a high degree of probability.

Proteomics as a foundation for scientific research with subsequent implementation of results into clinical practice within the framework of the principles and objectives of translational medicine

Research often requires analysis of a large number of similar samples. Each study, however, involves material and time costs, which can be minimized by the tissue matrix (tissue microarray) method: libraries of tissue samples are created so that many sections can be examined simultaneously on a single slide. The typical sequence of operations for research of this kind is as follows:

Sampling (cells, tissue, biological fluid);

Sample preparation (cell lysis, protein extraction);

Two-dimensional polyacrylamide gel electrophoresis;

Visualization of protein spots on the gel;

Electropherogram analysis (number of spots, their location);

Isolation of gel areas containing individual protein spots;

Cleavage of individual proteins (trypsinization) directly in the gel;

Mass spectrometric analysis (determination of amino acid sequences of individual protein fragments);

Identification of each protein and measurement of its concentration, documentation, processing of results;

Interpretation of the obtained data using bioinformatics methods - analysis of databases, obtaining a differential profile of proteins.

Using this procedure, new protein markers have already been discovered and impressive results have been obtained in the field of cardiovascular proteomics and oncoproteomics.
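The trypsinization and identification steps of the procedure above can be sketched in code. The example below is a minimal illustration of in silico digestion and peptide mass matching (peptide mass fingerprinting), which is one common way of comparing measured peptide masses with database sequences; it is not necessarily the method used by the authors. It assumes the usual trypsin rule (cleavage after lysine or arginine, except before proline), no missed cleavages and unmodified peptides; the function names, the toy sequence and the 0.01 Da tolerance are hypothetical.

```python
import re

# Monoisotopic residue masses (Da) of the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # mass of H2O added for the free peptide termini


def trypsin_digest(sequence):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def peptide_mass(peptide):
    """Monoisotopic mass (Da) of an unmodified, uncharged peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def match_peaks(observed_masses, sequence, tolerance=0.01):
    """Return the tryptic peptides whose mass lies within tolerance of a peak."""
    matches = []
    for peptide in trypsin_digest(sequence):
        mass = peptide_mass(peptide)
        if any(abs(mass - peak) <= tolerance for peak in observed_masses):
            matches.append(peptide)
    return matches


if __name__ == "__main__":
    toy_protein = "GASKLVERPTKMFR"  # hypothetical sequence, for illustration only
    for pep in trypsin_digest(toy_protein):
        print(pep, round(peptide_mass(pep), 4))
```

In practice, a database search scores how many measured peaks each candidate protein explains; the sketch only shows the matching of individual peptides.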

Particular aspects of proteomics

The two main types of proteomics are structural and functional. The first one studies the structure of individual proteins, while the second considers them in interaction with other proteins, exploring the conformational, biochemical and functional changes that occur. The set of all cell proteins that interact with a specific target protein molecule is called the “interactome.”

Primary diagnostic purposes are primarily served by structural proteomics, while functional proteomics is more a path of scientific research, as well as the foundation for the development of fundamentally new drugs that work with specific and individual pharmacotherapeutic targets at the cellular and molecular level.

Blood plasma proteomics

Among all body tissues, blood plasma most fully reflects the protein composition of the body: the plasma proteome includes about 1/10 of all proteins present in the body. The proteins present in plasma include:

Proteins functioning in plasma;




Proteins passing transiently through plasma;

Intracellular proteins that enter the plasma during destruction of cells or an increase in their permeability;

Proteins that are absent normally and secreted by malignant cells;

Foreign proteins.

At least 1/2 of plasma proteins exist in the form of multiprotein complexes. With the help of special molecular tags introduced into the protein molecule, it is possible to isolate such complexes and study them further in order to characterize a particular interactome.

To date, more than 10,000 plasma proteins have been identified based on mass spectrometric analysis of one or two peptides of each protein, and more than 3,000 proteins based on the identification of two or more peptides. Almost 900 plasma proteins have been identified with 95% confidence.

The possibilities offered by proteomic analysis of blood plasma are very attractive. However, plasma as a standard test sample also has a number of significant disadvantages. These include a very wide range of protein concentrations (up to 10 orders of magnitude) and the predominance of proteins of little diagnostic significance. When studying changes in the plasma proteome, for example in cardiovascular disease, one must first find a way to remove these uninformative proteins, which poses a significant challenge. In terms of sensitivity and specificity, the optimal approach would therefore be to study a sample obtained by biopsy of the target organ, which, however, is not always feasible.

It should be noted that, despite intensive research in this area, the rate of introduction of new biomarkers into clinical practice remains low. This is explained by both objective and subjective reasons. Among them are a predominantly empirical approach to organizing research without proper theoretical justification, insufficiently developed infrastructural links between research centers, the lack of a unified nomenclature, and problems with the systematization of available data. To a considerable extent, the factual data remain scattered, since the pace of their accumulation outstrips the capacity of science to integrate them.

Cardiovascular proteomics

This section of proteomics is one of the most intensively developing. Databases have already been created on hundreds of proteins of the myocardial proteome, the levels of which change in chronic and acute cardiovascular pathologies. The greatest progress has been made in the study of dilated cardiomyopathy. With this disease, the content of more than 100 proteins changes, which can be divided into 3 main groups:

Proteins associated with energy and metabolism;

Stress-inducible proteins;

Proteins providing contractile functions and formation of the cytoskeleton.

These results are fully consistent with modern ideas about the pathogenesis of dilated cardiomyopathy.

Progress in studying the pathogenesis of coronary heart disease and chronic heart failure is less significant. It is not always possible to model these pathologies adequately: some results obtained in animal models are not consistent with those in humans. Most of the reliable results concern the role of the so-called heat shock proteins (Hsp27) in the development and prevention of coronary heart disease and chronic heart failure. Particular attention is paid to the study of the proteome in reperfusion syndrome. After reperfusion injury, changes are detected in the structure of contractile proteins: MLC-2 (myosin light chain 2) and all three proteins of the troponin complex. The signaling mechanisms involved in the pathogenesis of reperfusion syndrome are being studied, although the complete picture of protein interactions has not yet been established. The phenomenon of remote preconditioning of the myocardium before ischemic injury has also been studied: a hypoxic state is created first in some other organ and then in the heart, which reduces reperfusion damage. However, to date it has not been possible to identify candidate molecules for the role of humoral mediators of preconditioning.

Studying the proteomics of atherosclerosis is difficult due to the significant functional heterogeneity of the endothelial tissue phenotype. However, models of the protein profile of atherosclerotic plaques have been obtained that show changes in the content of about 80 proteins, including Hsp27, crystallins, tumor necrosis factor α, cathepsins and peroxiredoxins. To create biomarkers of atherosclerosis, it is proposed to study the profiles of plasma proteins associated with inflammation. The secretion of proteins by atherosclerotic plaques in vitro is also being studied.

In chronic heart failure, the only clinically useful biomarker is B-type natriuretic peptide. For coronary heart disease, the number of biomarkers is much larger: cardiac troponins, creatine kinase, etc. However, their content increases only in the later stages of ischemia, so a search is underway for new biomarkers that allow its early stages to be diagnosed. Another area of interest is biomarkers specific to ischemia (rather than myocardial necrosis). At the moment, there is only one such marker: ischemia-modified albumin (IMA). However, its low specificity makes it difficult to use other than in combination with traditional biomarkers.

Studying the cardiac proteome poses significant challenges. The most accurate method of analysis would be a biopsy, but it is difficult to perform. In the case of studying blood plasma, identifying among the huge mass of proteins those that could have clinical significance is an extremely difficult task. In this regard, in animal studies, perfusion of isolated hearts with blood-substituting solutions is often used, followed by the study of proteins released by tissues into the solution. Another direction is the study of pericardial fluid. Thus, in patients undergoing cardiac surgery, the level of the protein H-FABP (heart-type fatty acid binding protein) in the pericardial fluid was examined. It has been found that the level of this pericardial fluid protein, which is absent from the blood plasma, increases during ischemia.

Proteomics of lung diseases

When studying lung diseases, from the point of view of proteomics, lung tissue, fluid lining the epithelium, alveolocytes, and blood plasma are used as samples.

To study the proteome of the fluid lining the epithelium, bronchoalveolar fluid is used as a sample. Some lung tissue-specific proteins, such as glutathione-transferase and surfactant protein B, are significantly more abundant in this fluid than in plasma. Changes in bronchoalveolar fluid are studied in various diseases: sarcoidosis, cystic fibrosis, mesothelioma, idiopathic fibrosing alveolitis, etc. The study of bronchoalveolar fluid also makes it possible to isolate alveolar macrophages for subsequent assessment of their proteomic profile.

To obtain lung tissue samples, invasive techniques are necessary. These studies are mainly aimed at assessing proteome changes in lung cancer. A study by D.P. Carbone found that the content of SUMO-2 (small ubiquitin-like modifier 2), thymosin β4 and ubiquitin correlates with prognosis in non-small cell lung cancer. Studies have been conducted to identify protein patterns that distinguish invasive tumors from normal bronchial epithelium. To increase the reliability of the results, laser microdissection was used when obtaining samples to prevent the capture of healthy tissue. However, long-term clinical trials will be required before new biomarkers can be introduced into clinical practice.

To build proteomic profiles of adenocarcinomas, blood plasma studies are also used. Thus, when labeled with radioactive oxygen, 211 proteins were found whose levels increased in lung adenocarcinoma in mice, and 246 proteins whose levels decreased.


The main objectives of oncoproteomics are:

Construction of proteomes and analysis of their dynamics during the emergence and development of various tumors;

Identification of cell signaling pathways leading to tumorigenesis;

Identification of markers for the diagnosis of cancer and for monitoring the response of the tumor and the body to surgery and to different types of therapy;

Determination of the immune response to tumorigenesis.

Tumor markers are macromolecules (usually proteins with a lipid or carbohydrate component) whose presence and concentration in blood plasma and/or other biological fluids correlate to a certain extent with the presence and growth of a malignant tumor. Among the wide variety of indicators used in the diagnosis of tumors, there are both specific tumor markers and substances whose concentration can change in various pathological processes, including tumors. The most specific tumor markers, practically absent in a healthy body, include embryonic antigens (whose synthesis stops in the early stages of embryonic development and is derepressed during malignant transformation): carcinoembryonic antigen and α-fetoprotein. Tumor-specific antigens are molecules (secretory products or membrane glycoproteins) expressed more intensely by tumor cells than by normal cells. These include CA 19-9 and CA 15-3 (membrane glycoproteins), as well as prostate-specific antigen (PSA), a secretory product of prostate glandular cells. In addition, hormones (human chorionic gonadotropin) and substances of other groups (thyroglobulin, β2-microglobulin, etc.) can act as tumor markers. To predict the course of the disease, proteins that are markers of proliferative activity and proteins that regulate apoptosis are examined (Fig. 1).

The areas of clinical application of tumor markers are as follows:

Early diagnosis of cancer;

Monitoring and evaluation of treatment effectiveness;

Determination of prognosis.

Based on the above, the main requirements for a tumor marker are sufficiently high sensitivity and specificity, correlation with tumor volume, and the ability to provide information about the location of the tumor.

The sensitivity and specificity of most currently available tumor biomarkers are often insufficient. Markers whose sensitivity at a specificity of 95% exceeds 50% are considered clinically useful, and only a few of them demonstrate a sensitivity of more than 70% at this level of specificity. There are two approaches to the search for new tumor markers: the first involves targeted research based on modern knowledge about carcinogenesis, testing specific hypotheses; the second is an empirical search that compares the proteomes of normal and tumor cells, or the serum protein profiles of healthy individuals and patients, with and without risk factors.
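The criterion of "sensitivity at 95% specificity" can be made concrete with a small sketch. It assumes that a higher marker level indicates disease and that the cutoff is chosen from the healthy group alone; the function name and all numbers are hypothetical illustrations, not clinical data.

```python
import math


def sensitivity_at_specificity(healthy, patients, specificity=0.95):
    """Return (cutoff, sensitivity) for the rule "marker level > cutoff is positive".

    The cutoff is the smallest healthy value such that at least the requested
    fraction of healthy subjects lies at or below it (true negatives).
    """
    if not healthy or not patients:
        raise ValueError("both groups must contain at least one value")
    ordered = sorted(healthy)
    # Number of healthy subjects that must test negative; the small epsilon
    # guards against floating-point noise in specificity * len(ordered).
    needed = max(1, math.ceil(specificity * len(ordered) - 1e-9))
    cutoff = ordered[needed - 1]
    true_positives = sum(1 for level in patients if level > cutoff)
    return cutoff, true_positives / len(patients)


if __name__ == "__main__":
    healthy_levels = list(range(1, 21))                        # hypothetical
    patient_levels = [5, 8, 12, 18, 22, 25, 30, 31, 40, 50]    # hypothetical
    cutoff, sens = sensitivity_at_specificity(healthy_levels, patient_levels)
    print(f"cutoff = {cutoff}, sensitivity = {sens:.0%}")
```

With these toy values, 19 of the 20 healthy subjects fall at or below the cutoff, and the marker detects 6 of the 10 patients, so it would pass the 50% threshold mentioned above but not the 70% one.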

Let's consider the possibilities of using modern tumor markers in the 3 aspects mentioned above.

1. Diagnostics. Due to insufficient sensitivity, most tumor markers are unsuitable for screening studies in the general population. However, some of them can be used effectively for early diagnosis in risk groups, where the likelihood of the disease is initially higher. Thus, PSA screening is carried out in men over 50 years of age; screening for α-fetoprotein (a marker of hepatocellular carcinoma), in patients with liver cirrhosis; and screening for calcitonin (a marker of medullary thyroid cancer), in persons with a family history.

Fig. Proteomics technologies in the diagnosis of cancer. [Figure labels: direct methods; indirect methods; gene expression analysis; mass spectrometry; immunohistochemical (IHC) techniques; IHC profiles; early diagnosis in risk groups; diagnosis of relapses; treatment monitoring; assessing the prognosis; choosing a treatment method; monitoring treatment effectiveness; identifying side effects; obtaining new antibodies.]

2. Monitoring the course of the disease. Currently, tumor markers are most widely used for these purposes. A sign of successful radical surgery is a persistent decrease in marker concentration. Its subsequent increase indicates, depending on the time and rate of growth, the presence of a residual tumor, the occurrence of a relapse or isolated metastasis.

3. Predicting the course of the disease and determining treatment tactics. The level of many tumor markers correlates with the volume of the primary tumor and increases sharply with local and distant metastasis. For example, in chronic lymphocytic leukemia, the serum level of the marker deoxythymidine synthetase correlates with the course of the disease (stable or progressive).

To predict the course of the disease, the expression of markers of proliferative activity is also determined: Ki-67 protein, PCNA, cyclins (for example, cyclin D1) and inhibitors of cyclin-dependent kinases. The levels of proteins that regulate apoptosis (Bcl-2, Bcl-x, Bax, Bak, etc.) are of great prognostic significance. Recently, the apoptosis inhibitors survivin and telomerase have been intensively studied. Increased concentrations of these molecules have been demonstrated in tumors of many, although not all, locations. The level of their expression correlates with the stage of tumor development. For some types of carcinomas, the course of the disease has been shown to correlate with the level of p53 protein expression, as well as with the number of mutant forms of this protein.

The type of marker studied and the significance of the result vary depending on the histological structure and location of the tumor. The final conclusion is made after a comprehensive assessment together with other factors. An important task in oncology is the identification of signaling pathways involved in carcinogenesis. The role of apoptosis regulatory proteins in this process is beyond doubt: p53, proteins of the Bcl-2 family, etc. The focus of functional proteomics is the study of the interactomes of these proteins, in other words, the reconstruction of the molecular interactions in which these proteins are involved.

The main problem in introducing oncoproteomics into practice is the difficulty of training oncologists to read oncotranscriptome and oncoproteome maps.

Types of protein molecules and features of interactomes

Although many proteins carry out their functions independently, the vast majority of them require highly specific interactions with other proteins in the body to exhibit their biological activity. Examples of various protein-protein interactions found in complex biological systems:

Protein-protein interactomes in strictly defined cellular compartments;

Messenger proteins that interact with receptors on the outer surface of the cell membrane, which is a necessary condition for triggering signaling cascades;

Proteins that form network and structural interactions, structural relationships at the intercellular level;

Enzyme inhibitors;


Modification (often followed by denaturation) due to the action of enzymes;

Interactions of protein subunits leading to allosteric effects in the composition of multimeric biocomplexes;

Protein-protein interactions underlying the motor functions of individual organelles, organs or the body as a whole (muscle contraction).

Protein interactions are usually subdivided into stable and transient, and both types can be provided by both strong and weak intermolecular bonds.

Stable interactions are observed in proteins that consist of several subunits (polypeptide chains) forming a complex. Typical examples of complex protein molecules consisting of several stably linked polypeptide chains are hemoglobin and polymerases.

Transient protein-protein interactions are involved in the control of most intra- and extracellular signaling processes. Transient interactions usually require a specific set of conditions, such as phosphorylation, conformational changes, or localization to a discrete region of the cell. Transiently interacting proteins are involved in a wide range of cellular processes, including catalytic protein modification, transport, storage, signaling, regulatory, receptor and motor functions.

Transient protein-protein interaction is also observed during the transport of proteins through membrane pores, during the deformation of native proteins, at certain stages of the translation cycle, and the reformation of cellular structures during the cell cycle (cytoplasmic microfilaments, nuclear pore complex, etc.).

Proteins can bind to each other through hydrophobic/hydrophilic bonds, van der Waals forces, and ionic bridges between binding domains on each protein. These domains may occupy a small area of the protein surface and be only a few amino acid residues long, or they may span hundreds of amino acids; the strength of binding depends on the size and properties of the binding domain. One of the most common structural motifs that provides stable binding between proteins is the leucine zipper.

In the leucine zipper, the amino acid leucine occurs at approximately every 7th position of the α-helix, so that the leucine residues line up on one side of the helix and form an amphipathic helix with one hydrophobic face. The leucine zipper thus forms a dimeric protein by linking two parallel α-helices together like a zipper.
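The regular spacing of leucine residues lends itself to a simple sequence scan. The sketch below looks for leucine repeated at seven-residue intervals (a heptad repeat); the function name, the minimum of four repeats and the toy sequences are illustrative assumptions. Real predictors also weigh the other positions of the heptad, which this example ignores.

```python
def find_leucine_heptads(sequence, min_repeats=4):
    """Return 0-based start positions where leucine recurs every 7th residue.

    A position i is reported if residues i, i+7, i+14, ... are all "L" for
    min_repeats consecutive heptads. Only the leucine spacing is checked.
    """
    span = 7 * (min_repeats - 1)  # distance from the first to the last leucine
    hits = []
    for start in range(len(sequence) - span):
        if all(sequence[start + 7 * k] == "L" for k in range(min_repeats)):
            hits.append(start)
    return hits


if __name__ == "__main__":
    heptad_like = "LAEKQRA" * 4     # hypothetical: leucine every 7th residue
    eight_spaced = "LAEKQRAA" * 4   # hypothetical: leucine every 8th residue
    print(find_leucine_heptads(heptad_like))
    print(find_leucine_heptads(eight_spaced))
```

The second toy sequence, with leucine at every eighth position, is not reported, because its leucines would drift around the helix instead of lining up on one face.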

The two Src homology (SH) domains, SH2 and SH3, are examples of transient binding domains that bind short peptide sequences and are commonly found in signaling proteins. The SH2 domain "recognizes" only peptide sequences with phosphorylated tyrosine residues, which are often a sign of an activated protein. SH2 domains therefore play a key role in growth factor receptor signaling, in which ligand-mediated phosphorylation of tyrosine residues on the receptor allows downstream proteins to recognize these residues through their SH2 domains. SH3 domains typically recognize proline-rich peptide sequences and are commonly used by enzymes such as kinases, phospholipases, and GTPases to identify target proteins.


Proteomics, although a fundamental science, is nevertheless indispensable in solving a number of practical medical and applied scientific problems. The study of various biological fluids of the body using modern proteomic techniques can provide the diagnostician with the information needed to make an unambiguous diagnosis or to assess the risk of a particular disease in a particular patient. Algorithms for preclinical and clinical monitoring of patients that combine a set of laboratory diagnostic procedures, including genomic, transcriptomic and proteomic methods of analysis, with bioinformatic techniques for data processing and analysis are the key to successfully identifying a pathological condition in the latent stage, verifying the diagnosis, determining and possibly predicting the type and course of the disease, and monitoring the response of the patient's body to the therapy used.



Proteomics is a functional science whose main subject of study is the proteome, the entire set of proteins that are produced or modified by an organism or system. Proteomics has made it possible to identify many more proteins than were known before it emerged as a science. The proteome varies with time and with the demands or stresses to which cells or organisms are exposed. Proteomics is an interdisciplinary field that has been largely driven by genome research projects. It covers the study of proteomes at the overall level of protein composition, structure and activity. Functional proteomics is often cited as an important component of functional genomics.

Subject of study

Defining proteomics is not as simple as it might seem at first glance. The term typically refers to the large-scale experimental analysis of proteins and proteomes, but it is often used more narrowly to refer to protein purification and mass spectrometry.

After genomics and transcriptomics, proteomics is the next step in the study of biological systems. It is much more complex than genomics because the genome of an organism is more or less constant, whereas the proteome differs from cell to cell and from time to time. Different genes are expressed in different cell types, which means that even the basic set of proteins produced in a cell has to be identified.

History of study

Proteomics, the large-scale study of proteins, is a direction in biochemistry that emerged relatively recently. In the past, protein levels were inferred from RNA analysis, but it turned out that mRNA levels do not correlate well with protein content. mRNA is not always translated into protein, and the amount of protein produced for a given amount of mRNA depends on which gene it is transcribed from, as well as on the current physiological state of the cell. Proteomics confirms the presence of a protein and provides a direct estimate of the amount present.

Post-translational modifications

Not only does translation from mRNA introduce differences, but many proteins also undergo a wide range of chemical modifications after translation. Many of these post-translational modifications are critical to protein function.


One such modification is phosphorylation, which occurs on many enzymes and structural proteins during cellular signaling. The addition of a phosphate group to particular amino acids, most commonly serine and threonine (mediated by serine/threonine kinases) or less commonly tyrosine (mediated by tyrosine kinases), makes the protein a target for binding to or interacting with a distinct set of other proteins that recognize the phosphorylated domain.

Because protein phosphorylation is one of the most studied protein modifications, many “proteomic” efforts are aimed at identifying the set of phosphorylated proteins in a specific cell or tissue type under specific circumstances.


Ubiquitin is a small protein that can be attached to certain protein substrates by enzymes called E3 ubiquitin ligases. Determining which proteins are poly-ubiquitinated helps to understand how protein pathways are regulated. Likewise, once a researcher has determined which substrates are ubiquitinated by each ligase, it is useful to determine the set of ligases expressed in a particular cell type.

Additional modifications

In addition to phosphorylation and ubiquitination, proteins can undergo (among others) methylation, acetylation, glycosylation, oxidation and nitrosylation. Some proteins undergo all of these modifications, often in time-dependent combinations. This illustrates the potential difficulty of studying protein structure and function.
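A short calculation shows why combinations of modifications complicate the analysis. The sketch below uses the standard monoisotopic mass shifts of four of the modifications listed above; the function names are hypothetical, and the count of states rests on the simplifying assumption that each site is independently either modified or unmodified.

```python
# Monoisotopic mass shifts in daltons for some of the modifications named above.
# Glycosylation is left out because glycan masses vary widely.
PTM_MASS_SHIFT = {
    "phosphorylation": 79.96633,  # adds HPO3
    "acetylation": 42.01057,      # adds C2H2O
    "methylation": 14.01565,      # adds CH2
    "oxidation": 15.99491,        # adds O
}


def modified_mass(unmodified_mass, modifications):
    """Add the mass shift of every listed modification to the unmodified mass."""
    return unmodified_mass + sum(PTM_MASS_SHIFT[name] for name in modifications)


def count_modification_states(n_sites):
    """Assuming each site is simply modified or not, n sites give 2**n states."""
    return 2 ** n_sites


# Hypothetical peptide of mass 1000.0 Da carrying two phosphate groups.
print(modified_mass(1000.0, ["phosphorylation", "phosphorylation"]))
# Ten independent sites already allow more than a thousand distinct forms.
print(count_modification_states(10))
```

Because each modification changes the mass by a characteristic amount, a mass spectrometer can in principle distinguish the modified forms, but the number of possible forms grows exponentially with the number of sites.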

Different proteins are produced under different conditions. A cell may make different sets of proteins at different times or under different conditions, such as during development, cell differentiation, the cell cycle, or carcinogenesis. Proteome complexity is increased further, as already mentioned, by the fact that most proteins can undergo a wide range of post-translational modifications.

Therefore, a proteomics study can become complex very quickly, even if its topic is restricted. For more ambitious tasks, such as the search for a biomarker of a specific cancer subtype, a proteomics scientist may choose to study multiple serum samples from multiple cancer patients to minimize confounding factors. Thus, complex experimental designs are sometimes necessary to account for the dynamic complexity of the proteome.

Differences from genomics

Proteomics provides a different level of understanding than genomics for many reasons:

  1. The level of transcription of a gene provides only a rough estimate of its level of translation into protein. An mRNA produced in abundance may be degraded rapidly or translated inefficiently, resulting in the production of a small amount of protein.
  2. As mentioned above, many proteins undergo post-translational modifications that greatly affect their activity. For example, some proteins are not active until they become phosphorylated. Techniques such as phosphoproteomics and glycoproteomics are used to study post-translational modifications.
  3. Many transcripts give rise to more than one protein, through alternative splicing or alternative post-translational modifications.
  4. Many proteins form complexes with other proteins or RNA molecules and act only in the presence of these other molecules.
  5. The rate of protein degradation plays an important role in protein content.
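The effect of translation efficiency and protein degradation on protein content can be made concrete with a simple steady-state model. This is a sketch under a strong simplifying assumption that is not taken from the text: protein is produced at a rate proportional to the mRNA level and removed by first-order degradation, so the steady-state amount equals mRNA × translation rate / degradation rate. The function name and all numbers are hypothetical.

```python
def steady_state_protein(mrna_copies, translation_rate, degradation_rate):
    """Steady state of dP/dt = translation_rate * mRNA - degradation_rate * P."""
    if degradation_rate <= 0:
        raise ValueError("degradation_rate must be positive")
    return mrna_copies * translation_rate / degradation_rate


# Two hypothetical genes with identical mRNA levels but different kinetics.
gene_a = steady_state_protein(100, translation_rate=10.0, degradation_rate=0.5)
gene_b = steady_state_protein(100, translation_rate=1.0, degradation_rate=2.0)
print(gene_a, gene_b)  # same transcript level, very different protein content
```

In this toy model the two genes differ forty-fold in protein content although their transcript levels are equal, which is why transcript measurements alone give only a rough estimate of protein abundance.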


One of the major factors affecting the reproducibility of proteomics experiments is the simultaneous elution of many more peptides than mass spectrometers can measure. This results in stochastic differences between experiments due to the data-dependent acquisition of tryptic peptides. Although early large-scale analyses of the yeast proteome showed considerable variability in results between laboratories, presumably due in part to technical and experimental differences between them, reproducibility has improved in more recent mass spectrometric analyses, especially with newer mass spectrometers.
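The stochastic character of data-dependent peptide selection can be illustrated with a toy simulation. The sketch below is a deliberately crude model, not a description of any real instrument: it assumes that all peptides co-elute, that measured intensities fluctuate by a uniform multiplicative noise between runs, and that only the most intense peptides are selected in each run. All names and parameter values are hypothetical.

```python
import random


def select_top_n(intensities, n):
    """Return the set of peptide ids with the n highest intensities."""
    ranked = sorted(intensities, key=intensities.get, reverse=True)
    return set(ranked[:n])


def simulate_run(true_intensities, noise, rng):
    """Apply multiplicative noise to every peptide intensity for one run."""
    return {
        peptide: value * rng.uniform(1 - noise, 1 + noise)
        for peptide, value in true_intensities.items()
    }


rng = random.Random(0)  # fixed seed so that the sketch is repeatable
true_intensities = {f"pep{i}": rng.uniform(1.0, 100.0) for i in range(200)}

# Two replicate runs of the same sample, each selecting 20 of 200 peptides.
run1 = select_top_n(simulate_run(true_intensities, 0.3, rng), 20)
run2 = select_top_n(simulate_run(true_intensities, 0.3, rng), 20)
overlap = len(run1 & run2) / 20
print(f"fraction of peptides selected in both runs: {overlap:.2f}")
```

Peptides whose intensities lie close to the selection cutoff are picked in one run and missed in the other, so two runs of the same sample need not report the same peptides.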

Research methods

In proteomics, there are many methods for studying proteins. Typically, proteins are detected using either antibodies (immunoassays) or mass spectrometry. If a complex biological sample is being analyzed, either a very specific antibody must be used in a quantitative dot blot (QDB) analysis, or biochemical separation must be performed before the detection step.

Protein detection using antibodies (immunoassays)

Antibodies to specific proteins or to their modified forms have long been used in biochemistry and cell biology studies. They are among the most common tools used by molecular biologists today. There are several specific methods and protocols that use antibodies for protein detection. For decades, the enzyme-linked immunosorbent assay (ELISA) has been used to detect and quantify proteins in biological samples. Western blotting can be used to detect and quantify individual proteins: a complex protein mixture is first separated using SDS-PAGE, and the protein of interest is then identified using an antibody.

Modified proteins can be studied by developing an antibody specific for that modification. For example, there are antibodies that only recognize certain proteins when they are tyrosine-phosphorylated, known as phospho-specific antibodies. In addition, there are antibodies specific for other modifications. They can be used to determine the set of proteins that have undergone modification.

Proteomics in medicine

Disease detection at the molecular level is driving a new revolution in diagnosis and treatment. Digital immunoassay technology has improved detection sensitivity to the attomolar range. This capability has the potential to unlock new advances in diagnostics and therapy, but such technologies have so far been confined to manual procedures that are not well suited to efficient routine use.
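To give a sense of what attomolar sensitivity means, the number of molecules in a sample can be estimated from the concentration, the volume and the Avogadro constant. The following is a minimal sketch; the function name and the sample volume of 100 µL are hypothetical values chosen only for illustration.

```python
AVOGADRO = 6.02214076e23  # molecules per mole (exact SI value)


def molecules_in_sample(concentration_molar, volume_liters):
    """Number of molecules = concentration (mol/L) * volume (L) * Avogadro constant."""
    return concentration_molar * volume_liters * AVOGADRO


# 1 attomolar = 1e-18 mol/L; hypothetical sample volume of 100 microliters.
count = molecules_in_sample(1e-18, 100e-6)
print(round(count))  # roughly 60 molecules
```

At a concentration of 1 aM, a 100 µL sample therefore contains only about sixty molecules of the analyte, which is why detection at this level is described as counting individual molecules.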

Although protein detection with antibodies is still very common in molecular biology, other methods have been developed that do not rely on antibodies. These methods offer various advantages: they can often determine the sequence of a protein or peptide, they can have higher throughput than antibody-based methods, and sometimes they can identify and quantify proteins for which no antibodies exist.

Proteomics methods

One of the earliest methods for protein analysis was Edman degradation (first described in 1950 and automated in 1967), in which a single peptide undergoes repeated steps of chemical degradation to determine its sequence. Such early methods have mostly been superseded by technologies that provide higher throughput. The methods used also vary between the different areas of proteomics.

Basic separation methods

Analyzing complex biological samples requires reducing their complexity. This can be done using one-dimensional or two-dimensional separation. More recently, online methods have been developed in which individual peptides are separated using reversed-phase chromatography and then ionized directly by electrospray ionization (ESI).

Hybrid technologies

There are several hybrid technologies that use antibody-based purification of individual analytes followed by mass spectrometric analysis to identify and quantify them. Examples of these methods are the MSIA (mass spectrometric immunoassay) method developed by Randall Nelson in 1995 and the SISCAPA (Stable Isotope Standards and Capture by Anti-Peptide Antibodies) method introduced by Leigh Anderson in 2004.

Comparative proteomic analyses can reveal the role of proteins in complex biological systems, including reproduction. For example, treatment with the insecticide triazophos increases the content of male accessory gland proteins (Acps) in the brown planthopper (Nilaparvata lugens (Stål)); these proteins can be transferred to females through mating, resulting in increased fecundity in females. To identify changes in the types of accessory gland proteins (Acps) and reproductive proteins that mated females received from male planthoppers, the researchers performed a comparative proteomic analysis of mated N. lugens females. The results showed that these proteins are involved in the reproductive process of adult female and male N. lugens.

High-throughput proteomic technologies

Proteomics has steadily gained momentum over the past decade. Some of the approaches it has developed are entirely new, while others build on established methods. Methods based on mass spectrometry and microarrays are the most common technologies for the large-scale study of proteins.

Mass spectrometry and profiling

Currently, two mass spectrometry-based methods are used for protein profiling. The better known and more widely used method uses high-resolution two-dimensional electrophoresis (2DE) to separate proteins from different samples in parallel, followed by selection and staining of differentially expressed proteins, which are then identified by mass spectrometry. Despite the advances in 2DE and the maturity of the method, it has its limits. The main problem is the inability to resolve all the proteins in a sample, given the wide range of their expression levels and their differing properties.

The second quantitative approach uses stable isotope tags to differentially label proteins from two different complex mixtures. Here, the proteins in a complex mixture are first labeled isotopically and then digested to produce labeled peptides. The labeled mixtures are then combined, and the peptides are separated by multidimensional liquid chromatography and analyzed by tandem mass spectrometry. Isotope-coded affinity tag (ICAT) reagents are widely used isotope tags. In this method, the cysteine residues of proteins are covalently attached to the ICAT reagent, which reduces the complexity of the mixtures by excluding peptides that do not contain cysteine.
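How restricting the analysis to cysteine-containing peptides reduces complexity can be sketched in a few lines. The example below assumes a simplified trypsin rule (cleavage after lysine or arginine unless the next residue is proline), ignores missed cleavages, and uses a hypothetical toy sequence and hypothetical peak intensities; the function names are likewise illustrative and do not belong to any particular software package.

```python
def tryptic_digest(protein):
    """Cleave after K or R, except before P (simplified trypsin rule)."""
    peptides, current = [], ""
    for i, residue in enumerate(protein):
        current += residue
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(current)
            current = ""
    if current:
        peptides.append(current)
    return peptides


def cysteine_peptides(peptides):
    """Keep only the peptides that a cysteine-reactive tag could capture."""
    return [peptide for peptide in peptides if "C" in peptide]


def heavy_to_light_ratio(heavy_intensity, light_intensity):
    """Relative abundance of a peptide in the heavy-labeled versus light-labeled sample."""
    return heavy_intensity / light_intensity


toy_protein = "MACDEKGHIRPLKTCYRAAAK"  # hypothetical sequence
all_peptides = tryptic_digest(toy_protein)
tagged = cysteine_peptides(all_peptides)
print(all_peptides)  # four peptides after digestion
print(tagged)        # only the two cysteine-containing peptides remain
print(heavy_to_light_ratio(3.0e6, 1.5e6))  # hypothetical peak intensities
```

In this toy case half of the peptides are removed from the analysis; the intensity ratio of the heavy and light forms of each remaining peptide then serves as the estimate of relative protein abundance between the two mixtures.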

Proteomics, genomics and metabolomics are new directions in biology characterized by complexity and innovation, and mastering them requires considerable specialized preparation.