SARS CoV-2 Spike Protein
Yan Toren and David Marcey
© 2021 David Marcey

I. Introduction

II. Structure (prefusion)

III. Furin Cleavage and the Closed to Open Transition

IV. Host Receptor Binding

V. Structure (postfusion)

VI. Mutations

VII. References


Please leave comments/suggestions or please acknowledge use of this site by visiting our feedback page

[Back to OMM Exhibits]

This exhibit displays molecules in the left part of the screen, and text that addresses structure-function relationships of the molecules in the right part (below). Use the scrollbar to the right to scroll through the text. If you are using browser other than Firefox (the recommended browser for this site), be sure to allow popups. In Chrome, you can click on the popup blocker icon in the right part of the address bar..

To evoke renderings of the molecule that illustrate particular points, click the radio buttons:

Please click the load PDB buttons, , when present.

To reset the molecule, use the reset buttons:

If you are a practiced user, you can create the illusion of 3D if you turn on stereo mode. In this mode, when you train one eye on one image and the other eye on the other image, you will elicit a centered image that appears truly 3-dimensional. To turn on stereo mode when viewing a scene, return here and use this button . To turn off stereo mode, return here and use this button .

Return to the OMM Exhibits

I. Introduction

Coronaviruses (CoVs) belong to a large family of viruses that display a "crown" (corona) of surface Spike (S) proteins on their membranes. It is these S proteins that bind to host cell receptor proteins to initiate an infection cycle. The coronaviruses are enveloped, positive-sense, single stranded RNA viruses (see Figure 1).

Most coronaviruses circulate among animal "reservoirs" (e.g. pigs, camels, bats, cats), but some can move into human hosts, causing zoonotic diseases that present as a mild to moderate upper-respiratory tract illness. Despite the relatively benign human symptoms caused by most coronaviruses, a few novel coronaviruses have emerged from animal reservoirs over the past two decades that seriously impact infected individuals.

In 2002, a coronavirus, SARS-CoV, emerged and caused Severe Acute Respiratory Syndrome (SARS). In 2012, the Middle East Respiratory Syndrome coronavirus (MERS-CoV) jumped from camels to humans and elicits severe respiratory pathology. The most recent coronavirus to zoonotically infect humans is SARS CoV-2 which has generated a global pandemic (2019). This SARS-CoV-2 virus elicits Coronovirus Disease (COVID-19) that has caused serious and widespread illness and death.

Enveloped viruses contain a lipid bilayer envelope that must achieve the energetically unfavorable fusion of the viral and host cell membranes in order to deliver the genetic material of the virion into the cytoplasm of the host during the initial stages of infection. The S Proteins of the coronaviruses are essential to this membrane fusion and the molecular "gymnastics" that these proteins conduct in order to elicit fusion are astounding!

To the left is a model of the SARS-CoV-2 S protein. This protein is a glycosylated class I viral fusion protein that binds to the ACE2 receptor protein (angiotensin converting enzyme-2), and in a series of molecular events, initiates viral entry into host cells by fusion with a host cell membrane. Fusion can occur between the viral membrane and the host plasma membrane if particular proteases are present at the cell surface, or between the viral and endososomal membrane if the virus is endocytosed after ACE-2 binding (e.g. Verma and Subbarao, 2021). In either case, the virus has gained entry and can begin its replication cycle. The dual entry mechanisms allow the virus to attack a range of host tissues.

Cleavage of the S protein into two fragments (S1 and S2) by a furin-like proteolytic enzyme exposes an S protein region that binds to a second receptor, Neuropilin-1 (NRP1). Further cleavage by a serine protease triggers dissociation of S1 and elicits extensive conformational changes in S2, bringing the viral and host membrane into close contact for membrane fusion and entry of the virus into the cytoplasm (Kielan, 2020) - see illustrations below. For a summary of the initial infection events, see Figure 2.

This exhibit provides visualizations of some of the CoV-2 S protein structure/function relationships that are important to viral infection and also illustrates important interactions with the ACE-2 receptor protein. Example mutations found in variants of CoV-2 that affect transmissabiity and virulence are also presented.

Return to Beginning

II. The SARS Cov-2 Spike (S) Protein Structure (prefusion conformation)

As can been seen in its pre-fusion state at left, the CoV-2 S protein is a homotrimer consisting of three protomers: protomer I, protomer II, and protomer III. Each protomer has multiple domains and extensive secondary structure. The S protein is a glycoprotein and numerous glycans are seen in the structure (PDB ID: 6XR8).

The S protein shown is oriented with the viral membrane down and the potential host membrane up. Each protomer contains a receptor binding domain (RB). If one visualizes the protein as a tree, the RBDs lie in the canopy, ready to bind a host cell ACE-2 receptor, enhanced by a conformational change that causes an RBD to project upward (see below).

The prefusion trimer is cleaved in the early stages of host cell infection by a furin-like protease into the receptor binding fragment, S1, and the fusion fragment, S2, that mediates fusion of the virus' membrane and that of the host cell plasma membrane or endosomal membrane (see Figure 2). S1 contains the RBDs.

Some of the structural components important in launching an infection of a host cell can be visualized by examining a single protomer, followed by highlighting these in the prefusion trimer:

  • S1 N Terminal Domain (NTD)
  • S1 Receptor Binding Domain (RBD)
  • S1 C Terminal Domain 1 (CTD1)
  • S1 C Terminal Domain 2 (CTD2)
  • S2 Fusion Peptide (FP) and S2 Fusion Peptide Proximal Region (FPPR)
  • S2 Heptad Repeat 1 (HR1)
  • S2 Central Helix (CH)
  • S2 Connector Domain (CD)
  • S2 Linker (L)
  • Not resolved in the 6XR8 structure: S2 Heptad Repeat 2 (HR2), S2 Transmembrane Domain (TM), and S2 C Terminal (CT). These would project downward from the S2 Linker (L) in the view shown at left, with the TM domain anchoring the Spike protein in the viral membrane.

The S1 domains (NTD, RBD, CTD1, CTD2,) are seen to envelop the S2 fragment beneath.

In the trimer, the RBDs are stabilized in a tight cluster atop the Spike, in the "closed" conformation, which may hide them from host immune recognition. A conformational change elicited by proteolytic cleavage causes an upward projection of an RBD to engage in host cell receptor binding (see below). The Fusion Peptide Proximal Region (FPPR) plays an essential role in ths clamping down of the RBDs. As can be seen, a FPPR of one protomer lies on the opposite side of a CTD1 of another protomer, which in turn is adjacent to an RBD from the same protomer.

The FPPR is stabilized both by a disulfide bridge and by a salt bridge between the side chains of a lysine and an aspartate.

The CTD1 may serve as a relay between an RBD and an FPPR. Since the FPPR is directly connected to the Fusion Peptide (FP) that will be inserted into the host cell membrane after S Protein processing and profound conformational rearrangement (see section V, below), changes to the FPPR structure likely play a functional role in viral membrane-host cell membrane fusion (Cai, et al., 2020).

Further evidence of the important role of the FPPR in clamping down the RBDs is provided by the observation that mutation of an aspartate (residue 614) to a glycine (D614G) in certain CoV-2 isolates provides more efficient infection of host cells (e.g. Korber, et al., 2020). The aspartate in the CTD2 of one protomer is bridged to a lysine at the end of an adjacent protomer's FPPR, and the D614G mutation would therefore destabilize the FPPR, leading to the movement of RBSs toward the host cell membrane, thereby promoting receptor binding (see below).

As is typical for Spike glycoproteins, decoration of the CoV-2 Spike with numerous glycans can be observed. Glycans on the Spike may funtion to promote proper protein folding and priming by host proteases (Walls et al. 2020). Glycan-mediated shielding of key S protein residues may also allow the virus to evade surveillance by host immune responses, enhancing infection of host cells. To view an animated model of a glycosylated S protein embedded in a viral membrane, see Figure 3.

The fatty acid linoleic acid (LA) is observed to occupy a binding pocket in each protomer's RBD. The pocket is rich in phenylalanine side chains, which together with cysteine disulfide bridges and a tryptophan side chain form a "greasy tube" into which the hydrophobic tail of LA fits. The polar head group of LA is anchored by electrostatic attractions to an arginine and a glutamine from an adjacent RBD in the trimer.

The conformation of individual protomers is altered by LA binding. In this view, a gating helix containing two tyrosine residues can be observed to swing free of the LA binding pocket to accomodate LA when it is bound (LA not shown here). RBDs have been observed to be more loosely packed when LA is not bound, suggesting that LA has a role in maintaining the RBDs in the compacted, closed, conformation referenced above. Indeed, LA has been shown to augment remdesivir therapy in suppressing SARS CoV-2 replication and may do so by helping to lock RBDs in the clamped configuration (Toelzer, et al., 2020). [This morph of cryo-EM structures (6VXX to 6ZB4 - Walls, et al., 2020; Toelzer, et al., 2020) was created using the linear interpolation morpher provided by Karsten Theis:]

Return to Beginning

III. Furin Cleavage of the Prefusion Spike Protein and Transition From the Closed to Open Conformation

A distinctive feature of SARS Cov-2 is the presence of a polybasic sequence motif at the S1/S2 boundary, lacking in other closely related coronavirus S proteins. This motif falls within a 13 residue loop of each protomer and provides a cleavage site for a host proprotein convertase, furin (see Figure 4). [the loop is not visible in the 6XR8 cryo-EM structure and is simulated here].

Cleavage in this loop, either during viral shedding from a host cell or on approach to the cell surface of a potential target cell, exposes a "C-endR" (C-end Rule) motif at the newly generated C terminus of S1. This RRAR (Arg-Arg-Ala-Arg) sequence is the recognition motif for the Neuropilin-1 receptor that enhances S protein binding and viral uptake (see Figure 2). Furin priming increases infectivity whereas furin inhibition lowers the capacity for SARS-CoV-2 entry (Walls, et al., 2020; Wrapp et al., 2020). Note that furin cleavage separates S1 and S2, although these remain noncovalently associated until further digestion of S2 with the serine protease TMPRSS2 (e.g Hoffman, et al., 2020) - see below.

Furin cleavage in the loop also results in a conformational change of a protomer such that an RBD moves from its clamped, downward, "closed" state to project upward toward the host cell membrane in an "open" conformation that facilitates efficient interaction with the host cell ACE-2 receptor. [This morph of cryo-EM structures (6ZGI, 6ZGG - Wrobel, et al., 2020) was created using the linear interpolation morpher provided by Karsten Theis:]

Return to Beginning

IV. The ACE2 Receptor and Interaction with the The SARS Cov-2 Spike (S) Protein

At left is the structure of two RBDs of the CoV-2 S protein individually bound to a dimer of two ACE2 receptor proteins (the planes of a membrane are simulated). Note the receptors' hydrophobic membrane spanning helices embedded in the simulated planes of the bilayer. The model of the membrane planes was produced by using the PPM server at the Orientations of Proteins in Membranes (OPM) site at the University of Michigan applied to PDB 6M17.

ACE2 comprises three primary domains: the peptidase (PD), the collectrin (neck), and the linker/transmembrane/cytoplasmic domains. It has been suggested that ACE2 dimers can engage two full prefusion S proteins, provided that one of the S RBDs is the open state, since the PD of ACE2 would clash with the S trimer if an RBD in the closed conformation were bound (Yan, et al., 2020). Dimerization of the ACE2 monomers is largely based on polar interactions between residues in helices of the PD and neck domains.

Turning now to the ACE2 binding of a SARS CoV-2 S protein RBD, an exquisite fit of the molecules can be seen by examining their surface complementarity.

The Receptor Binding Domain contains a twisted, antiparallel 5-stranded β-sheet, several small β-strands, 5 main α-helices, and numerous loops.

The RBD's Receptor Binding Motif (RBM) contains most of the residues that are responsible for SARS Cov-2 binding to the the ACE2 receptor. Three pairs of disulfide bonds are observed to stabilize the RBD core and another pair connects the loops at the end of the RBM.

Numerous ACE2-RBM residues interact along the curved molecular interface in which the RBM "cradles" the N-terminal helix of ACE2 (17 RBM amino acids interact with 20 ACE2 residues).

For one example, a glutamine (Gln498) of SARS CoV-2 is observed to interact with the polar sidechains of Asp38, Tyr41, Gln42, and Lys353 of ACE2.

Another example is the two salt bridges formed between a lysine (K417) of the RBD and an aspartate (D30) of ACE2. K417 endows the surface with a positively charged patch in the groove into which the N-terminal helix of ACE2 fits. It is notable that the position occupied by K417 in the SARS CoV-2 RBD is provided by a hydrophobic valine in SARS CoV, the less infective, 2002 coronavirus. This and other subtle differences between the RBDs may contribute to the higher binding affinity of the CoV-2 Spike to ACE2, and partially account for its greater infection capabilities (Lan, et al., 2020).

Return to Beginning

V. The SARS Cov-2 Spike (S) Protein Structure (postfusion conformation)

Upon receptor binding, and after dissociation of the S1 subunit, cleavage of the S2 subunit of the trimeric Spike by proteases, either at the cell surface or within an endosome if the virus has been endocytosed, leads to dramatic molecular gymnastics that thrust the fusion peptide of the S2 subunit into the host membrane. The substantial conformational rearrangements can be seen by comparing the relative positions of the Heptad Repeat 1 (HR1), the Central Helix (CH), the Fusion Peptide Proximal Region (FPPR), and the Fusion Peptide (FP) in the pre-fusion and post-fusion S2 trimers. [The FPPR and FP were not resolved in the post-fusion, cryo-electron microscopy structure of Cai, et al, (2020) and are represented schematically here].

Shown at left, for a single S2 protomer, is a simulated transition from the pre-fusion (6XR8) to post-fusion (6XRA) state. The protein is colored in the amino-carboxy color scheme. [This morph of cryo-EM structures (6XR8, 6XRA - Cai, et al., 2020) was created using the linear interpolation morpher provided by Karsten Theis:]

A simulated transition between the pre- and post-fusion S2 trimers (6XR8, 6XRA - Cai, et al., 2020) is shown at left. This morph was created by Eric Martz and posted on the Proteopedia site*. The Heptad Repeat 1 (HR1), the Central Helix (CH) are indicated as above, with Threonine 912 highlighted to emphasize possible transition states.

Subsequent to insertion of the fusion peptide into the host membrane, further conformational dynamics of the S post-fusion trimer juxtapose the viral and host membranes. For an explanatory animation of these processes see the animation created by Jonathan Khao and Gaël McGill (Digizyme Inc.) in this popup.

Return to Beginning

VI. Mutations of the CoV-2 Spike Protein

Because of the rapid and astronomically large number of replication cycles that the Cov-2 virus undergoes as it sweeps through human populations, it is clear that numerous mutations are generated on a regular basis. While most mutations are neutral or deleterious with respect to virus survival, replication, and transmission, some confer selective advantage to the virus. It is unsurprising that some of these adaptive mutations occur in the gene that encodes the Spike protein, given that the spikes on the virion serve as the molecular "trojan horse" that allows the virus to enter host cells. The positions of a few of the spike residues that, when mutated, are advantageous to the virus, are illustrated below. For a comprehensive discussion of spike mutations, see the review by Harvey, et al. (2021). Note the convention that lists the one letter amino acid code that has been mutated, its residue position, and the substituted residue in the mutant form. For example, D614G is an Aspartate (D) to Glycine (G) substitution.

  • The (D614G) (aspartate to glycine) mutation, discussed above in the context of the Fusion Peptide Proximal Region, has been shown to confer an advantage for infectivity (Korber, et al., 2020; Hou, et al., 2020; Yurkovetskiy, et al., 2020) and transmissibility (Volz, et al., 2020).
  • An asparagine residue in the Receptor Binding Domain has been mutated to a Lysine in some European lineages (N439K). This mutation enhances the binding affinity of the S protein for the ACE2 receptor and also protects the virus from therapeutic antibodies (Thomson, et al., 2021).
  • Another RBD residue, tyrosine 453, has been mutated to phenylalanine in some lineages, and this substitution increases ACE2 binding affinity (Starr, et al., 2020).
  • Several mutations in the N-terminal domain (see structural components, above) represent substitutions that provide escape from immune responses and therapeutic antibodies: N148S; K150R/E/T/Q; S151P (Weisblum, et al., 2020).
  • A mutation that has been shown to elicit a 10-fold increase in S protein binding affinity for ACE2, N501Y, has been found in highly contagious CoV-2 strains that have spread worldwide, having originated in the United Kingdom. The wild type and mutant residue resides near the apex of the spike. The mutant tyrosine confers an aromatic ring–ring interaction and an additional hydrogen bond with ACE2, compared with the wild-type asparagine (Liu, et al., 2021).
  • The very dangerous CoV-2 Delta variant has numerous mutations. Four, found individually in other variants of concern, occur simultaneously in in the Delta S protein and identify this variant as one of major concern that is rapidly spreading worldwide:
    • The D614G substitution already mentioned above is shared with other variants of concern.
    • T478K: this threonine-to-lysine substitution (Greenwood, 2021) may affect immunogenicity of the spike (Harvey, et al., 2021).
    • A substitution of arginine for leucine, L452R, endows the spike protein with high affinity for the ACE2 receptor and decreased recognition capability of the immune system (e.g. Zhang, et al., 2021).
    • The P681R mutation may boost infectivity of the variant by facilitating cleavage of the S prefusion protein by furin, as this residue falls within the furin cleavage loop (Harvey, et al.,2021).
    These and other mutations may account for the shorter incubation period of the Delta variant (four days after exposure, compared with ~six days), and for viral loads ~1,260 times higher than those in people infected with the wild-type strain (Li, et al., 2021).

Return to Beginning

VII. References

Bangaru, S., Ozorowki, G., Turner, L.H., Antanasijevic, A., et al. Structural annalysis of full-lngth SARS-CoV-2 spik protein from an advaced vaccine candidate. Sci Adv. 1089-1094 (2020)

Cai, Y., Zhang, J., Xio, T., Penf, H., Sterling, S.M., Walsh, R.M., Rawson, S., Rits-Volloch, S., and B. Chen. Distinct conformational states of SARS-CoV-2 spike protein. Science 369: 1586-1592. (2020)

Harvey, W.T., Carabelli, A.M., Jackson, B. et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat Rev Microbiol 19, 409–424 (2021).

Haseltine, William. "An Indian SARS-CoV-2 Variant Lands In California. More Danger Ahead?". Forbes. Retrieved 20 April 2021.

Hoffmann, M., H. Kleine-Weber, S. Schroeder, N. Krüger, T. Herrler, S. Erichsen, T. S. Schiergens, G. Herrler, N.-H. Wu, A. Nitsche, M. A. Müller, C. Drosten, S. Pöhlmann , SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell 181, 271–280.e8 (2020).

Hou, Y. J. et al. SARS- CoV-2 D614G variant exhibits efficient replication ex vivo and transmission in vivo. Science 370, 1464 (2020).

Jaimes JA, André NM, Chappie JS, Millet JK, Whittaker GR. Phylogenetic Analysis and Structural Modeling of SARS-CoV-2 Spike Protein Reveals an Evolutionary Distinct and Proteolytically Sensitive Activation Loop. J Mol Biol. 2020;432(10):3309-3325.

Kielan M. Science 13 NOVEMBER 2020 • VOL 370 ISSUE 6518.

Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, Hengartner N, Giorgi EE, Bhattacharya T, Foley B, Hastie KM, Parker MD, Partridge DG, Evans CM, Freeman TM, de Silva TI; Sheffield COVID-19 Genomics Group, McDanal C, Perez LG, Tang H, Moon-Walker A, Whelan SP, LaBranche CC, Saphire EO, Montefiori DC. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell. 2020 Aug 20;182(4):812-827.

Lan, J., Ge, J., Yu, J., et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 recptor. Nature 581, 215-220 (2020).

Li, B. et al. Preprint at medRxiv (2021).

Liu, Haolin , Qianqian Zhang, Pengcheng Wei, Zhongzhou Chen, Katja Aviszus, John Yang, Walter Downing, Chengyu Jiang, Bo Liang, Lyndon Reynoso, Gregory P. Downey, Stephen K. Frankel, John Kappler, Philippa Marrack & Gongyi Zhang. (2021). The basis of a more contagious 501Y.V1 variant of SARS-CoV-2. Cell Res 31, 720–722.

Starr, T. N. et al. Deep mutational scanning of SARS- CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. Cell 182, 1295–1310.e1220 (2020).

Toelzer, C., Gupta, K., Yadav, S.K.N., Borucu, U., Davidson, A.D., Kavanagh Williamson, M., Shoemark, D.K., Garzoni, F., Staufer, O., Milligan, R., Capin, J., Mulholland, A.J., Spatz, J., Fitzgerald, D., Berger, I., Schaffitzel, C.. Free fatty acid binding pocket in the locked structure of SARS-CoV-2 spike protein. Science 370: 725-730. (2020).

Thomson, E. C. et al. Circulating SARS- CoV-2 spike N439K variants maintain fitness while evading antibody- mediated immunity. Cell 184, 1171–1187 e1120 (2021).

Verma, J., Subbarao, N. A comparative study of human betacoronavirus spike proteins: structure, function and therapeutics. Arch Virol 166, 697–714 (2021).

Walls, A. C., Y.-J. Park, M. A. Tortorici, A. Wall, A. T. McGuire, D. Veesler , Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181, 281–292.e6 (2020).

Wrobel, A.G., Benton, D.J., Xu, P., Roustan, C., Martin, S.R., Rosenthal, P.B., Skehel, J.J., Gamblin, S.J. SARS-CoV-2 and bat RaTG13 spike glycoprotein structures inform on virus evolution and furin-cleavage effects. Nat Struct Mol Biol 27: 763-767 (2020)

Wrapp, D., N. Wang, K. S. Corbett, J. A. Goldsmith, C.-L. Hsieh, O. Abiona, B. S. Graham, J. S. McLellan , Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 367, 1260–1263 (2020).

Weisblum, Y. et al. Escape from neutralizing antibodies by SARS- CoV-2 spike protein variants. eLife https:// (2020).

Xu, C., Wang, Y., Liu, C., Zhang, C., Han, W., Hong, X., et al. Conformational dynamics of SARS-CoV-2 trimeric spike glycoproteein in complex with recepttor ACE2 revealved by cyro-EM. Sci Adv. 5575 (2020)

Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020;367(6485):1444-1448. doi:10.1126/science.abb2762

Yurkovetskiy, L. et al. Structural and functional analysis of the D614G SARS- CoV-2 spike protein variant. Cell 183, 739–751.e738 (2020).

Volz, E. et al. Evaluating the effects of SARS- CoV-2 Spike mutation D614G on transmissibility and pathogenicity. Cell cell.2020.11.020 (2020).

Zhang, Wenjuan; Davis, Brian D.; Chen, Stephanie S.; Sincuir Martinez, Jorge M.; Plummer, Jasmine T.; Vail, Eric (6 April 2021). Emergence of a Novel SARS-CoV-2 Variant in Southern California. JAMA. 325 (13): 1324–1326.

SARS-CoV-2 variants of concern as of 24 May 2021". European Centre for Disease Prevention and Control. Retrieved 29 May 2021.

SARS-CoV-2 Variant Classifications and Definitions". Centers for Disease Control and Prevention. Retrieved 15 June 2021.

Return to Beginning