News

Paper Published: Cryo-EM Ligand Modeling Challenge

06/25 PDB101 News

A paper co-authored by all organizers and participants in the EMDataResource Ligand Challenge has now been published in Nature Methods: Outcomes of the EMDataResource Cryo-EM Ligand Modeling Challenge.

The paper describes the results of the 2021 Challenge and recommends best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.

<I>Figure 2: Ligand Challenge targets and ligands from submitted models. In (A-C), Targets 1-3 are shown, with each polymer/nucleic acid chain rendered as a separate surface with a different color, in some cases semi-transparent. Target ligands are shown in red. In (D-F), segmented density representing each target ligand is shown with a semi-transparent surface, with submitted ligand models overlaid. Map contour levels are 0.35 (2.3σ), 0.036 (2.6σ), 0.25 (3.7σ) respectively (sigma values were calculated from the full unmasked map to capture variation in background noise). (G-I) Chemical sketches for each of the target ligands (source: PDB). Figure from </I>Nature Methods<I> doi: 10.1038/s41592-024-02321-7</I>Figure 2: Ligand Challenge targets and ligands from submitted models. In (A-C), Targets 1-3 are shown, with each polymer/nucleic acid chain rendered as a separate surface with a different color, in some cases semi-transparent. Target ligands are shown in red. In (D-F), segmented density representing each target ligand is shown with a semi-transparent surface, with submitted ligand models overlaid. Map contour levels are 0.35 (2.3σ), 0.036 (2.6σ), 0.25 (3.7σ) respectively (sigma values were calculated from the full unmasked map to capture variation in background noise). (G-I) Chemical sketches for each of the target ligands (source: PDB). Figure from Nature Methods doi: 10.1038/s41592-024-02321-7

The 2021 EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein/nucleic-acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic resolution (1.9-2.5 Å). Three published maps were selected as targets: E. coli beta-galactosidase with inhibitor, SARS-CoV-2 virus RNA-dependent RNA polymerase with covalently bound nucleotide analog, and SARS-CoV-2 virus ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. The quality of submitted ligand models and surrounding atoms were analyzed by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics, and contact scores. A composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.

Recommendations Summary

  1. For ligand-macromolecular complexes, the macromolecular model should be subject to standard geometric checks as done for X-ray crystallographic based models. These include standard covalent geometry checks and MolProbity evaluation, including CaBLAM, clashscore. Sugar pucker and DNATCO conformational analysis should be checked for nucleic acid components. The macromolecular model-map fit should be evaluated by EM Ringer, Q score, and FSC. Serious local outliers usually indicate an incorrect local conformation.
  2. Ligands in macromolecular complexes should conform to standard covalent geometry measures (bond lengths, angles, planarity, chirality) as recommended by the wwPDB validation report. Additional checks that should be applied to ligands include fit to density using methods applicable to cryo-EM such as Q-score, occupancy (density strength, both absolute and relative to surroundings), and identification of missing atoms, including any surrounding ions.
  3. The detailed interaction of the ligand with its binding site is of great importance and should be assessed by several independent metrics. Pharmacophore modeling is an optimized and time-tested energetic measure for how well the site would bind the specific ligand. LIVQ scores (Ligand+Immediate enVironment Q scores), introduced here, measure the density fit of the surrounding residues as well as the ligand itself. Probescore both quantifies and identifies specific all-atom contacts of H-bond, clash, and van der Waals interactions. All three types of measures should be taken into account. If the ligand model shows only weak interaction with its environment, the model is not right.
  4. Future cryo-EM Model Challenges should be organized similarly to the well-established CASP and CAPRI challenge events of the X-ray crystallography and prediction communities, with incorporation of automated checks and immediate author feedback on all model submissions. A future Challenge might focus on validation of RNA models, including identification of RNA-associated ions, owing to the rapidly rising numbers of RNA-containing cryo-EM structures.

Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge
Lawson C, Kryshtafovych A, Pintilie G, Burley S, Cerny J, Chen V, Emsley P, Gobbi A, Joachimiak A, Noreng S, Prisant M, Read R, Richardson J, Rohou A, Schneider B, Sellers B, Chao C, Sourial E, Williams C, Williams C, Yang Y, Abbaraju V, Afonine PV, Baker M, Bond P, Blundell T, Burnley T, Campbell A, Cao R, Cheng J, Chojnowski G, Cowtan K, Dimaio F, Esmaeeli R, Giri N, Grubmüller H, Hoh SW, Hou J, Hryc C, Hunte C, Igaev M, Joseph A, Kao W, Kihara D, Kumar D, Lang L, Lin S, Subramaniya SRMV, Mittal S, Mondal A, Moriarty N, Muenks A, Murshudov G, Nicholls R, Olek M, Palmer C, Perez A, Pohjolainen E, Pothula K, Rowley C, Sarkar D, Schäfer L, Schlicksup C, Schroeder G, Shekhar M, Si D, Singharoy A, Sobolev O, Terashi G, Vaiana A, Vedithi S, Verburgt J, Wang X, Warshamanage R, Winn M, Weyand S, Yamashita K, Zhao M, Schmid M, Berman H, Chiu W.
(2024) Nature Methods 21: 1340–1348 doi: 10.1038/s41592-024-02321-7

News Index