b

Rebuttal to the article Pathological crystal structures

A section in the Acta Crystallographica Section C article by Raymond & Girolami [Acta Cryst. (2023), C79, 445–455] stated that the product of the reaction of [(Cp*Rh)2(μ-OH)3]+ (Cp* is 1,2,3,4,5-penta­methyl­cyclo­penta­diene) with 1-methyl­thymine (1-MT) at pH 10 and 60 °C, to synthesize the anionic com­ponent [RhI(η1-N3-1-MT)2]−, was not an RhI com­plex, but rather an AgI com­plex, due to the use of silver triflate (AgOTf) to remove Cl− from [Cp*RhCl2]2 to synthesize [Cp*Rh(H2O)3](OTf)2, a water-soluble crystalline com­plex. We will clearly show that this premise, as stated, is invalid, while the authors have simply avoided several important facts, including that Cp*OH, a reductive elimination product, at pH 10 and 60 °C, was unequivocally identified, thus leading to the RhI anionic com­ponent [RhI(η1-N3-1-MT)2]−. More importantly, AgOH, from the reaction of NaOH at pH 10 with any potentially remaining AgOTf, after the AgCl was filtered off, would be insoluble in water. Furthermore, a control experiment with the inorganic com­plex Rh(OH)3, reacting with 1-methyl­thymine at pH 10, provided no product, and this bodes well for a similar fate with AgOTf and 1-methyl­thymine, i.e. at pH 10, AgOTf would again be converted to the water-insoluble AgOH; therefore, no reaction would occur! Finally, a 1H NMR spectroscopy experiment was carried out with synthesized and crystallized [Cp*Rh(H2O)3](OTf)2 in D2O at various pD values; at pD 8.65 no reaction took place, while at pD 13.6, and at 60 °C for 2 h, a reductive elimination reaction caused the precipitation of Cp*OH. The subsequent 1H NMR spectrum clearly demonstrated, in the absence of any AgI com­plexes, that the solution structure and the X-ray crystals in D2O were similar. A postulated mechanism for this novel anionic com­ponent structure, as published previously [Smith et al. (2014). Organometallics, 33, 2389–2404], will be presented, along with the experimental data, to insure the credibility of our results. We will also answer the comments in the response of Drs Raymond and Girolami to this rebuttal.




b

Response to the rebuttal of the article Pathological crystal structures

We stand fully behind our earlier suggestion [Raymond & Girolami (2023). Acta Cryst. C79, 445–455] that the claim by Fish and co-workers [Chen et al. (1995). J. Am. Chem. Soc. 117, 9097–9098; Smith et al. (2014). Organometallics, 33, 2389–2404] of a linear two-coordinate rhodium(I) species is incorrect, and that the putative rhodium atom is in fact silver.




b

Supra­molecular hy­dro­gen-bonded networks formed from copper(II) car­box­yl­ate dimers

The well-known copper car­box­yl­ate dimer, with four car­box­yl­ate ligands ex­ten­ding outwards towards the corners of a square, has been employed to generate a series of crystalline com­pounds. In particular, this work centres on the use of the 4-hy­droxy­benzoate anion (Hhba−) and its deprotonated phe­nol­ate form 4-oxidobenzoate (hba2−) to obtain complexes with the general formula [Cu2(Hhba)4–x(hba)xL2–y]x−, where L is an axial coligand (including solvent mol­ecules), x = 0, 1 or 2, and y = 0 or 1. In some cases, short hy­dro­gen bonds result in complexes which may be represented as [Cu2(Hhba)2(H0.5hba)2L2]−. The main focus of the investigation is on the formation of a variety of extended networks through hy­dro­gen bonding and, in some crystals, coordinate bonds when bridging coligands (L) are employed. Crystals of [Cu2(Hhba)4(di­ox­ane)2]·4(di­ox­ane) consist of the expected Cu dimer with the Hhba− anions forming hy­dro­gen bonds to 1,4-di­ox­ane mol­ecules which block network formation. In the case of crystals of com­position [Et4N][Cu2(Hhba)2(H0.5hba)2(CH3OH)(H2O)]·2(di­ox­ane), Li[Cu2(Hhba)2(H0.5hba)2(H2O)2]·3(di­ox­ane)·4H2O and [Cu2(Hhba)2(H0.5hba)2(H0.5DABCO)2]·3CH3OH (DABCO is 1,4-di­aza­bicyclo­[2.2.2]octa­ne), square-grid hy­dro­gen-bonded networks are generated in which the complex serves as one type of 4-con­necting node, whilst a second 4-con­necting node is a hy­dro­gen-bonding motif assembled from four phenol/phenolate groups. Another two-dimensional (2D) network based upon a related square-grid structure is formed in the case of [Et4N]2[Cu2(Hhba)2(hba)2(di­ox­ane)2][Cu2(Hhba)4(di­ox­ane)(H2O)]·CH3OH. In [Cu2(Hhba)4(H2O)2]·2(Et4NNO3), a square-grid structure is again apparent, but, in this case, a pair of nitrate anions, along with four phenolic groups and a pair of water mol­ecules, combine to form a second type of 4-con­necting node. When 1,8-bis­(di­methyl­amino)­naphthalene (bdn, `proton sponge') is used as a base, another square-grid network is generated, i.e. [Hbdn]2[Cu2(Hhba)2(hba)2(H2O)2]·3(di­ox­ane)·H2O, but with only the copper dimer complex serving as a 4-con­necting node. Complex three-dimensional networks are formed in [Cu2(Hhba)4(O-bipy)]·H2O and [Cu2(Hhba)4(O-bipy)2]·2(di­ox­ane), where the potentially bridging 4,4'-bi­pyridine N,N'-dioxide (O-bipy) ligand is employed. Rare cases of mixed car­box­yl­ate copper dimer complexes were obtained in the cases of [Cu2(Hhba)3(OAc)(di­ox­ane)]·3.5(di­ox­ane) and [Cu2(Hhba)2(OAc)2(DABCO)2]·10(di­ox­ane), with each structure possessing a 2D network structure. The final com­pound re­por­ted is a simple hy­dro­gen-bonded chain of com­position (H0.5DABCO)(H1.5hba), formed from the reaction of H2hba and DABCO.




b

Crystal structure elucidation of a geminal and vicinal bis­(tri­fluoro­methane­sulfonate) ester

Geminal and vicinal bis­(tri­fluoro­methane­sulfonate) esters are highly reactive alkyl­ene synthons used as potent electrophiles in the macrocyclization of imid­azoles and the transformation of bypyridines to diquat derivatives via nucleophilic substitution reactions. Herein we report the crystal structures of methyl­ene (C3H2F6O6S2) and ethyl­ene bis­(tri­fluoro­methane­sulfonate) (C4H4F6O6S2), the first examples of a geminal and vicinal bis­(tri­fluoro­methane­sulfonate) ester characterized by single-crystal X-ray diffraction (SC-XRD). With melting points slightly below ambient temperature, both reported bis­(tri­fluoro­methane­sulfonate)s are air- and moisture-sensitive oils and were crys­tallized at 277 K to afford two-com­ponent non-merohedrally twinned crystals. The dominant inter­actions present in both com­pounds are non-classical C—H⋯O hydrogen bonds and inter­molecular C—F⋯F—C inter­actions between tri­fluoro­methyl groups. Mol­ecular electrostatic potential (MEP) cal­culations by DFT-D3 helped to qu­antify the polarity between O⋯H and F⋯F contacts to rationalize the self-sorting of both bis­(tri­fluoro­methane­sulfonate) esters in polar (non-fluorous) and non-polar (fluorous) domains within the crystal structure.




b

Crystal structure and cryomagnetic study of a mononuclear erbium(III) ox­am­ate inclusion com­plex

The synthesis, crystal structure and magnetic properties of an ox­am­ate-con­taining erbium(III) com­plex, namely, tetra­butyl­ammonium aqua­[N-(2,4,6-tri­methyl­phen­yl)oxamato]erbium(III)–di­methyl sulfoxide–water (1/3/1.5), (C16H36N)[Er(C11H12NO3)4(H2O)]·3C2H6OS·1.5H2O or n-Bu4N[Er(Htmpa)4(H2O)]·3DMSO·1.5H2O (1), are reported. The crystal structure of 1 reveals the occurrence of an erbium(III) ion, which is surrounded by four N-phenyl-substituted ox­am­ate ligands and one water mol­ecule in a nine-coordinated environment, together with one tetra­butyl­ammonium cation acting as a counter-ion, and one water and three dimethyl sulfoxide (DMSO) mol­ecules of crystallization. Variable-temperature static (dc) and dynamic (ac) magnetic mea­sure­ments were carried out for this mononuclear com­plex, revealing that it behaves as a field-induced single-ion magnet (SIM) below 5.0 K.




b

Synthesis, spectroscopic and crystallographic characterization of various cymantrenyl thio­ethers [Mn{C5HxBry(SMe)z}(PPh3)(CO)2]

Starting from [Mn(C5H4Br)(PPh3)(CO)2] (1a), the cymantrenyl thio­ethers [Mn(C5H4SMe)(PPh3)(CO)2] (1b) and [Mn{C5H4–nBr(SMe)n}(PPh3)(CO)2] (n = 1 for com­pound 2, n = 2 for 3 and n = 3 for 4) were obtained, using either n-butyllithium (n-BuLi), lithium diiso­propyl­amide (LDA) or lithium tetra­methyl­piperidide (LiTMP) as base, followed by electrophilic quenching with MeSSMe. Stepwise consecutive reaction of [Mn(C5Br5)(PPh3)(CO)2] with n-BuLi and MeSSMe led finally to [Mn{C5(SMe)5}(PPh3)(CO)2] (11), only the fifth com­plex to be reported containing a perthiol­ated cyclo­penta­dienyl ring. The mol­ecular and crystal structures of 1b, 3, 4 and 11 were determined and were studied for the occurrence of S⋯S and S⋯Br inter­actions. It turned out that although some inter­actions of this type occurred, they were of minor importance for the arrangement of the mol­ecules in the crystal.




b

3-[(Benzo-1,3-dioxol-5-yl)amino]-4-meth­oxy­cyclo­but-3-ene-1,2-dione: polymorphism and twinning of a precursor to an anti­mycobacterial squaramide

The title compound, 3-[(benzo-1,3-dioxol-5-yl)amino]-4-meth­oxy­cyclo­but-3-ene-1,2-dione, C12H9NO5 (3), is a precursor to an anti­mycobacterial squaramide. Block-shaped crystals of a monoclinic form (3-I, space group P21/c, Z = 8, Z' = 2) and needle-shaped crystals of a triclinic form (3-II, space group P-1, Z = 4, Z' = 2) were found to crystallize concomitantly. In both crystal forms, R22(10) dimers assemble through N—H⋯O=C hydrogen bonds. These dimers are formed from crystallographically unique mol­ecules in 3-I, but exhibit crystallographic Ci symmetry in 3-II. Twinning by pseudomerohedry was encountered in the crystals of 3-II. The conformations of 3 in the solid forms 3-I and 3-II are different from one another but are similar for the unique mol­ecules in each polymorph. Density functional theory (DFT) calculations on the free mol­ecule of 3 indicate that a nearly planar conformation is preferred.




b

A brief review on com­puter simulations of chal­co­py­rite surfaces: structure and reactivity

Chalcopyrite, the world's primary copper ore mineral, is abundant in Latin America. Copper extraction offers significant economic and social benefits due to its strategic importance across various industries. However, the hydro­metallurgical route, considered more environmentally friendly for processing low-grade chal­co­py­rite ores, remains challenging, as does its concentration by froth flotation. This limited understanding stems from the poorly understood structure and reactivity of chal­co­py­rite surfaces. This study reviews recent contributions using density functional theory (DFT) calculations with periodic boundary conditions and slab models to elucidate chal­co­py­rite surface properties. Our analysis reveals that reconstructed surfaces preferentially expose S atoms at the topmost layer. Furthermore, some studies report the formation of di­sulfide groups (S22−) on pristine sulfur-terminated surfaces, accom­panied by the reduction of Fe3+ to Fe2+, likely due to surface oxidation. Additionally, Fe sites are consistently identified as favourable adsorption locations for both oxygen (O2) and water (H2O) mol­ecules. Finally, the potential of com­puter modelling for investigating collector–chal­co­py­rite surface inter­actions in the context of selective froth flotation is discussed, highlighting the need for further research in this area.




b

Concerning the structures of Lewis base adducts of titanium(IV) hexa­fluoro­iso­pro­pox­ide

The reaction of titanium(IV) chloride with sodium hexa­fluoro­iso­pro­pox­ide, carried out in hexa­fluoro­iso­propanol, produces titanium(IV) hexa­fluoro­iso­pro­pox­ide, which is a liquid at room temperature. Recrystallization from coordinating solvents, such as aceto­nitrile or tetra­hydro­furan, results in the formation of bis-solvate com­plexes. These com­pounds are of inter­est as possible Ziegler–Natta polymerization catalysts. The aceto­nitrile com­plex had been structurally characterized previously and adopts a distorted octahedral structure in which the nitrile ligands adopt a cis configuration, with nitro­gen lone pairs coordinated to the metal. The low-melting tetra­hydro­furan com­plex has not provided crystals suitable for single-crystal X-ray analysis. However, the structure of chlorido­tris­(hexa­fluoro­isopropoxido-κO)bis­(tetra­hydro­furan-κO)titanium(IV), [Ti(C3HF6O)3Cl(C4H8O)2], has been obtained and adopts a distorted octa­hedral coordination geometry, with a facial arrangement of the alkoxide ligands and adjacent tetra­hydro­furan ligands, coordinated by way of metal–oxygen polar coordinate inter­actions.




b

Synthesis of organotin(IV) heterocycles containing a xanthenyl group by a Barbier approach via ultrasound activation: synthesis, crystal structure and Hirshfeld surface analysis

A series of organotin heterocycles of general formula [{Me2C(C6H3CH2)2O}SnR2] [R = methyl (Me, 4), n-butyl (n-Bu, 5), benzyl (Bn, 6) and phenyl (Ph, 7)] was easily synthesized by a Barbier-type reaction assisted by the sonochemical activation of metallic magnesium. The 119Sn{1H} NMR data for all four com­pounds confirm the presence of a central Sn atom in a four-coordinated environment in solution. Single-crystal X-ray diffraction studies for 17,17-dimethyl-7,7-di­phenyl-15-oxa-7-stanna­tetra­cyclo­[11.3.1.05,16.09,14]hepta­deca-1,3,5(16),9(14),10,12-hexa­­ene, [Sn(C6H5)2(C17H16O)], 7, at 100 and 295 K con­firmed the formation of a mono­nuclear eight-membered heterocycle, with a conformation depicted as boat–chair, resulting in a weak Sn⋯O inter­action. The Sn and O atoms are surrounded by hydro­phobic C—H bonds. A Hirshfeld surface analysis of 7 showed that the eight-membered heterocycles are linked by weak C—H⋯π, π–π and H⋯H noncovalent inter­actions. The pairwise inter­action energies showed that the cohesion between the heterocycles are mainly due to dispersion forces.




b

The influence of the axial group on the crystal structures of boron sub­phthalo­cy­an­ines

The crystal structures of 16 boron sub­phthalo­cy­an­ines (BsubPcs) with structurally diverse axial groups were analyzed and com­pared to elucidate the impact of the axial group on the inter­molecular π–π inter­actions, axial-group inter­actions, axial bond length and BsubPc bowl depth. π–π inter­actions between the iso­indole units of adjacent BsubPc mol­ecules most often involve concave–concave packing, whereas axial-group inter­actions with adjacent BsubPc mol­ecules tend to favour the convex side of the BsubPc bowl. Furthermore, axial groups that contain O and/or F atoms tend to have significant hy­dro­gen-bonding inter­actions, while axial groups containing arene site(s) can participate in π–π inter­actions with the BsubPc bowl, both of which can strongly influence the crystal packing. Bulky axial groups did tend to disrupt the π–π inter­actions and/or axial-group inter­actions, preventing some of the close packing that is seen in BsubPcs with less bulky axial groups. The atomic radius of the heteroatom bonded to boron directly influences the axial bond length, whereas the axial group has minimal impact on the BsubPc bowl depth. Finally, the crystal growth method did not generally appear to have a significant impact on the solid-state arrangement, with the exception of water occasionally being incorporated into crystal structures when hygroscopic solvents were used. These insights can help with the design and fine-tuning of the solid-state structures of BsubPcs as they continue to be developed as functional materials in organic electronics.




b

Occupational modulation in the (3+1)-dimensional incommensurate structure of (2S,3S)-2-amino-3-hy­droxy-3-methyl-4-phen­oxy­butanoic acid dihydrate

The incommensurately modulated structure of (2S,3S)-2-amino-3-hy­droxy-3-methyl-4-phen­oxy­butanoic acid dihydrate (C11H15NO4·2H2O or I·2H2O) is described in the (3+1)-dimensional superspace group P212121(0β0)000 (β = 0.357). The loss of the three-dimensional periodicity is ascribed to the occupational modulation of one positionally disordered solvent water mol­ecule, where the two positions are related by a small translation [ca 0.666 (9) Å] and ∼168 (5)° rotation about one of its O—H bonds, with an average 0.624 (3):0.376 (3) occupancy ratio. The occupational modulation of this mol­ecule arises due to the com­petition between the different hy­dro­gen-bonding motifs associated with each position. The structure can be very well refined in the average approximation (all satellite reflections disregarded) in the space group P212121, with the water mol­ecule refined as disordered over two positions in a 0.625 (16):0.375 (16) ratio. The refinement in the commensurate threefold supercell approximation in the space group P1121 is also of high quality, with the six corresponding water mol­ecules exhibiting three different occupancy ratios averaging 0.635:0.365.




b

Further evaluation of the shape of atomic Hirshfeld surfaces: M⋯H contacts and homoatomic bonds

It is well known that Hirshfeld surfaces provide an easy and straightforward way of analysing inter­molecular inter­actions in the crystal environment. The use of atomic Hirshfeld surfaces has also demonstrated that such surfaces carry information related to chemical bonds which allow a deeper evaluation of the structures. Here we briefly summarize the approach of atomic Hirshfeld surfaces while further evaluating the kind of information that can be retrieved from them. We show that the analysis of the metal-centre Hirshfeld surfaces from structures refined via Hirshfeld Atom Refinement (HAR) allow accurate evaluation of contacts of type M⋯H, and that such contacts can be related to the overall shape of the surfaces. The com­pounds analysed were tetra­aqua­bis­(3-carb­oxy­propionato)metal(II), [M(C4H3O4)2(H2O)4], for metal(II)/M = manganese/Mn, cobalt/Co, nickel/Ni and zinc/Zn. We also evaluate the sensitivity of the surfaces by an investigation of seemingly flat surfaces through analysis of the curvature functions in the direction of C—C bonds. The obtained values not only demonstrate variations in curvature but also show a correlation with the hybridization of the C atoms involved in the bond.




b

Coordination structure and inter­molecular inter­actions in copper(II) acetate com­plexes with 1,10-phenanthroline and 2,2'-bi­py­ri­dine

The crystal structures of two coordination com­pounds, (acetato-κO)(2,2'-bi­py­ri­dine-κ2N,N')(1,10-phenanthroline-κ2N,N')copper(II) acetate hexa­hydrate, [Cu(C2H3O2)(C10H8N2)(C12H8N2)](C2H3O2)·6H2O or [Cu(bipy)(phen)Ac]Ac·6H2O, and (acetato-κO)bis­(2,2'-bi­py­ri­dine-κ2N,N')copper(II) acetate–acetic acid–water (1/1/3), [Cu(C2H3O2)(C10H8N2)2](C2H3O2)·C2H4O2·3H2O or [Cu(bipy)2Ac]Ac·HAc·3H2O, are reported and com­pared with the previously published structure of [Cu(phen)2Ac]Ac·7H2O (phen is 1,10-phenanthroline, bipy for 2,2'-bi­py­ri­dine, ac is acetate and Hac is acetic acid). The geometry around the metal centre is penta­coordinated, but highly distorted in all three cases. The coordination number and the geometric distortion are both discussed in detail, and all com­plexes belong to the space group Poverline{1}. The analysis of the geometric parameters and the Hirshfeld surface properties dnorm and curvedness provide information about the metal–ligand inter­actions in these com­plexes and allow com­parison with similar systems.




b

Multivalent hy­dro­gen-bonded architectures directed by self-com­plementarity between [Cu(2,2'-bi­imid­az­ole)] and malonate building blocks

The synthesis and structural characterization of four novel supra­molecular hy­dro­gen-bonded arrangements based on self-assembly from mol­ecular `[Cu(2,2'-bi­imid­az­ole)]' modules and malonate anions are pre­sent­ed, namely, tetra­kis­(2,2'-bi­imid­az­ole)di-μ-chlorido-dimal­on­atotricopper(II) penta­hydrate, [Cu3(C3H2O4)2Cl2(C6H6N4)4]·5H2O or [Cu(H2biim)2(μ-Cl)Cu0.5(mal)]2·5H2O, aqua­(2,2'-bi­imid­az­ole)­mal­on­atocopper(II) dihydrate, [Cu(C3H2O4)(C6H6N4)(H2O)]·2H2O or [Cu(H2biim)(mal)(H2O)]·2H2O, bis­[aqua­bis­(2,2'-bi­imid­az­ole)­cop­per(II)] di­mal­on­atodi­perchloratocopper(II) 2.2-hydrate, [Cu(C6H6N4)2(H2O)]2[Cu(C3H2O4)(ClO4)2]·2.2H2O or [Cu(H2biim)2(H2O)]2[Cu(mal)2(ClO4)2]·2.2H2O, and bis­(2,2'-bi­imid­az­ole)­copper(II) bis­[bis­(2,2'-bi­imid­az­ole)(2-carb­oxy­acetato)mal­on­atocopper(II)] tridecahydrate, [Cu(C6H6N4)2][Cu(C3H2O4)(C3H3O4)(C6H6N4)2]·13H2O or [Cu(H2biim)2][Cu(H2biim)2(Hmal)(mal)]2·13H2O. These as­sem­blies are characterized by self-com­plementary donor–acceptor mol­ecular inter­actions, demonstrating a recurrent and distinctive pattern of hy­dro­gen-bonding preferences among the carboxyl­ate, carb­oxy­lic acid and N—H groups of the coordinated 2,2'-bi­imid­az­ole and malonate ligands. Additionally, co­or­din­ation of the carboxyl­ate group with the metallic centre helps sustain re­mark­able supra­molecular assemblies, such as layers, helices, double helix columns or 3D channeled architectures, including mixed-metal com­plexes, into a single structure.




b

3D electron diffraction studies of synthetic rhabdophane (DyPO4·nH2O)

In this study, we report the results of continuous rotation electron diffraction studies of single DyPO4·nH2O (rhabdophane) nanocrystals. The diffraction patterns can be fit to a trigonal lattice (P3121) with lattice parameters a = 7.019 (5) and c = 6.417 (5) Å. However, there is also a set of diffuse background scattering features present that are associated with a disordered superstructure that is double these lattice parameters and fits with an arrangement of water mol­ecules present in the structure pore. Pair distribution function (PDF) maps based on the diffuse background allowed the extent of the water correlation to be estimated, with 2–3 nm correlation along the c axis and ∼5 nm along the a/b axis.




b

Introducing the Best practice in crystallography series

 




b

Modes and model building in SHELXE

Density modification is a standard step to provide a route for routine structure solution by any experimental phasing method, with single-wavelength or multi-wavelength anomalous diffraction being the most popular methods, as well as to extend fragments or incomplete models into a full solution. The effect of density modification on the starting maps from either source is illustrated in the case of SHELXE. The different modes in which the program can run are reviewed; these include less well known uses such as reading external phase values and weights or phase distributions encoded in Hendrickson–Lattman coefficients. Typically in SHELXE, initial phases are calculated from experimental data, from a partial model or map, or from a combination of both sources. The initial phase set is improved and extended by density modification and, if the resolution of the data and the type of structure permits, polyalanine tracing. As a feature to systematically eliminate model bias from phases derived from predicted models, the trace can be set to exclude the area occupied by the starting model. The trace now includes an extension into the gamma position or hydrophobic and aromatic side chains if a sequence is provided, which is performed in every tracing cycle. Once a correlation coefficient of over 30% between the structure factors calculated from such a trace and the native data indicates that the structure has been solved, the sequence is docked in all model-building cycles and side chains are fitted if the map supports it. The extensions to the tracing algorithm brought in to provide a complete model are discussed. The improvement in phasing performance is assessed using a set of tests.




b

The TR-icOS setup at the ESRF: time-resolved microsecond UV–Vis absorption spectroscopy on protein crystals

The technique of time-resolved macromolecular crystallography (TR-MX) has recently been rejuvenated at synchrotrons, resulting in the design of dedicated beamlines. Using pump–probe schemes, this should make the mechanistic study of photoactive proteins and other suitable systems possible with time resolutions down to microseconds. In order to identify relevant time delays, time-resolved spectroscopic experiments directly performed on protein crystals are often desirable. To this end, an instrument has been built at the icOS Lab (in crystallo Optical Spectroscopy Laboratory) at the European Synchrotron Radiation Facility using reflective focusing objectives with a tuneable nanosecond laser as a pump and a microsecond xenon flash lamp as a probe, called the TR-icOS (time-resolved icOS) setup. Using this instrument, pump–probe spectra can rapidly be recorded from single crystals with time delays ranging from a few microseconds to seconds and beyond. This can be repeated at various laser pulse energies to track the potential presence of artefacts arising from two-photon absorption, which amounts to a power titration of a photoreaction. This approach has been applied to monitor the rise and decay of the M state in the photocycle of crystallized bacteriorhodopsin and showed that the photocycle is increasingly altered with laser pulses of peak fluence greater than 100 mJ cm−2, providing experimental laser and delay parameters for a successful TR-MX experiment.




b

The High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX), an ancillary tool for the macromolecular crystallography beamlines at the ESRF

This article describes the High-Pressure Freezing Laboratory for Macromolecular Crystallography (HPMX) at the ESRF, and highlights new and complementary research opportunities that can be explored using this facility. The laboratory is dedicated to investigating interactions between macromolecules and gases in crystallo, and finds applications in many fields of research, including fundamental biology, biochemistry, and environmental and medical science. At present, the HPMX laboratory offers the use of different high-pressure cells adapted for helium, argon, krypton, xenon, nitrogen, oxygen, carbon dioxide and methane. Important scientific applications of high pressure to macromolecules at the HPMX include noble-gas derivatization of crystals to detect and map the internal architecture of proteins (pockets, tunnels and channels) that allows the storage and diffusion of ligands or substrates/products, the investigation of the catalytic mechanisms of gas-employing enzymes (using oxygen, carbon dioxide or methane as substrates) to possibly decipher intermediates, and studies of the conformational fluctuations or structure modifications that are necessary for proteins to function. Additionally, cryo-cooling protein crystals under high pressure (helium or argon at 2000 bar) enables the addition of cryo-protectant to be avoided and noble gases can be employed to produce derivatives for structure resolution. The high-pressure systems are designed to process crystals along a well defined pathway in the phase diagram (pressure–temperature) of the gas to cryo-cool the samples according to the three-step `soak-and-freeze method'. Firstly, crystals are soaked in a pressurized pure gas atmosphere (at 294 K) to introduce the gas and facilitate its inter­actions within the macromolecules. Samples are then flash-cooled (at 100 K) while still under pressure to cryo-trap macromolecule–gas complexation states or pressure-induced protein modifications. Finally, the samples are recovered after depressurization at cryo-temperatures. The final section of this publication presents a selection of different typical high-pressure experiments carried out at the HPMX, showing that this technique has already answered a wide range of scientific questions. It is shown that the use of different gases and pressure conditions can be used to probe various effects, such as mapping the functional internal architectures of enzymes (tunnels in the haloalkane dehalogenase DhaA) and allosteric sites on membrane-protein surfaces, the interaction of non-inert gases with proteins (oxygen in the hydrogenase ReMBH) and pressure-induced structural changes of proteins (tetramer dissociation in urate oxidase). The technique is versatile and the provision of pressure cells and their application at the HPMX is gradually being extended to address new scientific questions.




b

A web-based dashboard for RELION metadata visualization

Cryo-electron microscopy (cryo-EM) has witnessed radical progress in the past decade, driven by developments in hardware and software. While current software packages include processing pipelines that simplify the image-processing workflow, they do not prioritize the in-depth analysis of crucial metadata, limiting troubleshooting for challenging data sets. The widely used RELION software package lacks a graphical native representation of the underlying metadata. Here, two web-based tools are introduced: relion_live.py, which offers real-time feedback on data collection, aiding swift decision-making during data acquisition, and relion_analyse.py, a graphical interface to represent RELION projects by plotting essential metadata including interactive data filtration and analysis. A useful script for estimating ice thickness and data quality during movie pre-processing is also presented. These tools empower researchers to analyse data efficiently and allow informed decisions during data collection and processing.




b

Structural flexibility of Toscana virus nucleoprotein in the presence of a single-chain camelid antibody

Phenuiviridae nucleoprotein is the main structural and functional component of the viral cycle, protecting the viral RNA and mediating the essential replication/transcription processes. The nucleoprotein (N) binds the RNA using its globular core and polymerizes through the N-terminus, which is presented as a highly flexible arm, as demonstrated in this article. The nucleoprotein exists in an `open' or a `closed' conformation. In the case of the closed conformation the flexible N-terminal arm folds over the RNA-binding cleft, preventing RNA adsorption. In the open conformation the arm is extended in such a way that both RNA adsorption and N polymerization are possible. In this article, single-crystal X-ray diffraction and small-angle X-ray scattering were used to study the N protein of Toscana virus complexed with a single-chain camelid antibody (VHH) and it is shown that in the presence of the antibody the nucleoprotein is unable to achieve a functional assembly to form a ribonucleoprotein complex.




b

Fragment-based screening targeting an open form of the SARS-CoV-2 main protease binding pocket

To identify starting points for therapeutics targeting SARS-CoV-2, the Paul Scherrer Institute and Idorsia decided to collaboratively perform an X-ray crystallographic fragment screen against its main protease. Fragment-based screening was carried out using crystals with a pronounced open conformation of the substrate-binding pocket. Of 631 soaked fragments, a total of 29 hits bound either in the active site (24 hits), a remote binding pocket (three hits) or at crystal-packing interfaces (two hits). Notably, two fragments with a pose that was sterically incompatible with a more occluded crystal form were identified. Two isatin-based electrophilic fragments bound covalently to the catalytic cysteine residue. The structures also revealed a surprisingly strong influence of the crystal form on the binding pose of three published fragments used as positive controls, with implications for fragment screening by crystallography.




b

AlphaFold-assisted structure determination of a bacterial protein of unknown function using X-ray and electron crystallography

Macromolecular crystallography generally requires the recovery of missing phase information from diffraction data to reconstruct an electron-density map of the crystallized molecule. Most recent structures have been solved using molecular replacement as a phasing method, requiring an a priori structure that is closely related to the target protein to serve as a search model; when no such search model exists, molecular replacement is not possible. New advances in computational machine-learning methods, however, have resulted in major advances in protein structure predictions from sequence information. Methods that generate predicted structural models of sufficient accuracy provide a powerful approach to molecular replacement. Taking advantage of these advances, AlphaFold predictions were applied to enable structure determination of a bacterial protein of unknown function (UniProtKB Q63NT7, NCBI locus BPSS0212) based on diffraction data that had evaded phasing attempts using MIR and anomalous scattering methods. Using both X-ray and micro-electron (microED) diffraction data, it was possible to solve the structure of the main fragment of the protein using a predicted model of that domain as a starting point. The use of predicted structural models importantly expands the promise of electron diffraction, where structure determination relies critically on molecular replacement.




b

Using cryo-EM to understand the assembly pathway of respiratory complex I

Complex I (proton-pumping NADH:ubiquinone oxidoreductase) is the first component of the mitochondrial respiratory chain. In recent years, high-resolution cryo-EM studies of complex I from various species have greatly enhanced the understanding of the structure and function of this important membrane-protein complex. Less well studied is the structural basis of complex I biogenesis. The assembly of this complex of more than 40 subunits, encoded by nuclear or mitochondrial DNA, is an intricate process that requires at least 20 different assembly factors in humans. These are proteins that are transiently associated with building blocks of the complex and are involved in the assembly process, but are not part of mature complex I. Although the assembly pathways have been studied extensively, there is limited information on the structure and molecular function of the assembly factors. Here, the insights that have been gained into the assembly process using cryo-EM are reviewed.




b

A service-based approach to cryoEM facility processing pipelines at eBIC

Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.




b

The crystal structure of mycothiol disulfide reductase (Mtr) provides mechanistic insight into the specific low-molecular-weight thiol reductase activity of Actinobacteria

Low-molecular-weight (LMW) thiols are involved in many processes in all organisms, playing a protective role against reactive species, heavy metals, toxins and antibiotics. Actinobacteria, such as Mycobacterium tuberculosis, use the LMW thiol mycothiol (MSH) to buffer the intracellular redox environment. The NADPH-dependent FAD-containing oxidoreductase mycothiol disulfide reductase (Mtr) is known to reduce oxidized mycothiol disulfide (MSSM) to MSH, which is crucial to maintain the cellular redox balance. In this work, the first crystal structures of Mtr are presented, expanding the structural knowledge and understanding of LMW thiol reductases. The structural analyses and docking calculations provide insight into the nature of Mtrs, with regard to the binding and reduction of the MSSM substrate, in the context of related oxidoreductases. The putative binding site for MSSM suggests a similar binding to that described for the homologous glutathione reductase and its respective substrate glutathione disulfide, but with distinct structural differences shaped to fit the bulkier MSSM substrate, assigning Mtrs as uniquely functioning reductases. As MSH has been acknowledged as an attractive antitubercular target, the structural findings presented in this work may contribute towards future antituberculosis drug development.




b

Characterization of novel mevalonate kinases from the tardigrade Ramazzottius varieornatus and the psychrophilic archaeon Methanococcoides burtonii

Mevalonate kinase is central to the isoprenoid biosynthesis pathway. Here, high-resolution X-ray crystal structures of two mevalonate kinases are presented: a eukaryotic protein from Ramazzottius varieornatus and an archaeal protein from Methanococcoides burtonii. Both enzymes possess the highly conserved motifs of the GHMP enzyme superfamily, with notable differences between the two enzymes in the N-terminal part of the structures. Biochemical characterization of the two enzymes revealed major differences in their sensitivity to geranyl pyrophosphate and farnesyl pyrophosphate, and in their thermal stabilities. This work adds to the understanding of the structural basis of enzyme inhibition and thermostability in mevalonate kinases.




b

Advanced exploitation of unmerged reflection data during processing and refinement with autoPROC and BUSTER

The validation of structural models obtained by macromolecular X-ray crystallography against experimental diffraction data, whether before deposition into the PDB or after, is typically carried out exclusively against the merged data that are eventually archived along with the atomic coordinates. It is shown here that the availability of unmerged reflection data enables valuable additional analyses to be performed that yield improvements in the final models, and tools are presented to implement them, together with examples of the results to which they give access. The first example is the automatic identification and removal of image ranges affected by loss of crystal centering or by excessive decay of the diffraction pattern as a result of radiation damage. The second example is the `reflection-auditing' process, whereby individual merged data items showing especially poor agreement with model predictions during refinement are investigated thanks to the specific metadata (such as image number and detector position) that are available for the corresponding unmerged data, potentially revealing previously undiagnosed instrumental, experimental or processing problems. The third example is the calculation of so-called F(early) − F(late) maps from carefully selected subsets of unmerged amplitude data, which can not only highlight the location and extent of radiation damage but can also provide guidance towards suitable fine-grained parametrizations to model the localized effects of such damage.




b

Structural determination and modeling of ciliary microtubules

The axoneme, a microtubule-based array at the center of every cilium, has been the subject of structural investigations for decades, but only recent advances in cryo-EM and cryo-ET have allowed a molecular-level interpretation of the entire complex to be achieved. The unique properties of the nine doublet microtubules and central pair of singlet microtubules that form the axoneme, including the highly decorated tubulin lattice and the docking of massive axonemal complexes, provide opportunities and challenges for sample preparation, 3D reconstruction and atomic modeling. Here, the approaches used for cryo-EM and cryo-ET of axonemes are reviewed, while highlighting the unique opportunities provided by the latest generation of AI-guided tools that are transforming structural biology.




b

Mononuclear binding and catalytic activity of europium(III) and gadolinium(III) at the active site of the model metalloenzyme phosphotriesterase

Lanthanide ions have ideal chemical properties for catalysis, such as hard Lewis acidity, fast ligand-exchange kinetics, high coordination-number preferences and low geometric requirements for coordination. As a result, many small-molecule lanthanide catalysts have been described in the literature. Yet, despite the ability of enzymes to catalyse highly stereoselective reactions under gentle conditions, very few lanthanoenzymes have been investigated. In this work, the mononuclear binding of europium(III) and gadolinium(III) to the active site of a mutant of the model enzyme phosphotriesterase are described using X-ray crystallography at 1.78 and 1.61 Å resolution, respectively. It is also shown that despite coordinating a single non-natural metal cation, the PTE-R18 mutant is still able to maintain esterase activity.




b

HEIDI: an experiment-management platform enabling high-throughput fragment and compound screening

The Swiss Light Source facilitates fragment-based drug-discovery campaigns for academic and industrial users through the Fast Fragment and Compound Screening (FFCS) software suite. This framework is further enriched by the option to utilize the Smart Digital User (SDU) software for automated data collection across the PXI, PXII and PXIII beamlines. In this work, the newly developed HEIDI webpage (https://heidi.psi.ch) is introduced: a platform crafted using state-of-the-art software architecture and web technologies for sample management of rotational data experiments. The HEIDI webpage features a data-review tab for enhanced result visualization and provides programmatic access through a representational state transfer application programming interface (REST API). The migration of the local FFCS MongoDB instance to the cloud is highlighted and detailed. This transition ensures secure, encrypted and consistently accessible data through a robust and reliable REST API tailored for the FFCS software suite. Collectively, these advancements not only significantly elevate the user experience, but also pave the way for future expansions and improvements in the capabilities of the system.




b

STOPGAP: an open-source package for template matching, subtomogram alignment and classification

Cryo-electron tomography (cryo-ET) enables molecular-resolution 3D imaging of complex biological specimens such as viral particles, cellular sections and, in some cases, whole cells. This enables the structural characterization of molecules in their near-native environments, without the need for purification or separation, thereby preserving biological information such as conformational states and spatial relationships between different molecular species. Subtomogram averaging is an image-processing workflow that allows users to leverage cryo-ET data to identify and localize target molecules, determine high-resolution structures of repeating molecular species and classify different conformational states. Here, STOPGAP, an open-source package for subtomogram averaging that is designed to provide users with fine control over each of these steps, is described. In providing detailed descriptions of the image-processing algorithms that STOPGAP uses, this manuscript is also intended to serve as a technical resource to users as well as for further community-driven software development.




b

A database overview of metal-coordination distances in metalloproteins

Metalloproteins are ubiquitous in all living organisms and take part in a very wide range of biological processes. For this reason, their experimental characterization is crucial to obtain improved knowledge of their structure and biological functions. The three-dimensional structure represents highly relevant information since it provides insight into the interaction between the metal ion(s) and the protein fold. Such interactions determine the chemical reactivity of the bound metal. The available PDB structures can contain errors due to experimental factors such as poor resolution and radiation damage. A lack of use of distance restraints during the refinement and validation process also impacts the structure quality. Here, the aim was to obtain a thorough overview of the distribution of the distances between metal ions and their donor atoms through the statistical analysis of a data set based on more than 115 000 metal-binding sites in proteins. This analysis not only produced reference data that can be used by experimentalists to support the structure-determination process, for example as refinement restraints, but also resulted in an improved insight into how protein coordination occurs for different metals and the nature of their binding interactions. In particular, the features of carboxylate coordination were inspected, which is the only type of interaction that is commonly present for nearly all metals.




b

Pillar data-acquisition strategies for cryo-electron tomography of beam-sensitive biological samples

For cryo-electron tomography (cryo-ET) of beam-sensitive biological specimens, a planar sample geometry is typically used. As the sample is tilted, the effective thickness of the sample along the direction of the electron beam increases and the signal-to-noise ratio concomitantly decreases, limiting the transfer of information at high tilt angles. In addition, the tilt range where data can be collected is limited by a combination of various sample-environment constraints, including the limited space in the objective lens pole piece and the possible use of fixed conductive braids to cool the specimen. Consequently, most tilt series are limited to a maximum of ±70°, leading to the presence of a missing wedge in Fourier space. The acquisition of cryo-ET data without a missing wedge, for example using a cylindrical sample geometry, is hence attractive for volumetric analysis of low-symmetry structures such as organelles or vesicles, lysis events, pore formation or filaments for which the missing information cannot be compensated by averaging techniques. Irrespective of the geometry, electron-beam damage to the specimen is an issue and the first images acquired will transfer more high-resolution information than those acquired last. There is also an inherent trade-off between higher sampling in Fourier space and avoiding beam damage to the sample. Finally, the necessity of using a sufficient electron fluence to align the tilt images means that this fluence needs to be fractionated across a small number of images; therefore, the order of data acquisition is also a factor to consider. Here, an n-helix tilt scheme is described and simulated which uses overlapping and interleaved tilt series to maximize the use of a pillar geometry, allowing the entire pillar volume to be reconstructed as a single unit. Three related tilt schemes are also evaluated that extend the continuous and classic dose-symmetric tilt schemes for cryo-ET to pillar samples to enable the collection of isotropic information across all spatial frequencies. A fourfold dose-symmetric scheme is proposed which provides a practical compromise between uniform information transfer and complexity of data acquisition.




b

Introduction of the Capsules environment to support further growth of the SBGrid structural biology software collection

The expansive scientific software ecosystem, characterized by millions of titles across various platforms and formats, poses significant challenges in maintaining reproducibility and provenance in scientific research. The diversity of independently developed applications, evolving versions and heterogeneous components highlights the need for rigorous methodologies to navigate these complexities. In response to these challenges, the SBGrid team builds, installs and configures over 530 specialized software applications for use in the on-premises and cloud-based computing environments of SBGrid Consortium members. To address the intricacies of supporting this diverse application collection, the team has developed the Capsule Software Execution Environment, generally referred to as Capsules. Capsules rely on a collection of programmatically generated bash scripts that work together to isolate the runtime environment of one application from all other applications, thereby providing a transparent cross-platform solution without requiring specialized tools or elevated account privileges for researchers. Capsules facilitate modular, secure software distribution while maintaining a centralized, conflict-free environment. The SBGrid platform, which combines Capsules with the SBGrid collection of structural biology applications, aligns with FAIR goals by enhancing the findability, accessibility, interoperability and reusability of scientific software, ensuring seamless functionality across diverse computing environments. Its adaptability enables application beyond structural biology into other scientific fields.




b

A structural role for tryptophan in proteins, and the ubiquitous Trp Cδ1—H⋯O=C (backbone) hydrogen bond

Tryptophan is the most prominent amino acid found in proteins, with multiple functional roles. Its side chain is made up of the hydrophobic indole moiety, with two groups that act as donors in hydrogen bonds: the Nɛ—H group, which is a potent donor in canonical hydrogen bonds, and a polarized Cδ1—H group, which is capable of forming weaker, noncanonical hydrogen bonds. Due to adjacent electron-withdrawing moieties, C—H⋯O hydrogen bonds are ubiquitous in macromolecules, albeit contingent on the polarization of the donor C—H group. Consequently, Cα—H groups (adjacent to the carbonyl and amino groups of flanking peptide bonds), as well as the Cɛ1—H and Cδ2—H groups of histidines (adjacent to imidazole N atoms), are known to serve as donors in hydrogen bonds, for example stabilizing parallel and antiparallel β-sheets. However, the nature and the functional role of interactions involving the Cδ1—H group of the indole ring of tryptophan are not well characterized. Here, data mining of high-resolution (r ≤ 1.5 Å) crystal structures from the Protein Data Bank was performed and ubiquitous close contacts between the Cδ1—H groups of tryptophan and a range of electronegative acceptors were identified, specifically main-chain carbonyl O atoms immediately upstream and downstream in the polypeptide chain. The stereochemical analysis shows that most of the interactions bear all of the hallmarks of proper hydrogen bonds. At the same time, their cohesive nature is confirmed by quantum-chemical calculations, which reveal interaction energies of 1.5–3.0 kcal mol−1, depending on the specific stereochemistry.




b

Managing macromolecular crystallographic data with a laboratory information management system

Protein crystallography is an established method to study the atomic structures of macromolecules and their complexes. A prerequisite for successful structure determination is diffraction-quality crystals, which may require extensive optimization of both the protein and the conditions, and hence projects can stretch over an extended period, with multiple users being involved. The workflow from crystallization and crystal treatment to deposition and publication is well defined, and therefore an electronic laboratory information management system (LIMS) is well suited to management of the data. Completion of the project requires key information on all the steps being available and this information should also be made available according to the FAIR principles. As crystallized samples are typically shipped between facilities, a key feature to be captured in the LIMS is the exchange of metadata between the crystallization facility of the home laboratory and, for example, synchrotron facilities. On completion, structures are deposited in the Protein Data Bank (PDB) and the LIMS can include the PDB code in its database, completing the chain of custody from crystallization to structure deposition and publication. A LIMS designed for macromolecular crystallography, IceBear, is available as a standalone installation and as a hosted service, and the implementation of key features for the capture of metadata in IceBear is discussed as an example.




b

The crystal structure of Shethna protein II (FeSII) from Azotobacter vinelandii suggests a domain swap

The Azotobacter vinelandii FeSII protein forms an oxygen-resistant complex with the nitrogenase MoFe and Fe proteins. FeSII is an adrenodoxin-type ferredoxin that forms a dimer in solution. Previously, the crystal structure was solved [Schlesier et al. (2016), J. Am. Chem. Soc. 138, 239–247] with five copies in the asymmetric unit. One copy is a normal adrenodoxin domain that forms a dimer with its crystallographic symmetry mate. The other four copies are in an `open' conformation with a loop flipped out exposing the 2Fe–2S cluster. The open and closed conformations were interpreted as oxidized and reduced, respectively, and the large conformational change in the open configuration allowed binding to nitrogenase. Here, the structure of FeSII was independently solved in the same crystal form. The positioning of the atoms in the unit cell is similar to the earlier report. However, the interpretation of the structure is different. The `open' conformation is interpreted as the product of a crystallization-induced domain swap. The 2Fe–2S cluster is not exposed to solvent, but in the crystal its interacting helix is replaced by the same helix residues from a crystal symmetry mate. The domain swap is complicated, as it is unusual in being in the middle of the protein rather than at a terminus, and it creates arrangements of molecules that can be interpreted in multiple ways. It is also cautioned that crystal structures should be interpreted in terms of the contents of the entire crystal rather than of one asymmetric unit.




b

Crystallographic fragment-binding studies of the Mycobacterium tuberculosis trifunctional enzyme suggest binding pockets for the tails of the acyl-CoA substrates at its active sites and a potential substrate-channeling path between them

The Mycobacterium tuberculosis trifunctional enzyme (MtTFE) is an α2β2 tetrameric enzyme in which the α-chain harbors the 2E-enoyl-CoA hydratase (ECH) and 3S-hydroxyacyl-CoA dehydrogenase (HAD) active sites, and the β-chain provides the 3-ketoacyl-CoA thiolase (KAT) active site. Linear, medium-chain and long-chain 2E-enoyl-CoA molecules are the preferred substrates of MtTFE. Previous crystallographic binding and modeling studies identified binding sites for the acyl-CoA substrates at the three active sites, as well as the NAD binding pocket at the HAD active site. These studies also identified three additional CoA binding sites on the surface of MtTFE that are different from the active sites. It has been proposed that one of these additional sites could be of functional relevance for the substrate channeling (by surface crawling) of reaction intermediates between the three active sites. Here, 226 fragments were screened in a crystallographic fragment-binding study of MtTFE crystals, resulting in the structures of 16 MtTFE–fragment complexes. Analysis of the 121 fragment-binding events shows that the ECH active site is the `binding hotspot' for the tested fragments, with 41 binding events. The mode of binding of the fragments bound at the active sites provides additional insight into how the long-chain acyl moiety of the substrates can be accommodated at their proposed binding pockets. In addition, the 20 fragment-binding events between the active sites identify potential transient binding sites of reaction intermediates relevant to the possible channeling of substrates between these active sites. These results provide a basis for further studies to understand the functional relevance of the latter binding sites and to identify substrates for which channeling is crucial.




b

Likelihood-based interactive local docking into cryo-EM maps in ChimeraX

The interpretation of cryo-EM maps often includes the docking of known or predicted structures of the components, which is particularly useful when the map resolution is worse than 4 Å. Although it can be effective to search the entire map to find the best placement of a component, the process can be slow when the maps are large. However, frequently there is a well-founded hypothesis about where particular components are located. In such cases, a local search using a map subvolume will be much faster because the search volume is smaller, and more sensitive because optimizing the search volume for the rotation-search step enhances the signal to noise. A Fourier-space likelihood-based local search approach, based on the previously published em_placement software, has been implemented in the new emplace_local program. Tests confirm that the local search approach enhances the speed and sensitivity of the computations. An interactive graphical interface in the ChimeraX molecular-graphics program provides a convenient way to set up and evaluate docking calculations, particularly in defining the part of the map into which the components should be placed.




b

Post-translational modifications in the Protein Data Bank

Proteins frequently undergo covalent modification at the post-translational level, which involves the covalent attachment of chemical groups onto amino acids. This can entail the singular or multiple addition of small groups, such as phosphorylation; long-chain modifications, such as glycosylation; small proteins, such as ubiquitination; as well as the interconversion of chemical groups, such as the formation of pyroglutamic acid. These post-translational modifications (PTMs) are essential for the normal functioning of cells, as they can alter the physicochemical properties of amino acids and therefore influence enzymatic activity, protein localization, protein–protein interactions and protein stability. Despite their inherent importance, accurately depicting PTMs in experimental studies of protein structures often poses a challenge. This review highlights the role of PTMs in protein structures, as well as the prevalence of PTMs in the Protein Data Bank, directing the reader to accurately built examples suitable for use as a modelling reference.




b

Surface-mutagenesis strategies to enable structural biology crystallization platforms

A key prerequisite for the successful application of protein crystallography in drug discovery is to establish a robust crystallization system for a new drug-target protein fast enough to deliver crystal structures when the first inhibitors have been identified in the hit-finding campaign or, at the latest, in the subsequent hit-to-lead process. The first crucial step towards generating well folded proteins with a high likelihood of crystallizing is the identification of suitable truncation variants of the target protein. In some cases an optimal length variant alone is not sufficient to support crystallization and additional surface mutations need to be introduced to obtain suitable crystals. In this contribution, four case studies are presented in which rationally designed surface modifications were key to establishing crystallization conditions for the target proteins (the protein kinases Aurora-C, IRAK4 and BUB1, and the KRAS–SOS1 complex). The design process which led to well diffracting crystals is described and the crystal packing is analysed to understand retrospectively how the specific surface mutations promoted successful crystallization. The presented design approaches are routinely used in our team to support the establishment of robust crystallization systems which enable structure-guided inhibitor optimization for hit-to-lead and lead-optimization projects in pharmaceutical research.




b

Microcrystal electron diffraction structure of Toll-like receptor 2 TIR-domain-nucleated MyD88 TIR-domain higher-order assembly

Eukaryotic TIR (Toll/interleukin-1 receptor protein) domains signal via TIR–TIR interactions, either by self-association or by interaction with other TIR domains. In mammals, TIR domains are found in Toll-like receptors (TLRs) and cytoplasmic adaptor proteins involved in pro-inflammatory signaling. Previous work revealed that the MAL TIR domain (MALTIR) nucleates the assembly of MyD88TIR into crystalline arrays in vitro. A microcrystal electron diffraction (MicroED) structure of the MyD88TIR assembly has previously been solved, revealing a two-stranded higher-order assembly of TIR domains. In this work, it is demonstrated that the TIR domain of TLR2, which is reported to signal as a heterodimer with either TLR1 or TLR6, induces the formation of crystalline higher-order assemblies of MyD88TIR in vitro, whereas TLR1TIR and TLR6TIR do not. Using an improved data-collection protocol, the MicroED structure of TLR2TIR-induced MyD88TIR microcrystals was determined at a higher resolution (2.85 Å) and with higher completeness (89%) compared with the previous structure of the MALTIR-induced MyD88TIR assembly. Both assemblies exhibit conformational differences in several areas that are important for signaling (for example the BB loop and CD loop) compared with their monomeric structures. These data suggest that TLR2TIR and MALTIR interact with MyD88 in an analogous manner during signaling, nucleating MyD88TIR assemblies uni­directionally.




b

Comparison of two crystal polymorphs of NowGFP reveals a new conformational state trapped by crystal packing

Crystal polymorphism serves as a strategy to study the conformational flexibility of proteins. However, the relationship between protein crystal packing and protein conformation often remains elusive. In this study, two distinct crystal forms of a green fluorescent protein variant, NowGFP, are compared: a previously identified monoclinic form (space group C2) and a newly discovered ortho­rhombic form (space group P212121). Comparative analysis reveals that both crystal forms exhibit nearly identical linear assemblies of NowGFP molecules interconnected through similar crystal contacts. However, a notable difference lies in the stacking of these assemblies: parallel in the monoclinic form and perpendicular in the orthorhombic form. This distinct mode of stacking leads to different crystal contacts and induces structural alteration in one of the two molecules within the asymmetric unit of the orthorhombic crystal form. This new conformational state captured by orthorhombic crystal packing exhibits two unique features: a conformational shift of the β-barrel scaffold and a restriction of pH-dependent shifts of the key residue Lys61, which is crucial for the pH-dependent spectral shift of this protein. These findings demonstrate a clear connection between crystal packing and alternative conformational states of proteins, providing insights into how structural variations influence the function of fluorescent proteins.




b

Robust and automatic beamstop shadow outlier rejection: combining crystallographic statistics with modern clustering under a semi-supervised learning strategy

During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.




b

Structural studies of β-glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharolyticus

β-Glucosidase from the thermophilic bacterium Caldicellulosiruptor saccharo­lyticus (Bgl1) has been denoted as having an attractive catalytic profile for various industrial applications. Bgl1 catalyses the final step of in the decomposition of cellulose, an unbranched glucose polymer that has attracted the attention of researchers in recent years as it is the most abundant renewable source of reduced carbon in the biosphere. With the aim of enhancing the thermostability of Bgl1 for a broad spectrum of biotechnological processes, it has been subjected to structural studies. Crystal structures of Bgl1 and its complex with glucose were determined at 1.47 and 1.95 Å resolution, respectively. Bgl1 is a member of glycosyl hydrolase family 1 (GH1 superfamily, EC 3.2.1.21) and the results showed that the 3D structure of Bgl1 follows the overall architecture of the GH1 family, with a classical (β/α)8 TIM-barrel fold. Comparisons of Bgl1 with sequence or structural homologues of β-glucosidase reveal quite similar structures but also unique structural features in Bgl1 with plausible functional roles.




b

CHiMP: deep-learning tools trained on protein crystallization micrographs to enable automation of experiments

A group of three deep-learning tools, referred to collectively as CHiMP (Crystal Hits in My Plate), were created for analysis of micrographs of protein crystallization experiments at the Diamond Light Source (DLS) synchrotron, UK. The first tool, a classification network, assigns images into categories relating to experimental outcomes. The other two tools are networks that perform both object detection and instance segmentation, resulting in masks of individual crystals in the first case and masks of crystallization droplets in addition to crystals in the second case, allowing the positions and sizes of these entities to be recorded. The creation of these tools used transfer learning, where weights from a pre-trained deep-learning network were used as a starting point and repurposed by further training on a relatively small set of data. Two of the tools are now integrated at the VMXi macromolecular crystallography beamline at DLS, where they have the potential to absolve the need for any user input, both for monitoring crystallization experiments and for triggering in situ data collections. The third is being integrated into the XChem fragment-based drug-discovery screening platform, also at DLS, to allow the automatic targeting of acoustic compound dispensing into crystallization droplets.




b

EMhub: a web platform for data management and on-the-fly processing in scientific facilities

Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMHub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows.




b

Structure and stability of an apo thermophilic esterase that hydrolyzes polyhydroxybutyrate

Pollution from plastics is a global problem that threatens the biosphere for a host of reasons, including the time scale that it takes for most plastics to degrade. Biodegradation is an ideal solution for remediating bioplastic waste as it does not require the high temperatures necessary for thermal degradation and does not introduce additional pollutants into the environment. Numerous organisms can scavenge for bioplastics, such as polylactic acid (PLA) or poly-(R)-hydroxybutyrate (PHB), which they can use as an energy source. Recently, a promiscuous PHBase from the thermophilic soil bacterium Lihuaxuella thermophila (LtPHBase) was identified. LtPHBase can accommodate many substrates, including PHB granules and films and PHB block copolymers, as well as the unrelated polymers polylactic acid (PLA) and polycaprolactone (PCL). LtPHBase uses the expected Ser–His–Asp catalytic triad for hydrolysis at an optimal enzyme activity near 70°C. Here, the 1.75 Å resolution crystal structure of apo LtPHBase is presented and its chemical stability is profiled. Knowledge of its substrate preferences was extended to different-sized PHB granules. It is shown that LtPHBase is highly resistant to unfolding, with barriers typical for thermophilic enzymes, and shows a preference for low-molecular-mass PHB granules. These insights have implications for the long-term potential of LtPHBase as an industrial PHB hydrolase and shed light on the evolutionary role that this enzyme plays in bacterial metabolism.