sin

Using synchrotron high-resolution powder X-ray diffraction for the structure determination of a new cocrystal formed by two active principle ingredients

The crystal structure of a new 1:1 cocrystal of carbamazepine and S-naproxen (C15H12N2O·C14H14O3) was solved from powder X-ray diffraction (PXRD). The PXRD pattern was measured at the high-resolution beamline CRISTAL at synchrotron SOLEIL (France). The structure was solved using Monte Carlo simulated annealing, then refined with Rietveld refinement. The positions of the H atoms were obtained from density functional theory (DFT) ground-state calculations. The symmetry is ortho­rhom­bic with the space group P212121 (No. 19) and the following lattice parameters: a = 33.5486 (9), b = 26.4223 (6), c = 5.3651 (10) Å and V = 4755.83 (19) Å3.




sin

Using cocrystals as a tool to study non-crystallizing mol­ecules: crystal structure, Hirshfeld surface analysis and com­putational study of the 1:1 cocrystal of (E)-N-(3,4-di­fluoro­phen­yl)-1-(pyridin-4-yl)methanimine and acetic

Using a 1:1 cocrystal of (E)-N-(3,4-di­fluoro­phen­yl)-1-(pyridin-4-yl)methanimine with acetic acid, C12H8F2N2·C2H4O2, we investigate the influence of F atoms introduced to the aromatic ring on promoting π–π inter­actions. The cocrystal crystallizes in the triclinic space group P1. Through crystallographic analysis and com­putational studies, we reveal the mol­ecular arrangement within this co­crystal, demonstrating the presence of hydrogen bonding between the acetic acid mol­ecule and the pyridyl group, along with π–π inter­actions between the aromatic rings. Our findings highlight the importance of F atoms in promoting π–π inter­actions without necessitating full halogenation of the aromatic ring.




sin

Structural flexibility of Toscana virus nucleoprotein in the presence of a single-chain camelid antibody

Phenuiviridae nucleoprotein is the main structural and functional component of the viral cycle, protecting the viral RNA and mediating the essential replication/transcription processes. The nucleoprotein (N) binds the RNA using its globular core and polymerizes through the N-terminus, which is presented as a highly flexible arm, as demonstrated in this article. The nucleoprotein exists in an `open' or a `closed' conformation. In the case of the closed conformation the flexible N-terminal arm folds over the RNA-binding cleft, preventing RNA adsorption. In the open conformation the arm is extended in such a way that both RNA adsorption and N polymerization are possible. In this article, single-crystal X-ray diffraction and small-angle X-ray scattering were used to study the N protein of Toscana virus complexed with a single-chain camelid antibody (VHH) and it is shown that in the presence of the antibody the nucleoprotein is unable to achieve a functional assembly to form a ribonucleoprotein complex.




sin

AlphaFold-assisted structure determination of a bacterial protein of unknown function using X-ray and electron crystallography

Macromolecular crystallography generally requires the recovery of missing phase information from diffraction data to reconstruct an electron-density map of the crystallized molecule. Most recent structures have been solved using molecular replacement as a phasing method, requiring an a priori structure that is closely related to the target protein to serve as a search model; when no such search model exists, molecular replacement is not possible. New advances in computational machine-learning methods, however, have resulted in major advances in protein structure predictions from sequence information. Methods that generate predicted structural models of sufficient accuracy provide a powerful approach to molecular replacement. Taking advantage of these advances, AlphaFold predictions were applied to enable structure determination of a bacterial protein of unknown function (UniProtKB Q63NT7, NCBI locus BPSS0212) based on diffraction data that had evaded phasing attempts using MIR and anomalous scattering methods. Using both X-ray and micro-electron (microED) diffraction data, it was possible to solve the structure of the main fragment of the protein using a predicted model of that domain as a starting point. The use of predicted structural models importantly expands the promise of electron diffraction, where structure determination relies critically on molecular replacement.




sin

Using cryo-EM to understand the assembly pathway of respiratory complex I

Complex I (proton-pumping NADH:ubiquinone oxidoreductase) is the first component of the mitochondrial respiratory chain. In recent years, high-resolution cryo-EM studies of complex I from various species have greatly enhanced the understanding of the structure and function of this important membrane-protein complex. Less well studied is the structural basis of complex I biogenesis. The assembly of this complex of more than 40 subunits, encoded by nuclear or mitochondrial DNA, is an intricate process that requires at least 20 different assembly factors in humans. These are proteins that are transiently associated with building blocks of the complex and are involved in the assembly process, but are not part of mature complex I. Although the assembly pathways have been studied extensively, there is limited information on the structure and molecular function of the assembly factors. Here, the insights that have been gained into the assembly process using cryo-EM are reviewed.




sin

A service-based approach to cryoEM facility processing pipelines at eBIC

Electron cryo-microscopy image-processing workflows are typically composed of elements that may, broadly speaking, be categorized as high-throughput workloads which transition to high-performance workloads as preprocessed data are aggregated. The high-throughput elements are of particular importance in the context of live processing, where an optimal response is highly coupled to the temporal profile of the data collection. In other words, each movie should be processed as quickly as possible at the earliest opportunity. The high level of disconnected parallelization in the high-throughput problem directly allows a completely scalable solution across a distributed computer system, with the only technical obstacle being an efficient and reliable implementation. The cloud computing frameworks primarily developed for the deployment of high-availability web applications provide an environment with a number of appealing features for such high-throughput processing tasks. Here, an implementation of an early-stage processing pipeline for electron cryotomography experiments using a service-based architecture deployed on a Kubernetes cluster is discussed in order to demonstrate the benefits of this approach and how it may be extended to scenarios of considerably increased complexity.




sin

Advanced exploitation of unmerged reflection data during processing and refinement with autoPROC and BUSTER

The validation of structural models obtained by macromolecular X-ray crystallography against experimental diffraction data, whether before deposition into the PDB or after, is typically carried out exclusively against the merged data that are eventually archived along with the atomic coordinates. It is shown here that the availability of unmerged reflection data enables valuable additional analyses to be performed that yield improvements in the final models, and tools are presented to implement them, together with examples of the results to which they give access. The first example is the automatic identification and removal of image ranges affected by loss of crystal centering or by excessive decay of the diffraction pattern as a result of radiation damage. The second example is the `reflection-auditing' process, whereby individual merged data items showing especially poor agreement with model predictions during refinement are investigated thanks to the specific metadata (such as image number and detector position) that are available for the corresponding unmerged data, potentially revealing previously undiagnosed instrumental, experimental or processing problems. The third example is the calculation of so-called F(early) − F(late) maps from carefully selected subsets of unmerged amplitude data, which can not only highlight the location and extent of radiation damage but can also provide guidance towards suitable fine-grained parametrizations to model the localized effects of such damage.




sin

High-confidence placement of low-occupancy fragments into electron density using the anomalous signal of sulfur and halogen atoms

Fragment-based drug design using X-ray crystallography is a powerful technique to enable the development of new lead compounds, or probe molecules, against biological targets. This study addresses the need to determine fragment binding orientations for low-occupancy fragments with incomplete electron density, an essential step before further development of the molecule. Halogen atoms play multiple roles in drug discovery due to their unique combination of electronegativity, steric effects and hydrophobic properties. Fragments incorporating halogen atoms serve as promising starting points in hit-to-lead development as they often establish halogen bonds with target proteins, potentially enhancing binding affinity and selectivity, as well as counteracting drug resistance. Here, the aim was to unambiguously identify the binding orientations of fragment hits for SARS-CoV-2 nonstructural protein 1 (nsp1) which contain a combination of sulfur and/or chlorine, bromine and iodine substituents. The binding orientations of carefully selected nsp1 analogue hits were focused on by employing their anomalous scattering combined with Pan-Dataset Density Analysis (PanDDA). Anomalous difference Fourier maps derived from the diffraction data collected at both standard and long-wavelength X-rays were compared. The discrepancies observed in the maps of iodine-containing fragments collected at different energies were attributed to site-specific radiation-damage stemming from the strong X-ray absorption of I atoms, which is likely to cause cleavage of the C—I bond. A reliable and effective data-collection strategy to unambiguously determine the binding orientations of low-occupancy fragments containing sulfur and/or halogen atoms while mitigating radiation damage is presented.




sin

Validation of electron-microscopy maps using solution small-angle X-ray scattering

The determination of the atomic resolution structure of biomacromolecules is essential for understanding details of their function. Traditionally, such a structure determination has been performed with crystallographic or nuclear resonance methods, but during the last decade, cryogenic transmission electron microscopy (cryo-TEM) has become an equally important tool. As the blotting and flash-freezing of the samples can induce conformational changes, external validation tools are required to ensure that the vitrified samples are representative of the solution. Although many validation tools have already been developed, most of them rely on fully resolved atomic models, which prevents early screening of the cryo-TEM maps. Here, a novel and automated method for performing such a validation utilizing small-angle X-ray scattering measurements, publicly available through the new software package AUSAXS, is introduced and implemented. The method has been tested on both simulated and experimental data, where it was shown to work remarkably well as a validation tool. The method provides a dummy atomic model derived from the EM map which best represents the solution structure.




sin

Protonation of histidine rings using quantum-mechanical methods

Histidine can be protonated on either or both of the two N atoms of the imidazole moiety. Each of the three possible forms occurs as a result of the stereochemical environment of the histidine side chain. In an atomic model, comparing the possible protonation states in situ, looking at possible hydrogen bonding and metal coordination, it is possible to predict which is most likely to be correct. A more direct method is described that uses quantum-mechanical methods to calculate, also in situ, the minimum geometry and energy for comparison, and therefore to more accurately identify the most likely proton­ation state.




sin

The success rate of processed predicted models in molecular replacement: implications for experimental phasing in the AlphaFold era

The availability of highly accurate protein structure predictions from AlphaFold2 (AF2) and similar tools has hugely expanded the applicability of molecular replacement (MR) for crystal structure solution. Many structures can be solved routinely using raw models, structures processed to remove unreliable parts or models split into distinct structural units. There is therefore an open question around how many and which cases still require experimental phasing methods such as single-wavelength anomalous diffraction (SAD). Here, this question is addressed using a large set of PDB depositions that were solved by SAD. A large majority (87%) could be solved using unedited or minimally edited AF2 predictions. A further 18 (4%) yield straightforwardly to MR after splitting of the AF2 prediction using Slice'N'Dice, although different splitting methods succeeded on slightly different sets of cases. It is also found that further unique targets can be solved by alternative modelling approaches such as ESMFold (four cases), alternative MR approaches such as ARCIMBOLDO and AMPLE (two cases each), and multimeric model building with AlphaFold-Multimer or UniFold (three cases). Ultimately, only 12 cases, or 3% of the SAD-phased set, did not yield to any form of MR tested here, offering valuable hints as to the number and the characteristics of cases where experimental phasing remains essential for macromolecular structure solution.




sin

EMhub: a web platform for data management and on-the-fly processing in scientific facilities

Most scientific facilities produce large amounts of heterogeneous data at a rapid pace. Managing users, instruments, reports and invoices presents additional challenges. To address these challenges, EMhub, a web platform designed to support the daily operations and record-keeping of a scientific facility, has been introduced. EMhub enables the easy management of user information, instruments, bookings and projects. The application was initially developed to meet the needs of a cryoEM facility, but its functionality and adaptability have proven to be broad enough to be extended to other data-generating centers. The expansion of EMHub is enabled by the modular nature of its core functionalities. The application allows external processes to be connected via a REST API, automating tasks such as folder creation, user and password generation, and the execution of real-time data-processing pipelines. EMhub has been used for several years at the Swedish National CryoEM Facility and has been installed in the CryoEM center at the Structural Biology Department at St. Jude Children's Research Hospital. A fully automated single-particle pipeline has been implemented for on-the-fly data processing and analysis. At St. Jude, the X-Ray Crystallography Center and the Single-Molecule Imaging Center have already expanded the platform to support their operational and data-management workflows.




sin

Analysis of crystallographic phase retrieval using iterative projection algorithms

For protein crystals in which more than two thirds of the volume is occupied by solvent, the featureless nature of the solvent region often generates a constraint that is powerful enough to allow direct phasing of X-ray diffraction data. Practical implementation relies on the use of iterative projection algorithms with good global convergence properties to solve the difficult nonconvex phase-retrieval problem. In this paper, some aspects of phase retrieval using iterative projection algorithms are systematically explored, where the diffraction data and density-value distributions in the protein and solvent regions provide the sole constraints. The analysis is based on the addition of random error to the phases of previously determined protein crystal structures, followed by evaluation of the ability to recover the correct phase set as the distance from the solution increases. The properties of the difference-map (DM), relaxed–reflect–reflect (RRR) and relaxed averaged alternating reflectors (RAAR) algorithms are compared. All of these algorithms prove to be effective for crystallographic phase retrieval, and the useful ranges of the adjustable parameter which controls their behavior are established. When these algorithms converge to the solution, the algorithm trajectory becomes stationary; however, the density function continues to fluctuate significantly around its mean position. It is shown that averaging over the algorithm trajectory in the stationary region, following convergence, improves the density estimate, with this procedure outperforming previous approaches for phase or density refinement.




sin

Structure determination using high-order spatial correlations in single-particle X-ray scattering

Single-particle imaging using X-ray free-electron lasers (XFELs) is a promising technique for observing nanoscale biological samples under near-physiological conditions. However, as the sample's orientation in each diffraction pattern is unknown, advanced algorithms are required to reconstruct the 3D diffraction intensity volume and subsequently the sample's density model. While most approaches perform 3D reconstruction via determining the orientation of each diffraction pattern, a correlation-based approach utilizes the averaged spatial correlations of diffraction intensities over all patterns, making it well suited for processing experimental data with a poor signal-to-noise ratio of individual patterns. Here, a method is proposed to determine the 3D structure of a sample by analyzing the double, triple and quadruple spatial correlations in diffraction patterns. This ab initio method can reconstruct the basic shape of an irregular unsymmetric 3D sample without requiring any prior knowledge of the sample. The impact of background and noise on correlations is investigated and corrected to ensure the success of reconstruction under simulated experimental conditions. Additionally, the feasibility of using the correlation-based approach to process incomplete partial diffraction patterns is demonstrated. The proposed method is a variable addition to existing algorithms for 3D reconstruction and will further promote the development and adoption of XFEL single-particle imaging techniques.




sin

Refining short-range order parameters from the three-dimensional diffuse scattering in single-crystal electron diffraction data

Our study compares short-range order parameters refined from the diffuse scattering in single-crystal X-ray and single-crystal electron diffraction data. Nb0.84CoSb was chosen as a reference material. The correlations between neighbouring vacancies and the displacements of Sb and Co atoms were refined from the diffuse scattering using a Monte Carlo refinement in DISCUS. The difference between the Sb and Co displacements refined from the diffuse scattering and the Sb and Co displacements refined from the Bragg reflections in single-crystal X-ray diffraction data is 0.012 (7) Å for the refinement on diffuse scattering in single-crystal X-ray diffraction data and 0.03 (2) Å for the refinement on the diffuse scattering in single-crystal electron diffraction data. As electron diffraction requires much smaller crystals than X-ray diffraction, this opens up the possibility of refining short-range order parameters in many technologically relevant materials for which no crystals large enough for single-crystal X-ray diffraction are available.




sin

The prediction of single-molecule magnet properties via deep learning

This paper uses deep learning to present a proof-of-concept for data-driven chemistry in single-molecule magnets (SMMs). Previous discussions within SMM research have proposed links between molecular structures (crystal structures) and single-molecule magnetic properties; however, these have only interpreted the results. Therefore, this study introduces a data-driven approach to predict the properties of SMM structures using deep learning. The deep-learning model learns the structural features of the SMM molecules by extracting the single-molecule magnetic properties from the 3D coordinates presented in this paper. The model accurately determined whether a molecule was a single-molecule magnet, with an accuracy rate of approximately 70% in predicting the SMM properties. The deep-learning model found SMMs from 20 000 metal complexes extracted from the Cambridge Structural Database. Using deep-learning models for predicting SMM properties and guiding the design of novel molecules is promising.




sin

The ABC toxin complex from Yersinia entomophaga can package three different cytotoxic components expressed from distinct genetic loci in an unfolded state: the structures of both shell and cargo

Bacterial ABC toxin complexes (Tcs) comprise three core proteins: TcA, TcB and TcC. The TcA protein forms a pentameric assembly that attaches to the surface of target cells and penetrates the cell membrane. The TcB and TcC proteins assemble as a heterodimeric TcB–TcC subcomplex that makes a hollow shell. This TcB–TcC subcomplex self-cleaves and encapsulates within the shell a cytotoxic `cargo' encoded by the C-terminal region of the TcC protein. Here, we describe the structure of a previously uncharacterized TcC protein from Yersinia entomophaga, encoded by a gene at a distant genomic location from the genes encoding the rest of the toxin complex, in complex with the TcB protein. When encapsulated within the TcB–TcC shell, the C-terminal toxin adopts an unfolded and disordered state, with limited areas of local order stabilized by the chaperone-like inner surface of the shell. We also determined the structure of the toxin cargo alone and show that when not encapsulated within the shell, it adopts an ADP-ribosyltransferase fold most similar to the catalytic domain of the SpvB toxin from Salmonella typhimurium. Our structural analysis points to a likely mechanism whereby the toxin acts directly on actin, modifying it in a way that prevents normal polymerization.




sin

From X-ray crystallographic structure to intrinsic thermodynamics of protein–ligand binding using carbonic anhydrase isozymes as a model system

Carbonic anhydrase (CA) was among the first proteins whose X-ray crystal structure was solved to atomic resolution. CA proteins have essentially the same fold and similar active centers that differ in only several amino acids. Primary sulfonamides are well defined, strong and specific binders of CA. However, minor variations in chemical structure can significantly alter their binding properties. Over 1000 sulfonamides have been designed, synthesized and evaluated to understand the correlations between the structure and thermodynamics of their binding to the human CA isozyme family. Compound binding was determined by several binding assays: fluorescence-based thermal shift assay, stopped-flow enzyme activity inhibition assay, isothermal titration calorimetry and competition assay for enzyme expressed on cancer cell surfaces. All assays have advantages and limitations but are necessary for deeper characterization of these protein–ligand interactions. Here, the concept and importance of intrinsic binding thermodynamics is emphasized and the role of structure–thermodynamics correlations for the novel inhibitors of CA IX is discussed – an isozyme that is overexpressed in solid hypoxic tumors, and thus these inhibitors may serve as anticancer drugs. The abundant structural and thermodynamic data are assembled into the Protein–Ligand Binding Database to understand general protein–ligand recognition principles that could be used in drug discovery.




sin

A predicted model-aided reconstruction algorithm for X-ray free-electron laser single-particle imaging

Ultra-intense, ultra-fast X-ray free-electron lasers (XFELs) enable the imaging of single protein molecules under ambient temperature and pressure. A crucial aspect of structure reconstruction involves determining the relative orientations of each diffraction pattern and recovering the missing phase information. In this paper, we introduce a predicted model-aided algorithm for orientation determination and phase retrieval, which has been tested on various simulated datasets and has shown significant improvements in the success rate, accuracy and efficiency of XFEL data reconstruction.




sin

A modified phase-retrieval algorithm to facilitate automatic de novo macromolecular structure determination in single-wavelength anomalous diffraction

The success of experimental phasing in macromolecular crystallography relies primarily on the accurate locations of heavy atoms bound to the target crystal. To improve the process of substructure determination, a modified phase-retrieval algorithm built on the framework of the relaxed alternating averaged reflection (RAAR) algorithm has been developed. Importantly, the proposed algorithm features a combination of the π-half phase perturbation for weak reflections and enforces the direct-method-based tangent formula for strong reflections in reciprocal space. The proposed algorithm is extensively demonstrated on a total of 100 single-wavelength anomalous diffraction (SAD) experimental datasets, comprising both protein and nucleic acid structures of different qualities. Compared with the standard RAAR algorithm, the modified phase-retrieval algorithm exhibits significantly improved effectiveness and accuracy in SAD substructure determination, highlighting the importance of additional constraints for algorithmic performance. Furthermore, the proposed algorithm can be performed without human intervention under most conditions owing to the self-adaptive property of the input parameters, thus making it convenient to be integrated into the structural determination pipeline. In conjunction with the IPCAS software suite, we demonstrated experimentally that automatic de novo structure determination is possible on the basis of our proposed algorithm.




sin

Benchmarking predictive methods for small-angle X-ray scattering from atomic coordinates of proteins using maximum likelihood consensus data

Stimulated by informal conversations at the XVII International Small Angle Scattering (SAS) conference (Traverse City, 2017), an international team of experts undertook a round-robin exercise to produce a large dataset from proteins under standard solution conditions. These data were used to generate consensus SAS profiles for xylose isomerase, urate oxidase, xylanase, lysozyme and ribonuclease A. Here, we apply a new protocol using maximum likelihood with a larger number of the contributed datasets to generate improved consensus profiles. We investigate the fits of these profiles to predicted profiles from atomic coordinates that incorporate different models to account for the contribution to the scattering of water molecules of hydration surrounding proteins in solution. Programs using an implicit, shell-type hydration layer generally optimize fits to experimental data with the aid of two parameters that adjust the volume of the bulk solvent excluded by the protein and the contrast of the hydration layer. For these models, we found the error-weighted residual differences between the model and the experiment generally reflected the subsidiary maxima and minima in the consensus profiles that are determined by the size of the protein plus the hydration layer. By comparison, all-atom solute and solvent molecular dynamics (MD) simulations are without the benefit of adjustable parameters and, nonetheless, they yielded at least equally good fits with residual differences that are less reflective of the structure in the consensus profile. Further, where MD simulations accounted for the precise solvent composition of the experiment, specifically the inclusion of ions, the modelled radius of gyration values were significantly closer to the experiment. The power of adjustable parameters to mask real differences between a model and the structure present in solution is demonstrated by the results for the conformationally dynamic ribonuclease A and calculations with pseudo-experimental data. This study shows that, while methods invoking an implicit hydration layer have the unequivocal advantage of speed, care is needed to understand the influence of the adjustable parameters. All-atom solute and solvent MD simulations are slower but are less susceptible to false positives, and can account for thermal fluctuations in atomic positions, and more accurately represent the water molecules of hydration that contribute to the scattering profile.




sin

High-accuracy measurement, advanced theory and analysis of the evolution of satellite transitions in manganese Kα using XR-HERFD

Here, the novel technique of extended-range high-energy-resolution fluorescence detection (XR-HERFD) has successfully observed the n = 2 satellite in manganese to a high accuracy. The significance of the satellite signature presented is many hundreds of standard errors and well beyond typical discovery levels of three to six standard errors. This satellite is a sensitive indicator for all manganese-containing materials in condensed matter. The uncertainty in the measurements has been defined, which clearly observes multiple peaks and structure indicative of complex physical quantum-mechanical processes. Theoretical calculations of energy eigenvalues, shake-off probability and Auger rates are also presented, which explain the origin of the satellite from physical n = 2 shake-off processes. The evolution in the intensity of this satellite is measured relative to the full Kα spectrum of manganese to investigate satellite structure, and therefore many-body processes, as a function of incident energy. Results demonstrate that the many-body reduction factor S02 should not be modelled with a constant value as is currently done. This work makes a significant contribution to the challenge of understanding many-body processes and interpreting HERFD or resonant inelastic X-ray scattering spectra in a quantitative manner.




sin

Capturing the blue-light activated state of the Phot-LOV1 domain from Chlamydomonas reinhardtii using time-resolved serial synchrotron crystallography

Light–oxygen–voltage (LOV) domains are small photosensory flavoprotein modules that allow the conversion of external stimuli (sunlight) into intra­cellular signals responsible for various cell behaviors (e.g. phototropism and chloro­plast relocation). This ability relies on the light-induced formation of a covalent thio­ether adduct between a flavin chromophore and a reactive cysteine from the protein environment, which triggers a cascade of structural changes that result in the activation of a serine/threonine (Ser/Thr) kinase. Recent developments in time-resolved crystallography may allow the activation cascade of the LOV domain to be observed in real time, which has been elusive. In this study, we report a robust protocol for the production and stable delivery of microcrystals of the LOV domain of phototropin Phot-1 from Chlamydomonas reinhardtii (CrPhotLOV1) with a high-viscosity injector for time-resolved serial synchrotron crystallography (TR-SSX). The detailed process covers all aspects, from sample optimization to data collection, which may serve as a guide for soluble protein preparation for TR-SSX. In addition, we show that the crystals obtained preserve the photoreactivity using infrared spectroscopy. Furthermore, the results of the TR-SSX experiment provide high-resolution insights into structural alterations of CrPhotLOV1 from Δt = 2.5 ms up to Δt = 95 ms post-photoactivation, including resolving the geometry of the thio­ether adduct and the C-terminal region implicated in the signal transduction process.




sin

Refinement of cryo-EM 3D maps with a self-supervised denoising model: crefDenoiser

Cryogenic electron microscopy (cryo-EM) is a pivotal technique for imaging macromolecular structures. However, despite extensive processing of large image sets collected in cryo-EM experiments to amplify the signal-to-noise ratio, the reconstructed 3D protein-density maps are often limited in quality due to residual noise, which in turn affects the accuracy of the macromolecular representation. Here, crefDenoiser is introduced, a denoising neural network model designed to enhance the signal in 3D cryo-EM maps produced with standard processing pipelines. The crefDenoiser model is trained without the need for `clean' ground-truth target maps. Instead, a custom dataset is employed, composed of real noisy protein half-maps sourced from the Electron Microscopy Data Bank repository. Competing with the current state-of-the-art, crefDenoiser is designed to optimize for the theoretical noise-free map during self-supervised training. We demonstrate that our model successfully amplifies the signal across a wide variety of protein maps, outperforming a classic map denoiser and following a network-based sharpening model. Without biasing the map, the proposed denoising method leads to improved visibility of protein structural features, including protein domains, secondary structure elements and modest high-resolution feature restoration.




sin

On the structure refinement of metal complexes against 3D electron diffraction data using multipolar scattering factors

This study examines various methods for modelling the electron density and, thus, the electrostatic potential of an organometallic complex for use in crystal structure refinement against 3D electron diffraction (ED) data. It focuses on modelling the scattering factors of iron(III), considering the electron density distribution specific for coordination with organic linkers. We refined the structural model of the metal–organic complex, iron(III) acetyl­acetonate (FeAcAc), using both the independent atom model (IAM) and the transferable aspherical atom model (TAAM). TAAM refinement initially employed multipolar parameters from the MATTS databank for acetyl­acetonate, while iron was modelled with a spherical and neutral approach (TAAM ligand). Later, custom-made TAAM scattering factors for Fe—O coordination were derived from DFT calculations [TAAM-ligand-Fe(III)]. Our findings show that, in this compound, the TAAM scattering factor corresponding to Fe3+ has a lower scattering amplitude than the Fe3+ charged scattering factor described by IAM. When using scattering factors corresponding to the oxidation state of iron, IAM inaccurately represents electrostatic potential maps and overestimates the scattering potential of the iron. In addition, TAAM significantly improved the fitting of the model to the data, shown by improved R1 values, goodness-of-fit (GooF) and reduced noise in the Fourier difference map (based on the residual distribution analysis). For 3D ED, R1 values improved from 19.36% (IAM) to 17.44% (TAAM-ligand) and 17.49% (TAAM-ligand-Fe3+), and for single-crystal X-ray diffraction (SCXRD) from 3.82 to 2.03% and 1.98%, respectively. For 3D ED, the most significant R1 reductions occurred in the low-resolution region (8.65–2.00 Å), dropping from 20.19% (IAM) to 14.67% and 14.89% for TAAM-ligand and TAAM-ligand-Fe(III), respectively, with less improvement in high-resolution ranges (2.00–0.85 Å). This indicates that the major enhancements are due to better scattering modelling in low-resolution zones. Furthermore, when using TAAM instead of IAM, there was a noticeable improvement in the shape of the thermal ellipsoids, which more closely resembled those of an SCXRD-refined model. This study demonstrates the applicability of more sophisticated scattering factors to improve the refinement of metal–organic complexes against 3D ED data, suggesting the need for more accurate modelling methods and highlighting the potential of TAAM in examining the charge distribution of large molecular structures using 3D ED.




sin

Phase quantification using deep neural network processing of XRD patterns

Mineral identification and quantification are key to the understanding and, hence, the capacity to predict material properties. The method of choice for mineral quantification is powder X-ray diffraction (XRD), generally using a Rietveld refinement approach. However, a successful Rietveld refinement requires preliminary identification of the phases that make up the sample. This is generally carried out manually, and this task becomes extremely long or virtually impossible in the case of very large datasets such as those from synchrotron X-ray diffraction computed tomography. To circumvent this issue, this article proposes a novel neural network (NN) method for automating phase identification and quantification. An XRD pattern calculation code was used to generate large datasets of synthetic data that are used to train the NN. This approach offers significant advantages, including the ability to construct databases with a substantial number of XRD patterns and the introduction of extensive variability into these patterns. To enhance the performance of the NN, a specifically designed loss function for proportion inference was employed during the training process, offering improved efficiency and stability compared with traditional functions. The NN, trained exclusively with synthetic data, proved its ability to identify and quantify mineral phases on synthetic and real XRD patterns. Trained NN errors were equal to 0.5% for phase quantification on the synthetic test set, and 6% on the experimental data, in a system containing four phases of contrasting crystal structures (calcite, gibbsite, dolomite and hematite). The proposed method is freely available on GitHub and allows for major advances since it can be applied to any dataset, regardless of the mineral phases present.




sin

Crossing length scales: X-ray approaches to studying the structure of biological materials

Biological materials have outstanding properties. With ease, challenging mechanical, optical or electrical properties are realised from comparatively `humble' building blocks. The key strategy to realise these properties is through extensive hierarchical structuring of the material from the millimetre to the nanometre scale in 3D. Though hierarchical structuring in biological materials has long been recognized, the 3D characterization of such structures remains a challenge. To understand the behaviour of materials, multimodal and multi-scale characterization approaches are needed. In this review, we outline current X-ray analysis approaches using the structures of bone and shells as examples. We show how recent advances have aided our understanding of hierarchical structures and their functions, and how these could be exploited for future research directions. We also discuss current roadblocks including radiation damage, data quantity and sample preparation, as well as strategies to address them.




sin

A predicted model-aided one-step classification–multireconstruction algorithm for X-ray free-electron laser single-particle imaging

Ultrafast, high-intensity X-ray free-electron lasers can perform diffraction imaging of single protein molecules. Various algorithms have been developed to determine the orientation of each single-particle diffraction pattern and reconstruct the 3D diffraction intensity. Most of these algorithms rely on the premise that all diffraction patterns originate from identical protein molecules. However, in actual experiments, diffraction patterns from multiple different molecules may be collected simultaneously. Here, we propose a predicted model-aided one-step classification–multireconstruction algorithm that can handle mixed diffraction patterns from various molecules. The algorithm uses predicted structures of different protein molecules as templates to classify diffraction patterns based on correlation coefficients and determines orientations using a correlation maximization method. Tests on simulated data demonstrated high accuracy and efficiency in classification and reconstruction.




sin

Quantum refinement in real and reciprocal space using the Phenix and ORCA software

X-ray and neutron crystallography, as well as cryogenic electron microscopy (cryo-EM), are the most common methods to obtain atomic structures of biological macromolecules. A feature they all have in common is that, at typical resolutions, the experimental data need to be supplemented by empirical restraints, ensuring that the final structure is chemically reasonable. The restraints are accurate for amino acids and nucleic acids, but often less accurate for substrates, inhibitors, small-molecule ligands and metal sites, for which experimental data are scarce or empirical potentials are harder to formulate. This can be solved using quantum mechanical calculations for a small but interesting part of the structure. Such an approach, called quantum refinement, has been shown to improve structures locally, allow the determination of the protonation and oxidation states of ligands and metals, and discriminate between different interpretations of the structure. Here, we present a new implementation of quantum refinement interfacing the widely used structure-refinement software Phenix and the freely available quantum mechanical software ORCA. Through application to manganese superoxide dismutase and V- and Fe-nitro­genase, we show that the approach works effectively for X-ray and neutron crystal structures, that old results can be reproduced and structural discrimination can be performed. We discuss how the weight factor between the experimental data and the empirical restraints should be selected and how quantum mechanical quality measures such as strain energies should be calculated. We also present an application of quantum refinement to cryo-EM data for particulate methane monooxygenase and show that this may be the method of choice for metal sites in such structures because no accurate empirical restraints are currently available for metals.




sin

Using deep-learning predictions reveals a large number of register errors in PDB depositions

The accuracy of the information in the Protein Data Bank (PDB) is of great importance for the myriad downstream applications that make use of protein structural information. Despite best efforts, the occasional introduction of errors is inevitable, especially where the experimental data are of limited resolution. A novel protein structure validation approach based on spotting inconsistencies between the residue contacts and distances observed in a structural model and those computationally predicted by methods such as AlphaFold2 has previously been established. It is particularly well suited to the detection of register errors. Importantly, this new approach is orthogonal to traditional methods based on stereochemistry or map–model agreement, and is resolution independent. Here, thousands of likely register errors are identified by scanning 3–5 Å resolution structures in the PDB. Unlike most methods, the application of this approach yields suggested corrections to the register of affected regions, which it is shown, even by limited implementation, lead to improved refinement statistics in the vast majority of cases. A few limitations and confounding factors such as fold-switching proteins are characterized, but this approach is expected to have broad application in spotting potential issues in current accessions and, through its implementation and distribution in CCP4, helping to ensure the accuracy of future depositions.




sin

Crystal structures of two new high-pressure oxynitrides with composition SnGe4N4O4, from single-crystal electron diffraction

SnGe4N4O4 was synthesized at high pressure (16 and 20 GPa) and high temperature (1200 and 1500°C) in a large-volume press. Powder X-ray diffraction experiments using synchrotron radiation indicate that the derived samples are mixtures of known and unknown phases. However, the powder X-ray diffraction patterns are not sufficient for structural characterization. Transmission electron microscopy studies reveal crystals of several hundreds of nanometres in size with different chemical composition. Among them, crystals of a previously unknown phase with stoichiometry SnGe4N4O4 were detected and investigated using automated diffraction tomography (ADT), a three-dimensional electron diffraction method. Via ADT, the crystal structure could be determined from single nanocrystals in space group P63mc, exhibiting a nolanite-type structure. This was confirmed by density functional theory calculations and atomic resolution scanning transmission electron microscopy images. In one of the syntheses runs a rhombohedral 6R polytype of SnGe4N4O4 could be found together with the nolanite-type SnGe4N4O4. The structure of this polymorph was solved as well using ADT.




sin

Search for missing symmetry in the Inorganic Crystal Structure Database (ICSD)

An exhaustive search for missing symmetry was performed for 223 076 entries in the ICSD (2023-2 release). Approximately 0.65% of them can be described with higher symmetry than reported. Out of the identified noncentrosymmetric entries, ∼74% can be described by centrosymmetric space groups; this has implications for compatible physical properties. It is proposed that the information on the correct space group is included in the ICSD.




sin

Determining magnetic structures in GSAS-II using the Bilbao Crystallographic Server tool k-SUBGROUPSMAG

The embedded call to a special version of the web-based Bilbao Crystallographic Server tool k-SUBGROUPSMAG from within GSAS-II to form a list of all possible commensurate magnetic subgroups of a parent magnetic grey group is described. It facilitates the selection and refinement of the best commensurate magnetic structure model by having all the analysis tools including Rietveld refinement in one place as part of GSAS-II. It also provides the chosen magnetic space group as one of the 1421 possible standard Belov–Neronova–Smirnova forms or equivalent non-standard versions.




sin

Hard X-ray single-shot spectrometer of PAL-XFEL

A hard X-ray single-shot spectrometer comprising thin, bent Si crystals has been developed for the Pohang Accelerator Laboratory X-ray Free-Electron Laser (XFEL), for detailed analysis of ultrafast 4.5–17 keV XFEL pulses with a high spectral resolution. This instrument facilitates shot-to-shot spectral structure monitoring and optimization of the operating conditions of the XFEL owing to its ability to provide comprehensive data on the spectral properties and fluctuations of self-amplified spontaneous emission, monochromatic and seeded XFEL modes.




sin

Enhancing the Efficiency of a Wavelength-Dispersive Spectrometer based upon a Slit-less Design Using a Single-Bounce Monocapillary

A slit-less wavelength-dispersive spectrometer design using a single-bounce monocapillary that aligns the sample on the Rowland circle, enhancing photon throughput and maintaining resolution. The compact design supports flexibility and reconfiguration in facilities without complex beamline infrastructure, significantly improving detection efficiency.




sin

Texture measurements on quartz single crystals to validate coordinate systems for neutron time-of-flight texture analysis

In crystallographic texture analysis, ensuring that sample directions are preserved from experiment to the resulting orientation distribution is crucial to obtain physical meaning from diffraction data. This work details a procedure to ensure instrument and sample coordinates are consistent when analyzing diffraction data with a Rietveld refinement using the texture analysis software MAUD. A quartz crystal is measured on the HIPPO diffractometer at Los Alamos National Laboratory for this purpose. The methods described here can be applied to any diffraction instrument measuring orientation distributions in polycrystalline materials.




sin

Van Vleck analysis of angularly distorted octahedra using VanVleckCalculator

Van Vleck modes describe all possible displacements of octahedrally coordinated ligands about a core atom. They are a useful analytical tool for analysing the distortion of octahedra, particularly for first-order Jahn–Teller distortions, but determination of the Van Vleck modes of an octahedron is complicated by the presence of angular distortion of the octahedron. This problem is most commonly resolved by calculating the bond distortion modes (Q2, Q3) along the bond axes of the octahedron, disregarding the angular distortion and losing information on the octahedral shear modes (Q4, Q5 and Q6) in the process. In this paper, the validity of assuming bond lengths to be orthogonal in order to calculate the Van Vleck modes is discussed, and a method is described for calculating Van Vleck modes without disregarding the angular distortion. A Python package for doing this, VanVleckCalculator, is introduced and some examples of its use are given. Finally, it is shown that octahedral shear and angular distortion are often, but not always, correlated, and a parameter η is proposed as the shear fraction. It is demonstrated that η can be used to predict whether the values will be correlated when varying a tuning parameter such as temperature or pressure.




sin

POMFinder: identifying polyoxometallate cluster structures from pair distribution function data using explainable machine learning

Characterization of a material structure with pair distribution function (PDF) analysis typically involves refining a structure model against an experimental data set, but finding or constructing a suitable atomic model for PDF modelling can be an extremely labour-intensive task, requiring carefully browsing through large numbers of possible models. Presented here is POMFinder, a machine learning (ML) classifier that rapidly screens a database of structures, here polyoxometallate (POM) clusters, to identify candidate structures for PDF data modelling. The approach is shown to identify suitable POMs from experimental data, including in situ data collected with fast acquisition times. This automated approach has significant potential for identifying suitable models for structure refinement to extract quantitative structural parameters in materials chemistry research. POMFinder is open source and user friendly, making it accessible to those without prior ML knowledge. It is also demonstrated that POMFinder offers a promising modelling framework for combined modelling of multiple scattering techniques.




sin

Using XAS to monitor radiation damage in real time and post-analysis, and investigation of systematic errors of fluorescence XAS for Cu-bound amyloid-β

X-ray absorption spectroscopy (XAS) is a promising technique for determining structural information from sensitive biological samples, but high-accuracy X-ray absorption fine structure (XAFS) requires corrections of systematic errors in experimental data. Low-temperature XAS and room-temperature X-ray absorption spectro-electrochemical (XAS-EC) measurements of N-truncated amyloid-β samples were collected and corrected for systematic effects such as dead time, detector efficiencies, monochromator glitches, self-absorption, radiation damage and noise at higher wavenumber (k). A new protocol was developed using extended X-ray absorption fine structure (EXAFS) data analysis for monitoring radiation damage in real time and post-analysis. The reliability of the structural determinations and consistency were validated using the XAS measurement experimental uncertainty. The correction of detector pixel efficiencies improved the fitting χ2 by 12%. An improvement of about 2.5% of the structural fitting was obtained after dead-time corrections. Normalization allowed the elimination of 90% of the monochromator glitches. The remaining glitches were manually removed. The dispersion of spectra due to self-absorption was corrected. Standard errors of experimental measurements were propagated from pointwise variance of the spectra after systematic corrections. Calculated uncertainties were used in structural refinements for obtaining precise and reliable values of structural parameters including atomic bond lengths and thermal parameters. This has permitted hypothesis testing.




sin

Refinement of X-ray and electron diffraction crystal structures using analytical Fourier transforms of Slater-type atomic wavefunctions in Olex2

An implementation of Slater-type spherical scattering factors for X-ray and electron diffraction for elements in the range Z = 1–103 is presented within the software Olex2. Both high- and low-angle Fourier behaviour of atomic electron density and electrostatic potential can thus be addressed, in contrast to the limited flexibility of the four Gaussian plus constant descriptions which are currently the most widely used method for calculating atomic scattering factors during refinement. The implementation presented here accommodates the increasing complexity of the electronic structure of heavier elements by using complete atomic wavefunctions without any interpolation between precalculated tables or intermediate fitting functions. Atomic wavefunctions for singly charged ions are implemented and made accessible, and these show drastic changes in electron diffraction scattering factors compared with the neutral atom. A comparison between the two different spherical models of neutral atoms is presented as an example for four different kinds of X-ray and two electron diffraction structures, and comparisons of refinement results using the existing diffraction data are discussed. A systematic but slight improvement in R values and residual densities can be observed when using the new scattering factors, and this is discussed relative to effects on the atomic displacement parameters and atomic positions, which are prominent near the heavier elements in a structure.




sin

The Pixel Anomaly Detection Tool: a user-friendly GUI for classifying detector frames using machine-learning approaches

Data collection at X-ray free electron lasers has particular experimental challenges, such as continuous sample delivery or the use of novel ultrafast high-dynamic-range gain-switching X-ray detectors. This can result in a multitude of data artefacts, which can be detrimental to accurately determining structure-factor amplitudes for serial crystallography or single-particle imaging experiments. Here, a new data-classification tool is reported that offers a variety of machine-learning algorithms to sort data trained either on manual data sorting by the user or by profile fitting the intensity distribution on the detector based on the experiment. This is integrated into an easy-to-use graphical user interface, specifically designed to support the detectors, file formats and software available at most X-ray free electron laser facilities. The highly modular design makes the tool easily expandable to comply with other X-ray sources and detectors, and the supervised learning approach enables even the novice user to sort data containing unwanted artefacts or perform routine data-analysis tasks such as hit finding during an experiment, without needing to write code.




sin

Visualizing the fibre texture of satin spar using laboratory 2D X-ray diffraction

The suitability of point focus X-ray beam and area detector techniques for the determination of the uniaxial symmetry axis (fibre texture) of the natural mineral satin spar is demonstrated. Among the various diffraction techniques used in this report, including powder diffraction, 2D pole figures, rocking curves looped on φ and 2D X-ray diffraction, a single simple symmetric 2D scan collecting the reciprocal plane perpendicular to the apparent fibre axis provided sufficient information to determine the crystallographic orientation of the fibre axis. A geometrical explanation of the `wing' feature formed by diffraction spots from the fibre-textured satin spar in 2D scans is provided. The technique of wide-range reciprocal space mapping restores the `wing' featured diffraction spots on the 2D detector back to reciprocal space layers, revealing the nature of the fibre-textured samples.




sin

Tripling of the scattering vector range of X-ray reflectivity on liquid surfaces using a double-crystal deflector

The maximum range of perpendicular momentum transfer (qz) has been tripled for X-ray scattering from liquid surfaces when using a double-crystal deflector setup to tilt the incident X-ray beam. This is achieved by employing a higher-energy X-ray beam to access Miller indices of reflecting crystal atomic planes that are three times higher than usual. The deviation from the exact Bragg angle condition induced by misalignment between the X-ray beam axis and the main rotation axis of the double-crystal deflector is calculated, and a fast and straightforward procedure to align them is deduced. An experimental method of measuring scattering intensity along the qz direction on liquid surfaces up to qz = 7 Å−1 is presented, with liquid copper serving as a reference system for benchmarking purposes.




sin

A workflow for single-particle structure determination via iterative phasing of rotational invariants in fluctuation X-ray scattering

Fluctuation X-ray scattering (FXS) offers a complementary approach for nano- and bioparticle imaging with an X-ray free-electron laser (XFEL), by extracting structural information from correlations in scattered XFEL pulses. Here a workflow is presented for single-particle structure determination using FXS. The workflow includes procedures for extracting the rotational invariants from FXS patterns, performing structure reconstructions via iterative phasing of the invariants, and aligning and averaging multiple reconstructions. The reconstruction pipeline is implemented in the open-source software xFrame and its functionality is demonstrated on several simulated structures.




sin

Program VUE: analysing distributions of cryo-EM projections using uniform spherical grids

Three-dimensional cryo electron microscopy reconstructions are obtained by extracting information from a large number of projections of the object. These projections correspond to different `views' or `orientations', i.e. directions in which these projections show the reconstructed object. Uneven distribution of these views and the presence of dominating preferred orientations may distort the reconstructed spatial images. This work describes the program VUE (views on uniform grids for cryo electron microscopy), designed to study such distributions. Its algorithms, based on uniform virtual grids on a sphere, allow an easy calculation and accurate quantitative analysis of the frequency distribution of the views. The key computational element is the Lambert azimuthal equal-area projection of a spherical uniform grid onto a disc. This projection keeps the surface area constant and represents the frequency distribution with no visual bias. Since it has multiple tunable parameters, the program is easily adaptable to individual needs, and to the features of a particular project or of the figure to be produced. It can help identify problems related to an uneven distribution of views. Optionally, it can modify the list of projections, distributing the views more uniformly. The program can also be used as a teaching tool.




sin

Tracking copper nanofiller evolution in polysiloxane during processing into SiOC ceramic

Polymer-derived ceramics (PDCs) remain at the forefront of research for a variety of applications including ultra-high-temperature ceramics, energy storage and functional coatings. Despite their wide use, questions remain about the complex structural transition from polymer to ceramic and how local structure influences the final microstructure and resulting properties. This is further complicated when nanofillers are introduced to tailor structural and functional properties, as nanoparticle surfaces can interact with the matrix and influence the resulting structure. The inclusion of crystalline nanofiller produces a mixed crystalline–amorphous composite, which poses characterization challenges. With this study, we aim to address these challenges with a local-scale structural study that probes changes in a polysiloxane matrix with incorporated copper nanofiller. Composites were processed at three unique temperatures to capture mixing, pyrolysis and initial crystallization stages for the pre-ceramic polymer. We observed the evolution of the nanofiller with electron microscopy and applied synchrotron X-ray diffraction with differential pair distribution function (d-PDF) analysis to monitor changes in the matrix's local structure and interactions with the nanofiller. The application of the d-PDF to PDC materials is novel and informs future studies to understand interfacial interactions between nanofiller and matrix throughout PDC processing.




sin

Accessing self-diffusion on nanosecond time and nanometre length scales with minute kinetic resolution

Neutron spectroscopy uniquely and non-destructively accesses diffusive dynamics in soft and biological matter, including for instance proteins in hydrated powders or in solution, and more generally dynamic properties of condensed matter on the molecular level. Given the limited neutron flux resulting in long counting times, it is important to optimize data acquisition for the specific question, in particular for time-resolved (kinetic) studies. The required acquisition time was recently significantly reduced by measurements of discrete energy transfers rather than quasi-continuous neutron scattering spectra on neutron backscattering spectrometers. Besides this reduction in acquisition times, smaller amounts of samples can be measured with better statistics, and most importantly, kinetically changing samples, such as aggregating or crystallizing samples, can be followed. However, given the small number of discrete energy transfers probed in this mode, established analysis frameworks for full spectra can break down. Presented here are new approaches to analyze measurements of diffusive dynamics recorded within fixed windows in energy transfer, and these are compared with the analysis of full spectra. The new approaches are tested by both modeled scattering functions and a comparative analysis of fixed energy window data and full spectra on well understood reference samples. This new approach can be employed successfully for kinetic studies of the dynamics focusing on the short-time apparent center-of-mass diffusion.




sin

Determination of α lamellae orientation in a β-Ti alloy using electron backscatter diffraction

The spatial orientation of α lamellae in a metastable β-Ti matrix of Timetal LCB (Ti–6.8 Mo–4.5 Fe–1.5 Al in wt%) was examined and the orientation of the hexagonal close-packed α lattice in the α lamella was determined. For this purpose, a combination of methods of small-angle X-ray scattering, scanning electron microscopy and electron backscatter diffraction was used. The habit planes of α laths are close to {111}β, which corresponds to (1320)α in the hexagonal coordinate system of the α phase. The longest α lamella direction lies approximately along one of the 〈110〉β directions which are parallel to the specific habit plane. Taking into account the average lattice parameters of the β and α phases in aged conditions in Timetal LCB, it was possible to index all main axes and faces of an α lath not only in the cubic coordinate system of the parent β phase but also in the hexagonal system of the α phase.




sin

Automated pipeline processing X-ray diffraction data from dynamic compression experiments on the Extreme Conditions Beamline of PETRA III

Presented and discussed here is the implementation of a software solution that provides prompt X-ray diffraction data analysis during fast dynamic compression experiments conducted within the dynamic diamond anvil cell technique. It includes efficient data collection, streaming of data and metadata to a high-performance cluster (HPC), fast azimuthal data integration on the cluster, and tools for controlling the data processing steps and visualizing the data using the DIOPTAS software package. This data processing pipeline is invaluable for a great number of studies. The potential of the pipeline is illustrated with two examples of data collected on ammonia–water mixtures and multiphase mineral assemblies under high pressure. The pipeline is designed to be generic in nature and could be readily adapted to provide rapid feedback for many other X-ray diffraction techniques, e.g. large-volume press studies, in situ stress/strain studies, phase transformation studies, chemical reactions studied with high-resolution diffraction etc.




sin

Quantitative selection of sample structures in small-angle scattering using Bayesian methods

Small-angle scattering (SAS) is a key experimental technique for analyzing nanoscale structures in various materials. In SAS data analysis, selecting an appropriate mathematical model for the scattering intensity is critical, as it generates a hypothesis of the structure of the experimental sample. Traditional model selection methods either rely on qualitative approaches or are prone to overfitting. This paper introduces an analytical method that applies Bayesian model selection to SAS measurement data, enabling a quantitative evaluation of the validity of mathematical models. The performance of the method is assessed through numerical experiments using artificial data for multicomponent spherical materials, demonstrating that this proposed analysis approach yields highly accurate and interpretable results. The ability of the method to analyze a range of mixing ratios and particle size ratios for mixed components is also discussed, along with its precision in model evaluation by the degree of fitting. The proposed method effectively facilitates quantitative analysis of nanoscale sample structures in SAS, which has traditionally been challenging, and is expected to contribute significantly to advancements in a wide range of fields.