Matthew Griffiths scite author profile

Finding the optimal alignment between two structures is important for identifying the minimum root-mean-square distance (RMSD) between them and as a starting point for calculating pathways. Most current algorithms for aligning structures are stochastic, scale exponentially with the size of structure, and the performance can be unreliable. We present two complementary methods for aligning structures corresponding to isolated clusters of atoms and to condensed matter described by a periodic cubic supercell. The first method (Go-PERMDIST), a branch and bound algorithm, locates the global minimum RMSD deterministically in polynomial time. The run time increases for larger RMSDs. The second method (FASTOVERLAP) is a heuristic algorithm that aligns structures by finding the global maximum kernel correlation between them using fast Fourier transforms (FFTs) and fast SO(3) transforms (SOFTs). For periodic systems, FASTOVERLAP scales with the square of the number of identical atoms in the system, reliably finds the best alignment between structures that are not too distant, and shows significantly better performance than existing algorithms. The expected run time for Go-PERMDIST is longer than FASTOVERLAP for periodic systems. For finite clusters, the FASTOVERLAP algorithm is competitive with existing algorithms. The expected run time for Go-PERMDIST to find the global RMSD between two structures deterministically is generally longer than for existing stochastic algorithms. However, with an earlier exit condition, Go-PERMDIST exhibits similar or better performance.

show abstract

Nested sampling for physical scientists

Ashton

Bernstein

Buchner

et al. 2022

Nat Rev Methods Primers

View full text Add to dashboard Cite

show abstract

Nested Basin-Sampling

Griffiths

Wales

2019

J. Chem. Theory Comput.

View full text Add to dashboard Cite

We report an embarrassingly parallel method for the evaluation of thermodynamic properties over an energy landscape exhibiting broken ergodicity, nested is the likelihood of the observed data D givenbasin-sampling (NBS). We also introduce the No Galilean U-Turn Sampler (NoGUTS), a new sampling scheme based on the No U-Turn Sampler (NUTS) introduced by Hoffman and Gelman (2014) that works with the Galilean Monte Carlo scheme introduced by Betancourt (2012) to aid the efficient generation of new live points. NoGUTS can be thought of as a form of reflective slice sampling with an automatic stopping criterion. We apply this approach to a benchmark atomic cluster of 31 Lennard-Jones atoms, which exhibits a low temperature solid–solid heat capacity peak. The calculated heat capacity is compared with results generated by parallel tempering (PT), basin-sampling parallel tempering (BSPT), and standard nested sampling (NS) simulations. NBS reproduces the full heat capacity curve predicted by PT and BSPT, while the NS calculation with similar computational cost fails to resolve the low-temperature solid–solid phase transition.

show abstract

PyAutoFit: A Classy Probabilistic Programming Language for Model Composition and Fitting

Nightingale¹,

Hayes²,

Griffiths³

2021

JOSS

View full text Add to dashboard Cite

A major trend in academia and data science is the rapid adoption of Bayesian statistics for data analysis and modeling, leading to the development of probabilistic programming languages (PPL). A PPL provides a framework that allows users to easily specify a probabilistic model and perform inference automatically. PyAutoFit is a Python-based PPL which interfaces with all aspects of the modeling (e.g., the model, data, fitting procedure, visualization, results) and therefore provides complete management of every aspect of modeling. This includes composing high-dimensionality models from individual model components, customizing the fitting procedure and performing data augmentation before a model-fit. Advanced features include database tools for analysing large suites of modeling results and exploiting domainspecific knowledge of a problem via non-linear search chaining. Accompanying PyAutoFit is the autofit workspace, which includes example scripts and the HowToFit lecture series which introduces non-experts to model-fitting and provides a guide on how to begin a project using PyAutoFit. Readers can try PyAutoFit right now by going to the introduction Jupyter notebook on Binder or checkout our readthedocs for a complete overview of PyAutoFit's features.

show abstract

Sampling Configurational Energy Landscapes

Griffiths¹

2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.