Monte Carlo simulations on UV‐vis titration data quantify how the precision of binding constant calculation deteriorates in strong binding regimes. For 1:1 binding, global analysis is reliable when K[H]o < 1000, a significant improvement over previous recommendations. Initial concentration error generates the majority of the error in the calculated binding constant. We derive a novel experimental design formula to maximize binding constant precision over both strong and weak binding regimes and validate it through Monte Carlo simulations.
Computation of binding constants from spectrophotometric titration data is a very popular application of chemometric hard modeling. However, the calculated values are misleading if the correct binding model is not used. Given that many supramolecular systems of interest feature unknown speciation, a priori determination of binding stoichiometry constitutes an important unsolved problem in chemometrics. We present a new and reliable algorithm for accomplishing this task, implemented using a hybrid particle swarm optimization technique. Simultaneous optimization of stoichiometry ratios and binding constants allows the optimal binding model to be calculated in just a few minutes for systems with up to four reactions. Simulated data studies demonstrate that the algorithm finds the correct stoichiometry with up to nine reactions in the absence of noise, including accurately determining species with unusual stoichiometry, such as H 2 G 5 . Application to four experimental datasets shows the algorithm is robust to experimental errors for a variety of chemical systems and binding models. This algorithm will facilitate the discovery of complex binding models, increase efficiency in titration analysis, and avert incorrect stoichiometry models, thereby improving the reliability of binding constant information in spectrophotometric titrations.
Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classification and topic modeling tasks; however, most methods that can perform both do not allow for guidance of the topics or features. In this paper, we propose a novel method, namely Guided Semi-Supervised Non-negative Matrix Factorization (GSSNMF), that performs both classification and topic modeling by incorporating supervision from both pre-assigned document class labels and user-designed seed words. We test the performance of this method on legal documents provided by the California Innocence Project and the 20 Newsgroups dataset. Our results show that the proposed method improves both classification accuracy and topic coherence in comparison to past methods such as Semi-Supervised Non-negative Matrix Factorization (SSNMF), Guided Non-negative Matrix Factorization (Guided NMF), and Topic Supervised NMF.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.