Dissolved organic matter (DOM) is a complex mixture of thousands of natural molecules that undergo constant transformation in the environment, such as sunlight induced photochemical reactions. Despite molecular level resolution from ultrahigh resolution mass spectrometry (UHRMS), trends of mass peak intensities are currently the only way to follow photochemically induced molecular changes in DOM. Many real-world relationships and temporal processes can be intuitively modeled using graph data structures (networks). Graphs enhance the potential and value of AI applications by adding context and interconnections allowing the uncovering of hidden or unknown relationships in data sets. We use a temporal graph model and link prediction to identify transformations of DOM molecules in a photo-oxidation experiment. Our link prediction algorithm simultaneously considers educt removal and product formation for molecules linked by predefined transformation units (oxidation, decarboxylation, etc.). The transformations are further weighted by the extent of intensity change and clustered on the graph structure to identify groups of similar reactivity. The temporal graph is capable of identifying relevant molecules subject to similar reactions and enabling to study their time course. Our approach overcomes previous data evaluation limitations for mechanistic studies of DOM and leverages the potential of temporal graphs to study DOM reactivity by UHRMS.
<p>Dissolved organic matter (DOM) is a complex mixture of thousands of natural molecules that undergo constant transformation in the environment, such as sunlight induced, photochemical reactions. Despite molecular level resolution obtained by ultra-high resolution mass spectrometry, the mechanistic understanding of DOM transformations is still hampered due to a multitude of simultaneous reactions. Temporal trends of mass peak intensities are currently the only way to follow photochemical induced molecular changes in DOM, but are often limited by low temporal resolution or the necessity to apply monotonic regression models. Here, we present a novel computational approach using a temporal graph (a temporal molecular network) to model the transformation of DOM molecules in a photolysis experiment by employing a predefined set of basic molecular transformation units (like oxidation, decarboxylation, etc.). The new algorithm focuses on the temporal changes of mass peak intensities of molecular formulas by simultaneously considering educt removal and product formation for molecules linked by a transformation unit (e.g. -CO2). The transformations and molecules are further weighted by the extent of intensity change and grouped by unsupervised machine learning algorithms to find clusters of similar reactivity. The temporal graph thus allows to simultaneously identify relevant molecules subject to similar reactions and to study their time course. The graph can be constructed for low and high temporal resolution data without presumptions on reaction kinetics and with different sets of transformation units according to the experimental design. Our approach overcomes previous data evaluation limitations for mechanistic studies of DOM transformation and leverages the potential of temporal graphs to study the reactivity of complex DOM by means of ultra-high resolution mass spectrometry.</p>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.