High-throughput (HTP) material design is an emerging field and has been proved to be powerful in the prediction of novel functional materials. In this work, an HTP effort has been carried out for thermoelectric chalcogenides with diamond-like structures on the newly established Materials Informatics Platform (MIP). Specifically, the relaxation time is evaluated by a reliable yet efficient method, which greatly improves the accuracy of HTP electrical transport calculations. The results show that all the compounds may have power factors over 10 μW/cm·K if fully optimized. A new series of diamond-like chalcogenides with an atomic ratio of 1:2:4 possess relatively higher electrical transport properties among all the compounds investigated. One particular compound, CdInTe, and its variations have been verified experimentally with a peak ZT over 1.0. Further analysis reveals the existence of general conductive networks and the similar Pisarenko relations under the same anion sublattice, and the transport distribution function is found to be a good indicator for the power factors for the compounds investigated. This work demonstrates a successful case study in HTP material screening.
Textual representations play an important role in the field of natural language processing (NLP). The efficiency of NLP tasks, such as text comprehension and information extraction, can be significantly improved with proper textual representations. As neural networks are gradually applied to learn the representation of words and phrases, fairly efficient models of learning short text representations have been developed, such as the continuous bag of words (CBOW) and skip-gram models, and they have been extensively employed in a variety of NLP tasks. Because of the complex structure generated by the longer text lengths, such as sentences, algorithms appropriate for learning short textual representations are not applicable for learning long textual representations. One method of learning long textual representations is the Long Short-Term Memory (LSTM) network, which is suitable for processing sequences. However, the standard LSTM does not adequately address the primary sentence structure (subject, predicate and object), which is an important factor for producing appropriate sentence representations. To resolve this issue, this paper proposes the dependency-based LSTM model (D-LSTM). The D-LSTM divides a sentence representation into two parts: a basic component and a supporting component. The D-LSTM uses a pre-trained dependency parser to obtain the primary sentence information and generate supporting components, and it also uses a standard LSTM model to generate the basic sentence components. A weight factor that can adjust the ratio of the basic and supporting components in a sentence is introduced to generate the sentence representation. Compared with the representation learned by the standard LSTM, the sentence representation learned by the D-LSTM contains a greater amount of useful information. The experimental results show that the D-LSTM is superior to the standard LSTM for sentences involving compositional knowledge (SICK) data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.