BackgroundWhile a large number of well-known knowledge bases (KBs) in life science have been published as Linked Open Data, there are few KBs in Chinese. However, KBs in Chinese are necessary when we want to automatically process and analyze electronic medical records (EMRs) in Chinese. Of all, the symptom KB in Chinese is the most seriously in need, since symptoms are the starting point of clinical diagnosis.ResultsWe publish a public KB of symptoms in Chinese, including symptoms, departments, diseases, medicines, and examinations as well as relations between symptoms and the above related entities. To the best of our knowledge, there is no such KB focusing on symptoms in Chinese, and the KB is an important supplement to existing medical resources. Our KB is constructed by fusing data automatically extracted from eight mainstream healthcare websites, three Chinese encyclopedia sites, and symptoms extracted from a larger number of EMRs as supplements.MethodsFirstly, we design data schema manually by reference to the Unified Medical Language System (UMLS). Secondly, we extract entities from eight mainstream healthcare websites, which are fed as seeds to train a multi-class classifier and classify entities from encyclopedia sites and train a Conditional Random Field (CRF) model to extract symptoms from EMRs. Thirdly, we fuse data to solve the large-scale duplication between different data sources according to entity type alignment, entity mapping, and attribute mapping. Finally, we link our KB to UMLS to investigate similarities and differences between symptoms in Chinese and English.ConclusionsAs a result, the KB has more than 26,000 distinct symptoms in Chinese including 3968 symptoms in traditional Chinese medicine and 1029 synonym pairs for symptoms. The KB also includes concepts such as diseases and medicines as well as relations between symptoms and the above related entities. We also link our KB to the Unified Medical Language System and analyze the differences between symptoms in the two KBs. We released the KB as Linked Open Data and a demo at https://datahub.io/dataset/symptoms-in-chinese.
Using the density functional theory (DFT) method at the B3LYP /6−311G (D) level, we studied how silicon doping affects the geometrical structure, stability, and electronic and spectral properties of magnesium clusters. The stable isomers of SiMg
n (n = 1‐12) clusters were calculated by searching numerous initial configurations using the CALYPSO program. The geometrical structure optimization shows that most stable SiMg
n (n = 3‐12) clusters are three‐dimensional. In addition, geometrical structure growth patterns show that some structures of SiMg
n clusters can be directly formed by replacing one Mg atom in the corresponding Mg
n + 1 cluster with one silicon atom, such as SiMg8 and Mg9 clusters. The stability of SiMg
n clusters is analyzed by calculating the average binding energy, fragmentation energy, and second‐order energy difference. The results show that SiMg
n clusters with n = 5 and 8 are more stable than others. MO contents analysis show that the Si 3p‐orbitals and Mg 3s‐orbital are mainly responsible for the stability of these two clusters. The results of the natural charge population (NCP) and natural electronic configure (NEC) analysis of the electronic properties reveal that the charges in SiMgn (n = 1‐12) clusters transfer from magnesium atoms to silicon frame, and electronic charge distributions are primarily governed by s‐ and p‐orbital interactions. In addition, the Vertical ionization potential (VIP), vertical electron affinity (VEA), and chemical hardness of ground sates of SiMg
n (n = 1‐12) clusters were studied in detail and compared with the experimental results. The conclusions show that the chemical hardness of most SiMg
n clusters are lower than that of pure Mg
n + 1 (n = 1‐12) clusters, except for n = 1 and 8. This indicates that the doping of silicon atom can always reduce the chemical hardness of pure magnesium clusters. Finally, the infrared and Raman spectral properties of SiMg5 and SiMg8 clusters were calculated and discussed in detail.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.