Myostatin (MSTN), also referred to as growth and differentiation factor-8, is a protein secreted in muscle tissues. Researchers believe that its primary function is in negatively regulating muscle because a mutation in its coding region can lead to the famous double muscle trait in cattle. Muscle and adipose tissue develop from the same mesenchymal stem cells, and researchers have found that MSTN is expressed in fat tissues and plays a key role in adipogenesis. Interestingly, MSTN can exert a dual function, either inhibiting or promoting adipogenesis, according to the situation. Due to its potential function in controlling body fat mass, MSTN has attracted the interest of researchers. In this review, we explore its function in regulating adipogenesis in mammals, including preadipocytes, multipotent stem cells and fat mass.
Background
Long-chain non-coding RNA (lncRNA) is closely related to many biological activities. Since its sequence structure is similar to that of messenger RNA (mRNA), it is difficult to distinguish between the two based only on sequence biometrics. Therefore, it is particularly important to construct a model that can effectively identify lncRNA and mRNA.
Results
First, the difference in the k-mer frequency distribution between lncRNA and mRNA sequences is considered in this paper, and they are transformed into the k-mer frequency matrix. Moreover, k-mers with more species are screened by relative entropy. The classification model of the lncRNA and mRNA sequences is then proposed by inputting the k-mer frequency matrix and training the convolutional neural network. Finally, the optimal k-mer combination of the classification model is determined and compared with other machine learning methods in humans, mice and chickens. The results indicate that the proposed model has the highest classification accuracy. Furthermore, the recognition ability of this model is verified to a single sequence.
Conclusion
We established a classification model for lncRNA and mRNA based on k-mers and the convolutional neural network. The classification accuracy of the model with 1-mers, 2-mers and 3-mers was the highest, with an accuracy of 0.9872 in humans, 0.8797 in mice and 0.9963 in chickens, which is better than those of the random forest, logistic regression, decision tree and support vector machine.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.