The purpose of automatic text summarising technology is to condense a given text while properly portraying the main information in the original text in a summary. To present generative text summarising approaches, on the other hand, restructure the original language and introduce new words when constructing summary sentences, which can easily lead to incoherence and poor readability. This research proposes a XAI (explainable artificial intelligence)-based Reinforcement Learning-based Text Summarization of Social IoT-Based Content using Reinforcement Learning. Furthermore, standard supervised training based on labelled data to improve the coherence of summary sentences has substantial data costs, which restricts practical applications. In order to do this, a ground-truth-dependent text summarization (generation) model (XAI-RL) is presented for coherence augmentation. On the one hand, based on the encoding result of the original text, a sentence extraction identifier is generated, and the screening process of the vital information of the original text is described. Following the establishment of the overall benefits of the two types of abstract writings, the self-judgment approach gradient assists the model in learning crucial sentence selection and decoding the selected key phrases, resulting in a summary text with high sentence coherence and good content quality. Experiments show that the proposed model's summary content index surpasses text summarising ways overall, even when there is no pre-annotated summary ground-truth; information redundancy, lexical originality, and abstract perplexity also outperform the current methods.