In an era where social media platforms burgeon with diverse content, effective content moderation becomes imperative to filter harmful materials. Traditional methods often grapple with the dual challenges of accuracy and computational efficiency levels. These conventional approaches typically rely on either text-based or image-based analysis, neglecting the complex interplay of multimodal content prevalent in social media scenarios. This limitation leads to suboptimal content filtering, often missing contextually nuanced or visually deceptive harmful content sets. Addressing these challenges, the proposed work introduces an innovative, hybrid deep learning model that integrates Convolutional Neural Networks (CNNs) and Transformers for nuanced visual content extraction from social media posts. CNNs adeptly handle image features, while Transformers enrich the analysis with contextual textual understanding, thereby enhancing accuracy in identifying harmful contents. Additionally, a Bi-directional Attention Mechanism (BAM) is employed for efficient text-visual fusion, dynamically prioritizing relevant information and reducing computational loads. The fusion process is further refined using Genetic Algorithms (GAs) for hyperparameter optimization, streamlining the model's performance levels. The model's robustness is demonstrated through its ability to discern complex intra and inter-modal relationships within social media content, facilitated by Graph Neural Networks (GNNs). Moreover, the model's scalability and deployment efficiency on social media platforms are ensured through quantization and pruning techniques, balancing accuracy with computational practicality levels. Empirical testing on datasets from Google, Facebook, and Kaggle reveals the model's superior performance over existing methods, with notable improvements in precision, accuracy, recall, AUC, specificity, and response delay in detecting harmful contents. The model also excels in preempting potential harmful content posters, offering enhanced pre-emption metrics.