Tabular data and images have been used from machine learning models as two diverse types of inputs, in order to perform path loss predictions in urban areas. Different types of models are applied on these distinct modes of input information. The work at hand tries to incorporate both modes of input data within a single prediction model. It therefore manipulates and transforms the vectors of tabular data into images. Each feature of the tabular data vector is spread into several pixels, corresponding to the calculated importance of the particular feature. The resulting synthetic images are then fused with images representing selected regions of the area's map. Compound pseudoimages, having channels of both map-based and tabular data-based images, are then being used as inputs for a Convolutional Neural Network (CNN), which predicts the path loss value at a specific point of the area of interest. The results are clearly better than those obtained from models based on a single mode of input data, as well as from the results produced by other bimodal-input approaches. This approach could be applied for path loss prediction in urban environments for several state-of-art wireless networks like 5G and Internet of Things (IoT).