Spiking Neural Networks (SNNs) stand as the third generation of Artificial Neural Networks (ANNs), mirroring the functionality of the mammalian brain more closely than their predecessors. Their computational units, spiking neurons, characterized by Ordinary Differential Equations (ODEs), allow for dynamic system representation, with spikes serving as the medium for asynchronous communication among neurons. Because of their inherent capability of capturing input dynamics, SNNs are promising as deep networks in Reinforcement Learning (RL) tasks. Deep RL (DRL), and in particular Proximal Policy Optimization (PPO) has been proven to be valuable for for training robots due to the difficulty in creating comprehensive offline datasets that capture all environmental features.DRL combined with SNNs offers a compelling solution for tasks characterized by temporal complexity. In this work, we study the effectiveness of SNNs on DRL tasks leveraging a novel framework we developed for training SNNs with PPO in Isaac Gym simulator implemented using the SKRL library. Thanks to its significantly faster training speed compared to available SNN DRL tools, the framework allowed us to: i) Perform an effective exploration of SNN configurations for DRL robotic tasks; ii) Compare SNNs and ANNs for various network configurations such as the number of layers and neurons. Our work demonstrates that in DRL tasks the optimal SNN topology has a lower number of layers than ANN and highlights how in complex tasks, such as Ant, SNNs fail to leverage deeper layers. Finally, we applied the best topology identified thanks to our Isaac Gym-based framework on Ant-v4 benchmark running on MuJoCo simulator, exhibiting a performance improvement by a factor of 4.4x over the state-of-art SNN trained on the same task.