We propose a novel neural network architecture based on dual quaternions which allow for a compact representation of information with a main focus on describing rigid body movements. After introducing the underlying dual quaternion math, we derive dual quaternion valued neural network layer which are generally applickable to all sorts of problems which can benefit from a mathematical description in dual quaternion space. To cover the dynamic behavior inherent to rigid body movements, we propose recurrent architectures in the neural network. To further model the rigid bodies interactions efficiently, we incorporate a novel attention mechanism employing dual quaternion algebra. The introduced architecture is trainable by means of gradient based algorithms. We apply our approach to a parcel prediction problem where a rigid body with an initial position, orientation, velocity and angular velocity moves through a fixed simulation environment which exhibits rich interactions between the parcel and the boundaries. There we can show that the dual quaternion valued models outperform their counterparts operation on the real numbers, confirming the successful introduction of an inductive bias through the usage of dual quaternion math. Furthermore, we used an advantageous custom data augmentation technique specifically tailored for the usage with our dual quaternion valued input data.
INDEX TERMSdual quaternion attention, dual duaternion neural networks, hypercomplex neural networks, rigid body dynamics