Accurate and ubiquitous localization is crucial for a variety of applications such as logistics, navigation, intelligent transport, monitoring, and control. Exploiting mmWave signals in 5G and Beyond 5G systems can provide accurate localization with limited infrastructure. We consider the single base station localization problem and extend it to 3D position and 3D orientation estimation of an unsynchronized multi-antenna user, using downlink MIMO-OFDM signals. Through a Fisher information analysis, we show that the problem is often identifiable, provided that there is at least one additional multipath component, even if the position of corresponding incidence point is a priori unknown. Subsequently, we pose a maximum likelihood (ML) estimation problem, to jointly estimate the 3D position and 3D orientation of the user as well as several nuisance parameters (the user clock offset and the positions of incidence points corresponding to the multipath). The ML problem is a highdimensional non-convex optimization problem over a product of Euclidean and Riemannian manifolds. To avoid complex exhaustive search procedures, we propose a geometric initial estimate of all parameters, which reduces the problem to a 1dimensional search over a finite interval. Numerical results show the efficiency of the proposed ad-hoc estimation, whose gap to the Cramér-Rao bound (CRB) is tightened using ML estimation.