Recently unsupervised learning of depth from videos has made remarkable progress and the results are comparable to fully supervised methods in outdoor scenes like KITTI. However, there still exist great challenges when directly applying this technology in indoor environments, e.g., large areas of non-texture regions like white wall, more complex ego-motion of handheld camera, transparent glasses and shiny objects. To overcome these problems, we propose a new optical-flow based training paradigm which reduces the difficulty of unsupervised learning by providing a clearer training target and handles the non-texture regions. Our experimental evaluation demonstrates that the result of our method is comparable to fully supervised methods on the NYU Depth V2 benchmark. To the best of our knowledge, this is the first quantitative result of purely unsupervised learning method reported on indoor datasets.
A new efficient biorthogonal wavelet analysis based on the principal square root of subdivision is proposed in the paper by using the lifting scheme. Since the principal square root of subdivision is of the slowest topological refinement among the traditional triangular subdivisions, the multiresolution analysis based on the principal square root of subdivision is more balanced than the existing wavelet analyses on triangular meshes, and accordingly offers more levels of detail for processing polygonal models. In order to optimize the multiresolution analysis process, the new wavelets, no matter whether they are interior or on boundaries, are orthogonalized with the local scaling functions based on a discrete inner product with subdivision masks. Because the wavelet analysis and synthesis algorithms are actually composed of a series of local lifting operations, they can be performed in linear time. The experiments demonstrate the efficiency and stability of the wavelet analysis for both closed and open triangular meshes with principal square root of subdivision connectivity. The principal square root of -subdivision-based biorthogonal wavelets can be used in many applications such as progressive transmission, shape approximation, multiresolution editing and rendering of 3D geometric models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.