Slightly off-axis digital holography is proposed using transmission grating to obtain quantitative phase distribution. The experimental device is based on an improved 4f optical system in which a two-window input plane is used to form the object beam and reference beam. Then, the two beams are diffracted into multiple orders by the transmission grating placed at the Fourier plane. By applying a modified Michelson configuration, the interference patterns can be generated by the object and reference beams from different diffraction orders. After translating the grating, a random phase shift can be introduced to the hologram. To demonstrate the feasibility of our method, both thick and thin phase specimens are retrieved using two carrier phase-shifting holograms. Furthermore, we use the phase reconstruction algorithm based on the NVIDIA CUDA programming model to reduce the retrieval time. Meanwhile, we optimize the discrete cosine transform (DCT)-based least-squares unwrapping algorithm to unwrap the phase. By porting the entire phase reconstruction process to the graphics processing unit (GPU), the phase retrieval acceleration and execution efficiency significantly improve. To demonstrate the feasibility of our method, it is found that our method can measure the surface profiles of standard elements, such as a plano-convex cylinder lens and a microlens array, with a relative error of about 0.5%. For holograms with a different phase shift, the root-mean-square (RMS) value of the phase difference for the main imaging region is about 0.2 rad. By accelerating the phase reconstruction with GPU implementation, a speedup ratio of about 20× for the thick phase specimen and a speedup ratio of about 15× for the thin-phase specimen can be obtained for holograms with a pixel size of 1024 × 1024.