Automatically generated depth maps from video are usually not aligned with the objects in the original image and produced at lower resolutions. We propose to apply a jointbilateral filter to smoothen the depth map within the objects and upsample it to the original image resolution while keeping object edges in the depth map aligned with the original image. We performed algorithmic and DSP specific optimizations to achieve the real-time implementation on an embedded DSP processor, TM3270, while preserving high quality results. Upsampling 90x72@50Hz depth maps to 720x576@50Hz, requires 69% to 86% of the TM3270 cycle budget.