In this paper, we proposed an MVC decoder architecture based on a parallel combination of a Cortex-A8 processor and a GPU (graphics processing unit). The processor performs the basic operations, while the motion compensation (MC) feedback loop of the decoder is moved to the GPU to make decoding more efficient. The experimental results show that, compared to a general implementation, processing this particular task in parallel in an embedded system reconstructs the target images with higher quality, shorter processing time, and lower energy consumption, while maintaining almost the same compression performance.