In this paper, we proposed a two-level NC and transmission scheme to stream multimedia content to smartphones. At the top level, the Cloud produces coded blocks of video segments using systematic network coding in GF(256) and serves coded blocks to the smartphones. At the bottom level, received content is shared among smartphones within a WiFi network. The content is transmitted in both verbatim form and coded form using systematic network coding in GF(2). In summary, we cope with the variable loss rate and bandwidth fluctuation, common problems in wireless communication, by taking advantage of network coding. Furthermore, we minimize the network coding operations on smartphones by offloading the encoding process to the Cloud and using XOR-only network coding in the WiFi network only when necessary. Finally, we utilize a light-weight distributed scheduling algorithm to eliminate the centralize management for data swarming among the nodes. We verify the proposed system through a simulator that is driven by the energy profile of a Samsung Galaxy phone. Our experimental results show that our proposed system saves up to 73% of battery usage on each phone compared to the current streaming mechanism over the 3G network, and up to 52% compared to MicroCast [7]. Due to the efficient use of CPU time, the maximum achievable throughput in our system is approaching the maximum capacity of the WiFi network, i.e., the system is capable to support higher quality streaming. Furthermore, the proposed system provides short delay and long-lasting video streaming by balancing the remaining battery lifetime among a group of cooperating nodes.