Some time ago I wrote an article giving a short comparison of different tools in the Hadoop ecosystem. You can visit it here, if you wish. It's not an in-depth comparison, but a short intro to each of these tools that can help you get started. (Just adding to my answer; no self-promotion intended.)