Grid computing is nothing but the computing environment in which the resources are shared by multiple systems to obtain a goal. In day today life performance analysis of very large data sets in the computing environment is necessary in many applications. The distributed grids are formed from the computing resources of multiple individuals or multiple administrative domains. This can make easier to perform commercial transactions. This paper is an introduction to the Grid infrastructure and the potential for machine learning tasks.