It has been shown that the characteristics of a singer’s voice can be extracted from music via vocal segment detection followed by solo vocal signal modeling which propose a clustering algorithm that integrates features from both lyrics and acoustic data sources to perform bimodal learning