Machine Learning Methods for feature selection and prediction applied to large scale genetics data