A data mining approach to forecast students’ career placement probabilities and recommendations in the programming field
Abstract
The career opportunities in computer programming are vast and rapidly increasing. Skilled software engineers, programmers, and developers are vigorously in demand worldwide. The capability to forecast a student's future career can be helpful in a wide variety of pedagogical practices. Data mining is becoming a more robust tool for analysis and forecasting. Therefore, to forecast career placement probabilities in the programming field, data mining classification and forecast techniques are used in this study to facilitate prospective students to make sensible career decisions. To achieve this objective, passed-out graduates' data is utilized, which comprises features like graduates' educational attainments in pre-university grades, i.e. grades of matriculation and intermediate, programming courses taught in early semesters along with the Cumulative Grade Point Average (CGPA) with the internship experience, gender, and family demographic information. Various multi-way Classification Trees are generated, which could help students to choose a branch with high career placement probabilities. From historical data, the Classification Trees have determined whether the branch is 'Good', 'Satisfactory', or 'Poor' based on the given information. The experimental findings indicate that all the features significantly influence the career placement probabilities in the programming field.