Goal: create a Machine Learning model for the purpose of predicting score outcomes on any CollegeBoard AP exam given that they have taken the course at school.

The target users are the set of all US high school students. They will need to be able to enter their information through the Frontend UI.

The data

Categories of factors to be taken into consideration within the ML Algorithm:

Personal details

  • Grade (9, 10, 11, 12)
  • UW GPA (0-4)

Work Experience

  • What fields they have had internships in (from the below)

Screenshot-2024-06-02-at-10-02-03-PM.png

  • What fields they’ve held permanent job positions in (from the above)
  • Number of hours of work per week

Study & Work Habits

  • Hours of sleep per night (0-24)
  • Percentage of homework completion in the school course (0-100)
  • Cram or not (yes, no, sort of)
  • Average hours of active studying for the subject per day (0-24)
  • Weeks of active preparation prior to test (>0)
  • Average hours of video games per day during week leading up to test (0-24)

Family Background

  • What fields that at least one of their parents have high school tier or above knowledge in (refer to above list in screenshot)

Google Form to collect training data

The target respondents are high schoolers across the US who have taken at least one AP exam before. Since the model will be built to predict AP scores for students who have taken the course at school, the training data must be collected only from students in this situation.

The form has already been made: https://forms.gle/yBy4X9Try2UTSLo47

It is completely anonymous.