Difference between revisions of "Spring 2020 Project 4: Defeat Learners"

From Quantitative Analysis Software Courses
Jump to navigation Jump to search
Line 4: Line 4:
  
 
==Overview==
 
==Overview==
 +
 +
For this homework you will generate data that you believe will work better for one learner than another.  This will test your understanding of the strengths and weaknesses of various learners.  The two learners you should aim your datasets at are:
 +
* A decision tree learner with leaf_size = 1 (DTLearner).  Note that for testing purposes we will use our implementation of DTLearner
 +
* The LinRegLearner provided as part of the repo.
 +
 +
Your data generation should use a random number generator as part of its data generation process.  We will pass your generators a random number seed.  Whenever the seed is the same you should return exactly the same data set.  Different seeds should result in different data sets.
  
 
== Template==
 
== Template==

Revision as of 21:58, 12 January 2020

Revisions

This assignment is subject to change up until 3 weeks prior to the due date. We do not anticipate changes; any changes will be logged in this section.

Overview

For this homework you will generate data that you believe will work better for one learner than another. This will test your understanding of the strengths and weaknesses of various learners. The two learners you should aim your datasets at are:

  • A decision tree learner with leaf_size = 1 (DTLearner). Note that for testing purposes we will use our implementation of DTLearner
  • The LinRegLearner provided as part of the repo.

Your data generation should use a random number generator as part of its data generation process. We will pass your generators a random number seed. Whenever the seed is the same you should return exactly the same data set. Different seeds should result in different data sets.

Template

Tasks

What to turn in

Rubric

Report

Code

Required, Allowed & Prohibited