Difference between revisions of "Learning Algorithms for Trading"

From Quantitative Analysis Software Courses
Jump to navigation Jump to search
 
(11 intermediate revisions by 2 users not shown)
Line 1: Line 1:
==Supervised machine learning==
+
==Lesson 1: How Machine Learning is used at a hedge fund==
*Regression
+
*Overview of use and backtesting
*Classification
+
**Out of sample
*Parametric
+
**Roll forward cross validation
*Instance-based
+
*ML methods we will use in this class
*Overview: LinReg, KNN, Decision Trees, ANN
+
**Linear regression
 +
**KNN regression
 +
**(Random Forest regression considering to drop)
 +
**Q-Learning
 +
*Supervised ML (intent is that the treatment here is light)
 +
**Use: Regression
 +
**Use: Classification
 +
**Model type: Parametric
 +
**Model type: Instance-based
 +
*Introduce the problem we will focus on in the rest of the class, namely:
 +
**Example data, will learn on over a particular year (2012)
 +
**Will test on over the next two years (2013 2014)
 +
**It will be "easy" data that has obvious patterns
 +
**You will create trades.txt and run them through your backtester
  
==How ML fits into the computing at a hedge fund==
+
==Lesson 2: Q-Learning and Dyna==
 
*Long/short
 
*Long/short
 +
*Overview of use and backtesting
 +
**Out of sample
 +
**Roll forward cross validation
 +
*Supervised ML (intent is that the treatment here is light)
 +
**Use: Regression
 +
**Use: Classification
 +
**Model type: Parametric
 +
**Model type: Instance-based
 +
*Reinforcement Learning (light)
 +
**Use: Find a policy
 +
*Introduce the problem we will focus on in the rest of the class, namely:
 +
**Example data, will learn on over a particular year (2012)
 +
**Will test on over the next two years (2013 2014)
 +
**It will be "easy" data that has obvious patterns
 +
**You will create trades.txt and run them through your backtester
 +
*Overview: LinReg, KNN, Decision Trees, Q-Learning
  
==Time series prediction as an ML problem==
+
==Lesson 3: Time series prediction as an ML problem==
 
[note: need to create fake stock data that has embedded patterns]
 
[note: need to create fake stock data that has embedded patterns]
  
==Learner APIs==
+
==Lesson 4: Learner APIs==
  
==Linear regression==
+
==Lesson 5: Linear regression==
  
==KNN==
+
==Lesson 6: KNN==
  
==Assessing a learning algorithm==
+
==Lesson 7: Assessing a learning algorithm==
 
*Now that we have two, (linreg & KNN), let's compare them
 
*Now that we have two, (linreg & KNN), let's compare them
 
*RMS error
 
*RMS error
Line 24: Line 53:
 
*Corrcoef
 
*Corrcoef
  
==Overfitting==
+
==Lesson 8: Overfitting==
  
==Decision trees==
+
==Lesson 9: Decision trees==
  
==Ensemble learners==
+
==Lesson 10: Ensemble learners & bagging==
  
==Random trees & forests==
+
==Lesson 11: Random trees & forests==

Latest revision as of 12:37, 14 July 2015

Lesson 1: How Machine Learning is used at a hedge fund

  • Overview of use and backtesting
    • Out of sample
    • Roll forward cross validation
  • ML methods we will use in this class
    • Linear regression
    • KNN regression
    • (Random Forest regression considering to drop)
    • Q-Learning
  • Supervised ML (intent is that the treatment here is light)
    • Use: Regression
    • Use: Classification
    • Model type: Parametric
    • Model type: Instance-based
  • Introduce the problem we will focus on in the rest of the class, namely:
    • Example data, will learn on over a particular year (2012)
    • Will test on over the next two years (2013 2014)
    • It will be "easy" data that has obvious patterns
    • You will create trades.txt and run them through your backtester

Lesson 2: Q-Learning and Dyna

  • Long/short
  • Overview of use and backtesting
    • Out of sample
    • Roll forward cross validation
  • Supervised ML (intent is that the treatment here is light)
    • Use: Regression
    • Use: Classification
    • Model type: Parametric
    • Model type: Instance-based
  • Reinforcement Learning (light)
    • Use: Find a policy
  • Introduce the problem we will focus on in the rest of the class, namely:
    • Example data, will learn on over a particular year (2012)
    • Will test on over the next two years (2013 2014)
    • It will be "easy" data that has obvious patterns
    • You will create trades.txt and run them through your backtester
  • Overview: LinReg, KNN, Decision Trees, Q-Learning

Lesson 3: Time series prediction as an ML problem

[note: need to create fake stock data that has embedded patterns]

Lesson 4: Learner APIs

Lesson 5: Linear regression

Lesson 6: KNN

Lesson 7: Assessing a learning algorithm

  • Now that we have two, (linreg & KNN), let's compare them
  • RMS error
  • Scatterplot predict vs actual
  • Corrcoef

Lesson 8: Overfitting

Lesson 9: Decision trees

Lesson 10: Ensemble learners & bagging

Lesson 11: Random trees & forests