Machine Learning Engineering

BDA-602 - Machine Learning Engineering

NULL of you are in the class
~5 minutes per presentation (rapid mode!)
- Tuesday, May 9th - Finals Day
  - All have to go in one day 😞

Discuss what you wanted to predict. What's in the source dataset?
- Did you have to construct this predictor?
- Why do you want to predict this?
Data conversions
- Discuss data conversions: Cleaned / Organized / ETL (Extract, Transform, Load) procedure
- Mistakes in the data?
  - What was wrong?
  - How did you fix them?

Generate a ton of features to inspect (I'm not giving a number out intentionally!)
You've built tools to analyze features: use them!
- Variable/Feature Importance
  - p-values & t-scores
  - Difference with mean of response
    - Plots
    - Rankings
  - Random Forest
- Brute force variable combinations
  - Try to see if other variable combinations exist that you didn't think of
    - Plots
    - Rankings
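
As a reminder of what the rankings above are built from, here is a minimal sketch of a weighted "difference with mean of response" score, computed for single features and then brute-forced over every pair. It uses one common formulation (equal-width bins, with each bin's squared difference weighted by its share of the rows); your own tools may bin or weight differently. The feature names `a`, `b`, `c`, the synthetic data, and the helpers `bin_index` and `weighted_diff` are all made up for illustration.

```python
import random
from itertools import combinations
from statistics import mean


def bin_index(values, n_bins=4):
    """Assign each value to one of n_bins equal-width bins."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0  # constant feature: everything lands in bin 0
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def weighted_diff(cell_ids, y):
    """Population-weighted squared difference between each cell's
    mean response and the overall mean response."""
    pop_mean = mean(y)
    cells = {}
    for cell, yi in zip(cell_ids, y):
        cells.setdefault(cell, []).append(yi)
    return sum(len(ys) / len(y) * (mean(ys) - pop_mean) ** 2 for ys in cells.values())


# Synthetic data (an assumption for illustration): the response depends only
# on the *combination* of a and b, never on a or b alone; c is pure noise.
rng = random.Random(0)
n = 2000
features = {name: [rng.uniform(-1, 1) for _ in range(n)] for name in ("a", "b", "c")}
response = [1 if a * b > 0 else 0 for a, b in zip(features["a"], features["b"])]

binned = {name: bin_index(vals) for name, vals in features.items()}

# Score each feature on its own
single_scores = {name: weighted_diff(ids, response) for name, ids in binned.items()}

# Brute force: score every pair of features on their joint 2-D bins
pair_scores = {
    (f1, f2): weighted_diff(list(zip(binned[f1], binned[f2])), response)
    for f1, f2 in combinations(binned, 2)
}

single_ranking = sorted(single_scores, key=single_scores.get, reverse=True)
pair_ranking = sorted(pair_scores, key=pair_scores.get, reverse=True)
print("single:", single_ranking)
print("pairs: ", pair_ranking)
```

In this toy setup none of the features looks useful on its own, but the pair of `a` and `b` scores far above the other pairs, which is exactly the kind of combination the brute force pass is meant to surface.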

Compare the models you built against one another
- We went over a ton of evaluation techniques
- Don't forget to train/test split
Which one is the best one?
- Why?
Show off the performance metrics on the best model

You will be presenting slides for your presentation
- Don't need to be fancy
- Google Slides / PowerPoint / whatever you want
- I don't need a copy of it
  - Your Wiki will have the same content (and hopefully more!)
I know the presentation is just around the corner
- Your written report (wiki) and code can reflect more work than you presented
  - I won't hold it against you
  - It'll probably make me happy
I would highly suggest putting links in your wiki to techniques you used
- If you used a cool modeling technique I didn't cover
  - Put a hyperlink to a page describing it

The final report should be of sufficient length to cover your modeling process in enough depth that someone without access to your code could recreate your work
- If you come up with a novel idea: Explain it in depth
- Make sure the report has graphs backing you up
This isn't an English class
- I couldn't care less how strong your command of the English language is
- As long as I can understand what you're doing
- I don't care about spelling errors
  - My slides are probably riddled with them