MGS 8040 Data Mining Project

 

Written Report

 

  1. Introduction – what is the goal of the project?

 

  1. Data
    1. Source, variables (put data dictionary in appendix)
    2. Sample Size
    3. Data Preparation (aggregation, variable creation, data cleaning)

 

  1. Methodology
    1. Example of crosstabs, dummy creation
    2. Regression (discrim/logistic)

 

  1. Results
    1. Show scorecard (results of final regression in plain English)
    2. (put actual regression results – the 1st and the last regression only in appendix)
    3. KS test results for training and validation samples.
    4. Discussion of improvement in performance that client will get from your model.
  2. Implementation
    1. Discuss how the client should implement your model – what cutoff scores you recommend, what strategies go with the cutoffs.

 

  1. Monitoring Reports
    1. Explain how the client should monitor the performance of the model after implementation.

 

  1. Project flow diagram: Include a diagram that shows all the files created, the programs used to create them, and other actions taken using those files. This is for your benefit rather than the client’s.

 

Oral Presentation

 

Since we are all following similar methodology, there is no need to discuss that in your presentation. Present only the results – the scorecard and the KS tests. Ten minutes should be sufficient.

 

 

Sample Scorecard

 

 

Variable

Intervals

Points

0

Intercept

(All applicants start with this score)

 

459

1

Customer Age

 

Missing

18-25

26-40

41-50

51+

0

-25

0

31

64

2

Number of Revolving Trades

No Record

Inqs only

Pub Rec only

Inqs and Pub rec only

Missing

0-3

4-9

10-13

14-20

21+

0

0

0

0

0

-44

0

35

55

35

3

Retail Value

$ 0- 1000

1001 – 5000

 

Etc…

 

4

And so on …