Project 1: Retail Performance Monitoring: Create monitoring reports for a marketing campaign:
To predict customer response rates:
- Check and manage the data, to find the overall response rate from 193,728 customers ;
- Create missing value indicators for inputs that have missing values. (By SAS: Array);
- Impute missing value to a new dataset; (By SAS: Proc Stdize);
- Split the imputed dataset into training and test datasets. Use 70% of the data for each dataset role. Stratify on the target variable; (By SAS: Proc Surveyselect);
- Build a logistic regression model with the FAST Stepwise method;
- Achieved the ROC is 60.36%, and Somers' D is 20.7%;
- Finally, to test and make sure there is no over-fitting between training and test datasets;
- Project. 2: A car insurance marketing campaign.
- Check and import the data, to get the total response rate; (By SAS: Proc Freq) from 9,134 customers;
- Break down the customer base into multiple segments and compute KPIs for individual segments;
- Analyze how these response varies by different EmploymentStatus groups, MaritalStatus groups, and SalesChannel groups;
- Identify the most important factors which affect response rate;
- Converted the response into numeric by creating a variable; And check missing rate for numerical variables;
- Build a logistic regression model: Split the data into train(70%) and test(30%) and check average response rate;
- Use stepwise to decide the best model, and achieved the ROC is 67.25%, and Somers' D is 34.5%;
- Finally, to test and make sure there is no over-fitting between training and test datasets;