Search the whole station

# 应用数学计算代写 数学计算代写 数据分析代写 数学作业代写

786

## Fall 2021

This project involves predicting what happens to a mortgage loans that have been purchased by FNMA during the 3rd quarter of 2001.

The data has 30 predictor variables described in the file PredictorsDescription.csv”. All 30 of these variables are known to FNMA when the loans were acquired. Recently obtained performance information about all of the loans has become available relatively recently (in around 2021). Some of the loans performed as lender would expect: monthly payments made on time up until performance data stopped being collected. Other loans were problematic in that the holder of the mortgage may have foreclosed (payments stopped, home was taken by the lender, etc..) In many instances, the holder of the mortgage sold the home and paid the outstanding balance owed before the mortgage term (15 years, 30 years, etc..)

Every loan has a 12 digit loan id LID.

### Two performance variables are provided in the training set for each loan: 应用数学计算代写

FORCLOSED – a Boolean variable that indicates whether or not foreclosure took place

NMONTHS – a variable giving the number of months that the mortgage remained on the books by the lender

A large data set has been randomly split into

TrainingData.csv – this one has all 30 predictors, a loan ID (LID), and the two performance variables (FORCLOSED, NMONTHS ) for 588,490 loans.

TestDataYremoved.csv – this one has only the 30 predictors and loan ID (LID) for a different set of loans (196,164 of them).

Your task is to use the 31 predictors to make predictions about performance for the loans in the test set.

### Specifically for each loan in the test set:

1) Predict whether the loan will foreclose or not by supplying a Boolean value (True for foreclosure, False for non-foreclosure)

2) Predict the accuracy of your predictions in 1)

3) Predict NMONTHS – this should be a number but needn’t be an integer

4) Predict the accuracy of your predictions in 3).

### Important details: 应用数学计算代写

o Answering 3 questions in a Blackboard final submission assessment (see Three Questions link)

• The .csv file should have a header column and a row for every loan in the test dataset. This file should have exactly 3 columns, labeled LID, FORCLOSED, NMONTHS and the number of rows not including the header should be exactly 1+196,164 = 196,165.
• The final submission questions appear in Blackboard (Three questions) as an assessment where you are asked to provide 3 numbers:

o the true positive rate (TPR) for your predictions in 1) – see definitions below

o and the false positive rate (FPR) for your predictions in 1) – see definitions below

o the MAD (mean absolute deviation) between predicted and true values of NMONTHS i.e. the average of the absolute difference between your predicted value of NMONTHS and the true value of NMONTHS, averaged over all loans in the test dataset.

• The quality of your predictions in 1) will be measured by taking the difference TPR-FPR. You should try to make this as large as you can.
• The quality of your predictions in 3) will be measured by how small the MAD is for your predictions.

### Some definitions: 应用数学计算代写

For foreclosure prediction (since I know the true status of each loan) I can fill a 2×2 table with counts of where each loan falls in the following table based on its true and predicted foreclosure status:

So once you submit your predictions the four numbers in the table should add up to 196,164. I can then calculate the quantities

TPR=TP/(TP+FN)

FPR=FP/(FP+TN)

and you should be striving to make TPR high and FPR low.

The prev: The next:

### Related recommendations

• #### 统计数据分析作业代写 Statistics代写 统计作业代写 数据分析代写

753

Statistics 统计数据分析作业代写 Background: Exoplanets are planets which orbit other stars, like the Earth orbits the Sun. Exoplanet discovery is currently an exciting and Background: 统计...

View details
• #### 数学概率作业代写 数学概率代写 数学作业代写 概率作业代写

357

4. Probability on finite sample spaces. 数学概率作业代写 1. Are the following events A, B ⊂ Ωroulette independent? Find P(A|B) and P(B|A) in each case. a) A = Red, B = Even, 1. Are the fol...

View details
• #### 离散数学作业代写 MACM 201代写 D100 AND D200代写

1295

MACM 201 - D100 AND D200 ASSIGNMENT #8 离散数学作业代写 Instructions Answer all questions on paper or a tablet using your own handwriting. Put your name, student ID number and page number at th...

View details
• #### 半群理论作业代做 MT5863代写 半群理论代写 数学作业代写

346

MT5863 Semigroup theory: Problem sheet 3 半群理论作业代做 Binary relations and equivalences 3-1. Let X = {1, 2, 3, 4, 5, 6}, let ρ be the equivalence relation on X with equivalence classes {1,...

View details
1