Business & Finance 2023 Data Mining
2023 You will be using the Real Estate data set to build a model
You will be using the Real Estate data set to build a model to predict what a house should sell for. This model will be used by a real estate agency to help their clients understand what their house should sell for so they can make an educated decision about listing price. Secondarily, the model will be used by a home contractor. S/he would like to be able to tell clients the selling value of adding an additional bathroom.
Last week you completed the first 3 steps in the data mining process. For this assignment you will be completing the last two steps: Model and Assess.
- Briefly recap the dummy coding and missing value decisions you made in Part 1.
- Prepare a professionally formatted correlation table in a separate tab/worksheet.
- What is multicollinearity? Do you need to address it? If so, how?
- Discuss which variables have the best correlation with price
- Run a regression and discuss the results
- Is the model significant?
- How much of rice is explained by the independent variables?
- What is the model?
- Are all of the independent variables significant? Discuss.
- What factors have the largest impact on home selling price?
- How much does a bathroom add to the value of a home?
- Run another regression (change some independent variables or change the sample of data) and discuss the results
- You have been provided with the listing info and selling price on 2 houses that were not in your original sample. Please use both models to predict the selling price for these 2 homes.
- How accurate is your model? Please calculate your accuracy percentage as (predicted price – actual price)/actual price
- Which model is better? Why?
- If you had the time, money, expertise, etc. what would you have done differently and why?
As always, please see the rubric for all of the assignment requirements and relative weights.
I intend for you to use the data that you prepared in Part 1 for Part 2 of the project. If you have concerns about using your coded data set, please contact me to discuss.
This assignment will have substantially more discussion than prior assignments. Rather than putting the discussion in Excel, it is preferred that you prepare this as a Word (or PDF) document and include the relevant Excel output as figures within the Word document. You should submit the Excel file that contains your work, however, I will only look at the file if necessary. All work to be graded should be in the Word (or PDF) file.
We give our students 100% satisfaction for their assignment, which is one of the most important reasons students prefer us from other helpers. Our professional group and planners have more than ten years of rich experience. The only reason that our inception days, we have helped more than 100000 students with their assignments successfully. Our expert’s group have more than 2200 professionals of different topics, and that not all; we get more than 300 jobs every day more than 90% of the assignment get the conversion for payment.