Recall that samples are used to generate statistics, which businesses use to estimate population parameters. You have learned how to take samples from populations and use them to produce statistics. For two quantitative variables, businesses can use scatterplots and the correlation coefficient to explore a potential linear relationship. Furthermore, they can quantify the relationship in a regression equation. Housing Price Prediction Model for D.M. Pan Real Estate Company Housing Price Prediction Model for D.M. Pan Real Estate Company Prompt This assignment picks up where the Module Two assignment left off and will use components of that assignment as a foundation. You have submitted your initial analysis to the sales team at D.M. Pan Real Estate Company. You will continue your analysis of the provided Real Estate Data spreadsheet using your selected region to complete your analysis. You may refer back to the initial report you developed in the Module Two Assignment Template to con
Housing Price Prediction Model for D.M. Pan Real Estate Company
The Pattern
The simple linear regression model is used in establishing the relationship between listing price and area in square feet. The listing price assumes the dependent variable (Y), while the area in square feet assumes the independent variable (X), where the area in square feet will be used to estimate the price of real estate. The relationship between the two variables can be described as a strong positive relationship. This is evidenced by a positive gradient scatter plot of the two variables. Additionally, a linear regression of the two variables generates a positively sloped scatter plot. Both the R2 and the adjusted R2 are greater than 78%, implying that more than 78% of changes in listing prices can be explained by changes in the square feet area. Do you need help with your assignment ? Get in touch with us at eminencepapers.com.
There exist several outliers in the data set whose values are significantly larger than the rest of the values in the data set. The outliers may exist due to the strength of other factors influencing listing prices not included in the model in certain areas (Yusof & Ismael, 2012). However, the outliers align with the general direction of the slope.
If one had an 1800 square foot house, the price may be estimated using the regression as:
Price = 72930 + 89.763*1800
Price = 72930 + 161573.4 = $ 234503.4
The house can be sold at a price of $ 234503.4, where $ 72930 of the price is the fixed price irrespective of the size of the house, and $ 234503.4 of the price can be attributed to the area in square feet.
Regression Equation
The regression equation is obtained as follows:
Y = 72930 + 89.763X.
Determine r
The listing price assumes the dependent variable (Y), while the area in square feet assumes the independent variable (X), where the area in square feet will be used to estimate the price of real estate. The relationship between the two variables can be described as a strong positive relationship. This is evidenced by a positive multiple r (0.89). The r-value indicates a positive linear relationship between the variables (Salkind, 2015). Also, the gradient scatter plot of the two variables shows a strong positive correlation. The relationship is an almost strong positive relationship. The direction of the association between the two variables is determined by the correlation coefficient and the slope of the line. In this case, the slope of the line shows a positive gradient in that an increase in the explanatory variable leads to an increase in the response of another variable. Therefore, the direction of the correlation is towards 1, which is a strong positive correlation.
Examine the Slope and Intercepts
Slope:
For Y intercept= taking the regression equation, y=mx + c, the y-intercept becomes;
Y = 72930 + 89.763X.
Therefore, Y= 72930
For x-intercept, from the equation,
Y = 72930 + 89.763X.
72930=89.763X
Therefore, X = 812.473
The slope of the line from the equation can be obtained as 89.763. Considering the slope of the line, the slope implies a true indication of the slope. The price of the land can be obtained by making the footage of the house zero. Alternatively, the footage of the house can be obtained by calculating the price of the land at zero. Therefore, the x and y-intercepts give the value of one variable by obscuring the value of the other variable in a two-variable equation. Looking at the line of best fit, the intercept makes sense. The value of the land is given by the y-intercept, which is obtained as 72930.
R-squared Coefficient
The R-squared value is 0.79, which implies that the model explains 79% of the change in the outcome variable (price listing) due to the change in square feet. The R-squared value shows the percentage of the points that fall on the scatter plot regression line. In other words, 79% of the values fit the model (Turner, 2020). In this analysis, the change in a land’s area in square feet explains 79% of the change in pricing.
Conclusions
The mean sales price for the land is 342,365. Four hundred and nine (409) states have listing prices greater than the mean listing price. This implies that a greater number of states have listing prices below