Exercise
#2: Probability
|
Solution Commentary
Manual Solution The key realization to solving this problem is to recognize that the range specified by the second subset (X<= 5) consists of the range specified by the first subset(X<5) and the subset of interest (X=5). Similarly, the probability of the second subset is equal to the sum of the probabilities of the first subset and the subset of interest. In other words, the probability of the subset of interest is the difference between the probabilities of the two subsets. Thus, Pr(X= 5)= Pr(X<= 5) -Pr(X<5) Pr(X= 5)= 5/18 - 2/9 Pr(X=5) = 5/18 - 4/18 Pr(X=5) = 1/18 |
| Observation | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| Advertising ($) | 60,000 | 70,000 | 70,000 | 80,000 | 80,000 | 90,000 | 100,000 | 100,000 | 100,000 | 110,000 |
| Sales ($) | 362,000 | 416,000 | 417,000 | 499,000 | 485,000 | 536,000 | 602,000 | 623,000 | 616,000 | 663,000 |
Solution Commentary
Manual Solution The equation for the best-fit regression line through a set of points is given by y = mx + b where m is the slope and b is the y-intercept. The slope m is given by the covariance of x and y divided by the variance of x. The y-intercept b is given by yavg - m * xavg where yavg is the mean of y, m is the slope, and xavg is the mean of x. We take these definitions as given without explanation and focus on computation. We now compute the various intermediate values needed to define the equation of the regression line. Let's compute first the means of x and y, which in our case are the advertising and sales values, respectively. Mean of x = (60,000 + 70,000 + 70,000 + 80,000 + 80,000 + 90,000 + 100,000 + 100,000 + 100,000 + 110,000)/10 Mean of x = 86,000 Mean of y = (362,000 + 416,000 + 417,000 + 499,000 + 485,000 + 536,000 + 602,000 + 623,000 + 616,000 + 663,000)/10 Mean of y = 521,900 Let's now compute the variance of x: Variance of x = ((60,000 - 86,000)2 + (70,000 - 86,000)2 + (70,000 - 86,000)2 + (80,000 - 86,000)2 + (80,000 - 86,000)2 + (90,000 - 86,000)2 + (100,000 - 86,000)2 + (100,000 - 86,000)2 + (100,000 - 86,000)2 + (110,000 - 86,000)2)/10 Variance of x = 244,000,000 Next we compute the covariance of x and y: Covariance of x and y = ((60,000 - 86,000)*(362,000 - 521,900) + (70,000 - 86,000)*(416,000 - 521,900) + (70,000 - 86,000)*(417,000 - 521,900) + (80,000 - 86,000)*(499,000 - 521,900) + (80,000 - 86,000)*(485,000 - 521,900) + (90,000 - 86,000)*(536,000 - 521,900) + (100,000 - 86,000)*(602,000 - 521,900))/10 Covariance of x and y = 1,518,600,000 We can now put together the pieces of the regression line equation: The slope m = 1,518,600,000/244,000,000 = 6.22 The y-intercept b = 521,900 - 6.22 * 86,000 = -13,344 The value requested in this exercise is the slope, which is 6.22 Excel Solution The Excel solution image omits six of the data points for space reasons.
|
Solution Commentary
Manual Solution "Standard" normal distributions have a mean of zero and a standard deviation of 1. Fortunately, nonstandard normal distributions can be easily converted into standard normal distributions. Solutions obtained for the standard distribution can be applied to the corresponding nonstandard problem. The motivation for this conversion from standard to nonstandard normal distribution is that there is no simple functional form for obtaining solutions to normal distribution problems. Rather, the manual solution process for standard normal distribution problems with the probability specified requires solving the problem using the normal distribution table of values. The table entries consist of probabilities for the interval from the mean to positive z values. We use the first column to identify the row for the z value integer and tenth place. We then use the first row to identify the column for the z value hundredth place. The probability for the z value is at the intersection of the corresponding row and column. We interpolate for z-values with more than two decimal places. We first convert the value of interest from the x scale and units of the original problem to the corresponding z-value, using the formula: z = (x - m)/s, where x is value of interest, s is the standard deviation and m is the mean, all in the units of the original problem z = (3,400 - 4,600)/ 950 z = -1.263 The clearest way to understand the transformations of the exercise into a form that can be answered using the standard normal distribution table of values is to create a set of pictures that illustrate the transformations. (Note that the pictures below show relationship to the mean and do not reflect the exact z-values of the exercise.)
We compute the target probability for a z-value of 1.2632 in a series of steps: 1. Round the target z-value up to the nearest hundredth to find the 'upper' z-value. The 'upper' z-value is 1.27 The corresponding 'upper' probability is 0.3980 2. Note that the 'lower' z-value is 1.26, which is the upper z-value minus 0.01. The corresponding 'lower' probability is 0.3962 3. Because our probability solution is reported to the nearest percent, and the upper and lower probabilities round to the same percent, we can report the probability as 0.40 without further interpolation.
Excel Solution
|
|
© 2005-2010 DecisionClass, LLC |
About Us | MBA Quant Blog | Contact Us | Site Map | |||