
How-To Compute the Best-Fit Straight Line to a Set of Data

Regression Equation

Given a collection of paired sample data, the following formula (the regression equation) describes the relationship between X (the independent or predictor variable) and Y’ (the predicted value of the dependent or response variable Y):

$$Y' = mX + C$$ (eq. 1)

where m is the slope (rate of change) of the best-fit line, X is any particular X value within the range of the data set, and C is the Y-intercept, that is, the value of Y’ when X equals zero.

Use equation 2 to compute m:

$$m = r\,\frac{SD_Y}{SD_X}$$ (eq. 2)

The symbol r is the correlation coefficient between X and Y; $SD_Y$ and $SD_X$ are the standard deviations of the Y and X variables, respectively.

The Y-intercept C is computed with equation 3:

$$C = \bar{Y} - m\bar{X}$$ (eq. 3)

where $\bar{Y}$ is the mean of the Y scores, m is the slope, and $\bar{X}$ is the mean of the X scores. Substituting equation 2 for m, equation 3 can be rewritten as:

$$C = \bar{Y} - r\,\frac{SD_Y}{SD_X}\,\bar{X}$$ (eq. 4)

Combining equations 2 and 3 gives the equation of the best-fit regression line for predicting Y (Y’) from any X value, shown in equation 5:

$$Y' = r\,\frac{SD_Y}{SD_X}\,X + \bar{Y} - m\bar{X}$$ (eq. 5)

Alternatively, combining equations 2 and 4 expresses the equation for a straight line as:

$$Y' = r\,\frac{SD_Y}{SD_X}\,X + \bar{Y} - r\,\frac{SD_Y}{SD_X}\,\bar{X}$$ (eq. 6)

The slope and Y-intercept of the best-fit regression line can also be computed from raw scores. Equation 7 gives the slope:

$$m = \frac{\dfrac{\sum XY}{N} - \left(\dfrac{\sum X}{N}\right)\left(\dfrac{\sum Y}{N}\right)}{\dfrac{\sum X^2}{N} - \left(\dfrac{\sum X}{N}\right)^2}$$ (eq. 7)

where the numerator equals the numerator of the correlation coefficient (r) and the denominator equals the square of $SD_X$.

The raw-score equation (eq. 8) computes the Y-intercept (C) as:

$$C = \frac{\left(\dfrac{\sum Y}{N}\right)\left(\dfrac{\sum X^2}{N}\right) - \left(\dfrac{\sum X}{N}\right)\left(\dfrac{\sum XY}{N}\right)}{\dfrac{\sum X^2}{N} - \left(\dfrac{\sum X}{N}\right)^2}$$ (eq. 8)

The denominator in equation 8 equals the square of $SD_X$.

Example

Given the following 5 data points for temperature in degrees Fahrenheit (Y variable) and temperature in degrees Celsius (X variable), compute the equation for the best-fitting straight line.

Y (Fahrenheit temperature)    32    40      60      80      100
X (Celsius temperature)        0     4.44   15.55   26.6     37.77

Step 1. Compute r, $SD_X$, and $SD_Y$.

$\sum Y = 312$; $\sum Y^2 = 22624$; $\sum X = 84.36$; $\sum X^2 = 2395.65$; $\sum XY = 7015.6$; N = 5
r = 0.9999
$\bar{Y} = 62.4$; $SD_Y = 25.12$
$\bar{X} = 16.87$; $SD_X = 13.95$
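The Step 1 quantities can be reproduced with a short script. What follows is only a sketch in Python, which the original handout does not use; the function names `summary_stats` and `raw_score_line` are hypothetical. It assumes divide-by-N (population) standard deviations, since that choice reproduces the SD values reported above, and it takes the slope from the raw-score form of equation 7 and the intercept from equation 3.

```python
import math

# Example data from the text: X in degrees Celsius, Y in degrees Fahrenheit.
xs = [0, 4.44, 15.55, 26.6, 37.77]
ys = [32, 40, 60, 80, 100]


def summary_stats(xs, ys):
    """Return raw sums, means, population SDs, and r for paired data."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_x2 = sum(x * x for x in xs)
    sum_y2 = sum(y * y for y in ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    mean_x, mean_y = sum_x / n, sum_y / n
    # Divide-by-N (population) variances and covariance.
    var_x = sum_x2 / n - mean_x ** 2
    var_y = sum_y2 / n - mean_y ** 2
    cov_xy = sum_xy / n - mean_x * mean_y
    sd_x, sd_y = math.sqrt(var_x), math.sqrt(var_y)
    return {
        "n": n,
        "sum_x": sum_x, "sum_y": sum_y,
        "sum_x2": sum_x2, "sum_y2": sum_y2, "sum_xy": sum_xy,
        "mean_x": mean_x, "mean_y": mean_y,
        "var_x": var_x, "cov_xy": cov_xy,
        "sd_x": sd_x, "sd_y": sd_y,
        "r": cov_xy / (sd_x * sd_y),
    }


def raw_score_line(stats):
    """Slope from the raw-score form (eq. 7), intercept from eq. 3."""
    slope = stats["cov_xy"] / stats["var_x"]
    intercept = stats["mean_y"] - slope * stats["mean_x"]
    return slope, intercept


stats = summary_stats(xs, ys)
slope, intercept = raw_score_line(stats)
print(round(stats["sd_x"], 2), round(stats["sd_y"], 2))  # 13.95 25.12
print(round(slope, 2), round(intercept, 1))              # 1.8 32.0
```

Working from unrounded sums gives a slope and intercept that agree with the hand calculation to the precision reported in the example.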

Step 2. Compute the slope (m) and Y-intercept (C) using equations 2 and 4, respectively.

$$m = r\,\frac{SD_Y}{SD_X}$$ (eq. 2)

m = 0.9999 (25.12 ÷ 13.95)
m = 1.80

$$C = \bar{Y} - r\,\frac{SD_Y}{SD_X}\,\bar{X}$$ (eq. 4)

C = 62.4 - 0.9999 (25.12 ÷ 13.95)(16.87)
C = 32.0

Step 3. The equation for the regression of degrees Fahrenheit (Y) on degrees Celsius (X) becomes:

Y’ = mX + C
Y’ = 1.8X + 32.0

Step 4. Determine the best-fit straight line of the regression, and plot the individual data points as a scatter diagram (see figure 1). Arbitrarily select a value of X near the maximum observed value of X, and substitute it into the equation to solve for Y’ (predicted Y). Plot this point (X, Y’) on the scattergram. Repeat the procedure for another value of X near the minimum observed value of X. The straight line joining the two points is the best-fitting straight line generated from the regression equation.
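Steps 2 through 4 can be sketched in a few lines of Python. This is an illustrative sketch rather than part of the original handout: the names `fit_line` and `predict` are hypothetical, the inputs are the rounded Step 1 values with r = 0.9999 as used in Step 2, and the two X values chosen for Step 4 (the smallest and largest observed Celsius readings) are an arbitrary choice.

```python
# Rounded summary values from Step 1 of the example (r as used in Step 2).
r = 0.9999
mean_x, mean_y = 16.87, 62.4
sd_x, sd_y = 13.95, 25.12


def fit_line(r, sd_x, sd_y, mean_x, mean_y):
    """Return (slope, intercept) of the best-fit line from summary values."""
    slope = r * sd_y / sd_x              # eq. 2
    intercept = mean_y - slope * mean_x  # eq. 3, equivalent to eq. 4
    return slope, intercept


def predict(x, slope, intercept):
    """Predicted Y' for a given X (eq. 1)."""
    return slope * x + intercept


slope, intercept = fit_line(r, sd_x, sd_y, mean_x, mean_y)
print(round(slope, 2), round(intercept, 1))  # 1.8 32.0

# Step 4: pick one X near the minimum and one near the maximum observed X.
# These two values are an arbitrary choice for illustration.
x_low, x_high = 0.0, 37.77
endpoints = [(x, predict(x, slope, intercept)) for x in (x_low, x_high)]
for x, y_pred in endpoints:
    print(x, round(y_pred, 1))
```

Joining the two printed (X, Y’) points with a straight line on the scatter diagram gives the regression line described in Step 4.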

