STA215H5 Lecture Notes - Lecture 7: Standard Deviation, Scatter Plot, Simple Linear Regression
STA215; Chapter 7
What is Linear Regression?
●Linear Regression is a linear approach to modelling the relationship between the
response variable and one or more explanatory variables.
● The case of one explanatory variable is called simple linear regression
● Example;
○ Suppose we find that how long running shoes last is related to how much they
cost BUT we want to know how long a pair of running shoes are expected to
last if we pay $55?
○ Correlation alone won’t answer this question. - Instead, we need a model!!
○ In particular, we need a model to be able to use one variable to predict the
other.
○ The solution: Linear Regression!
● Regression Line
○ A regression line is a straight line that describes how a response variable, y,
changes as an explanatory variable, x ,changes.
○ We can use a regression line to predict the value of y for a given value of x.
● Equation of Straight Line
○ A straight line relating y (response variable) to x (explanatory variable) has an
equation of the form: y = ax + b
○ Where;
■a is the slope, the amount by which y changes when x increases by
one unit
■b is the intercept and corresponds to the value of y when x=0
● Example;
○ Using our motivating example from slide 3, we expect that how long running
shoes last is related to the cost. Data collected from 100 elementary school
students was used to develop a model for this relationship and the following
was obtained:
■ Amount of time running shoes last (hours) = 15.34 + 35.02 x price of
running shoes (dollars)
○ a) What is the slope of this line? Say in words what this value means.
■ The slope is 35.02. This means on average, the amount of time shoes
last increases by 35.02 hours for every dollar increase in price
○ b) What is the intercept? Explain why the value of the intercept is not
statistically meaningful.
■ The intercept is 15.34. This is the amount of time running shoes that
cost $0 last. Even though this interpretation is valid, such prediction
would not make sense because shoes are not free...unfortunately
○ c) Find the predicted amount of time running shoes that cost $55 will last
■ For shoes that cost $55, we predict they will last: 1,941.44 hours 15.34
+35.02(55) = 1,941.44 hours
● Example;
○ Suppose you have a lucky pencil and only use that pencil everyday. It initially
weighed 5 grams when you first bought it. Its’ weight goes down 0.004 grams
Document Summary
What is linear regression? response variable and one or more explanatory variables. Linear regression is a linear approach to modelling the relationship between the. The case of one explanatory variable is called simple linear regression. In particular, we need a model to be able to use one variable to predict the other. A regression line is a straight line that describes how a response variable, y, We can use a regression line to predict the value of y for a given value of x. changes as an explanatory variable, x ,changes. A straight line relating y (response variable) to x (explanatory variable) has an equation of the form: y = ax + b. A is the slope, the amount by which y changes when x increases by. B is the intercept and corresponds to the value of y when x=0 one unit.