Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Correlation and Regression - Introduction to Statistics - Lecture notes, Study notes of Statistics

Correlation and Regression, Simple relationship, Multiple relationship, Scatter plot, Algebra notation, Statistics notation, Correlation coefficient, Critical value for Scatterplot are learning points available in this lecture notes.

Typology: Study notes

2011/2012

Uploaded on 11/14/2012

dharm
dharm 🇮🇳

4.3

(24)

59 documents

1 / 8

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Confidence intervals
Hypothesis testing
Correlation & Regression
Q: Given data of two separate variables how can we determine if a relationship exists?
Correlation is a statistical method used to determine whether a relationship between
variables exists.
Regression is a statistical method used to describe the relationship.
Simple VS Multiple relationships:
Simple relationships have two variables, one dependent on the other one.
o Q: In a staff of sales people, does years of experience affect sales?
Experience Sales
Independent (explanatory) variable: Experience
Dependent (response) variable: Sales
o We call a analysis on simple cause and effect relationships a simple
regression.
Analysis on multiple relationships is called a multiple regression. In multiple
regressions two or more independent variables are used to predict one dependent
variable.
o Study Hours
GPA College Success
High School Success
Relationships can be classified into two categories:
A positive relationship exists when both variables increase at the same time.
o Height and weight have a positive relationship.
In a negative relationship, as one variable increases, the other one decreases.
o Party hours and GPA have a negative relationship.
Simple relationships have two variables: x and y.
A scatter plot is a graph of the ordered pairs of data (x, y) where x is the independent
variable and y is the dependent variable.
Docsity.com
pf3
pf4
pf5
pf8

Partial preview of the text

Download Correlation and Regression - Introduction to Statistics - Lecture notes and more Study notes Statistics in PDF only on Docsity!

Confidence intervals √ Hypothesis testing √ Correlation & Regression Q: Given data of two separate variables how can we determine if a relationship exists? Correlation is a statistical method used to determine whether a relationship between variables exists. Regression is a statistical method used to describe the relationship. Simple VS Multiple relationships:

  • Simple relationships have two variables, one dependent on the other one. o Q: In a staff of sales people, does years of experience affect sales? ExperienceSales Independent (explanatory) variable: Experience Dependent (response) variable: Sales o We call a analysis on simple cause and effect relationships a simple regression.
  • Analysis on multiple relationships is called a multiple regression. In multiple regressions two or more independent variables are used to predict one dependent variable. o Study Hours GPACollege Success High School Success Relationships can be classified into two categories:
  • A positive relationship exists when both variables increase at the same time. o Height and weight have a positive relationship.
  • In a negative relationship , as one variable increases, the other one decreases. o Party hours and GPA have a negative relationship. Simple relationships have two variables: x and y. A scatter plot is a graph of the ordered pairs of data (x, y) where x is the independent variable and y is the dependent variable.

Correlation : Linear Relationships Algebra notation Statistics notation y = mx + b y/^ = a + bx How linear is a relationship... .linear-ness? We call this “linear-ness” the correlation coefficient. o Population: ρ o Sample: r The correlation coefficient measures both strength and direction of linear relationships A value of r = 0 represents no linear relationship. Strong positive relationships approach r = 1 and strong negative relationships approach r = – 1. Scatter plots may show relationships that are non-linear.

The line of best fit minimizes the average of the squares of distances from the line to each point (again we use squares to compensate for + and – values). Getting the line of best fit ( regression line ): Formula: y/^ = a + b x a =

(∑^ y ) ( ∑ x^2 ) −^ (∑ x ) (∑ xy )

n ( ∑ x^2 ) − (∑ x )

2 b =

n ( ∑ xy ) − ( ∑ x ) (∑ y )

n ( ∑ x^2 ) − ( ∑ x )

2 OR STAT CALC LinReg (a + bx) ENTER ENTER