Business Statistics Notes: Correlation and Regression Analysis


Unit – 2: Correlation and Regression analysis
Correlation analysis
Q.N.1. Define Correlation analysis. What are its various kinds?
Ans: - Definition: - Correlation is the degree of the relationship between two or more variables. It does not explain the cause behind the relationship. Kinds of correlation may be studied on the basis of:
I. Change in proportion.
II. Number of variation.
III. Change in direction.
(I) Basis of change in proportion:-There are two important correlations on the basis of change in proportion. They are:
(a) Linear correlation (b) Non-linear correlation
(a) Linear correlation: - Correlation is said to be linear when one variable move with the other variable in fixed proportion
(b) Non-linear correlation: - Correlation is said to be non-linear when one variable move with the other variable in changing proportion.
(II) On the basis of number of variables: On the basis of number of variables, correlation may be:
(a) Simple (b) Partial (c) Multiple
(a) Simple correlation: - When only two variables are studied it is a simple correlation.
(b) Partial correlation: - When more than two variables are studied keeping other variables constant, it is called partial correlation.
(c) Multiple correlations: - When at least three variables are studied and their relationships are simultaneously worked out, it is a case of multiple correlations
(III) On the basis of Change in direction: On the basis of Chang in direction, correlation may be
(a)Positive Correlation (b) Negative Correlation
(a) Positive Correlation: - Correlation is said to be positive when two variables move in same direction.
(b) Negative Correlation: - Correlation is said to be negative when two variables moves in opposite direction.
Q.N.2.What are the uses and limitations of Correlation?
Ans: - Following are the main advantages of correlation:
1. It gives a precise quantitative value indicating the degree of relationship existing between the two variables.
2. It measures the direction as well as relationship between the two variables.
3. Further in regression analysis it is used for estimating the value of dependent variable from the known value of the independent variable
4. .The effect of correlation is to reduce the range of uncertainty in predictions.
Following are the main limitations of correlation:
1. Extreme items affect the value of the coefficient of correlation.
2. Its computational method is difficult as compared to other methods.
3. It assumes the linear relationship between the two variables, whether such relationship exist or not.
Q.N.3. What are the different degrees of Correlation?
Ans: The different degrees of correlation are:                         
i)        Perfect Correlation: - It two variables vary in same proportion, and then the correlation is said to be perfect correlation.
ii)       Positive Correlation: - If increase (or decrease) in one variable corresponds to an increase (or decrease) in the other, the correlation is said to be positive correlation.
iii)     Negative Correlation: - If increase (or decrease) in one variable corresponds to a decrease (or increase) in the other, the correlation is said to be positive correlation.
iv)    Zero or No Correlation: - If change in one variable does not other, than there is no or zero correlation.
Q.N.4. What are the different methods of studying correlation?
Ans: - The different methods of studying relationship between two variables are:
i)        Scatter diagram method.
ii)       Graphic method
iii)     Karl Pearson’s coefficient of correlation
iv)     Rank correlation method
i) Scatter Diagram Method: - It is a graphical representation of finding relationship between two or more variables. Independent variable are taken on the x-axis and dependent variable on the y-axis and plot the various values of x and y on the graph. If all values move upwards then there is positive correlation, if they move downwards then there is negative correlation.
Merits:
i) It is easy and simple to use and understand.
ii) Relation between two variables can be studied in a non-mathematical way.
Demerits:-
i) It is non-mathematical method so the results are non-exact and accurate.
ii) It gives only approximate idea of the relationship.
ii) Graphic Method: - This is an extension of linear graphs. In this case two or more variables are plotted on graph paper. If the curves move in same direction the correlation is positive and if moves in opposite direction then correlation is negative. But if there is no definite direction, there is absence of correlation. Although it is a simple method, but this shows only rough estimate of nature of relationship.
                Merits: -
i) It is easy and simple to use and understand.
ii) Relation between two variables can be studied in a non-mathematical way.
Demerits:-
i) It is non-mathematical method so the results are non-exact and accurate.
ii) It gives only approximate idea of the relationship.
iii) Karl Pearson’s Coefficient of correlation: - Correlation coefficient is a mathematical and most popular method of calculating correlation. Arithmetic mean and standard deviation are the basis for its calculation. The Correlation coefficient (r), also called as the linear correlation coefficient measures the strength and direction of a linear relationship between two variables. The value of r lies between -1 to +1.
Properties of r:-
i)        r is the independent to the unit of measurement of variable.
ii)       r does not depend on the change of origin and scale.
iii)     If two variables are independent to each other, then the value of r is zero.
Merits:-
i)        The co-efficient of correlation measures the degree of relationship between two variables.
ii)       It also measures the direction.
iii)     It may be used to determine regression coefficient provided s.d. of two variables are known.
Demerits:-
i)        It assumes always the linear relationship between the variables even if this assumption is not correct.
ii)       It is affected by extreme values.
iii)     It takes a lot of time to compute.
iv) Spearman’s rank Coefficient of correlation: - This is a qualitative method of measuring correlation co-efficient. Qualities such as beauty, honesty, ability, etc. cannot be measured in quantitative terms. So, ranks are used to determine the correlation coefficient.
Merits:-
i)        It is easy and simple to calculate and understand.
ii)       This method is most suitable if the data are qualitative.
Demerits:-
i)        This method cannot be used in case of grouped frequency distribution.
ii)       Where the number of items exceeds 30 the calculations become quite tedious and require a lot of time.
 Regression Analysis
Q.N.1. Define Regression analysis. What are its Various kinds?
Ans: - Definition: -Regression is the measure of the average relationship between two or more variable in terms of the original units of the data. It is a statistical tool with the help of which the unknown values of one variable can be estimated from known values of another variable.
Kinds of regression may be studied on the basis of:
I. Change in proportions.
II. Number of variation.
(I) Basis of change in proportion:-There are two important regressions on the basis of change in proportion. They are:
(a) Linear regression: - Regression is said to be linear when one variable move with the other variable in fixed proportion
(b) Non-linear regression: - Regression is said to be non-linear when one variable move with the other variable in changing proportion.
(II) On the basis of number of variables: On the basis of number of variables, regression may be:
(a) Simple regression: - When only two variables are studied it is a simple regression.
(b) Partial regression: - When more than two variables are studied keeping other variables constant, it is called partial regression.
(c) Multiple regressions: - When at least three variables are studied and their relationships are simultaneously worked out, it is a case of multiple regressions.
Q.N.2.What are the uses and limitations of Regression?
Ans: - The following are main Advantages of regression analysis:
(1) Helpful to statisticians:- The study of regression helps the statisticians to estimate the most probable value of one variable of a series for the given values of the other related variables of the series.
(2) Nature of relationship: - Regression is useful in describing the nature of the relationship between two variables.
(3) Estimation of relationship: - Regression analysis is widely used for the measurement and estimation of relationship among economic variables.
(4) Predictions: - Regression analysis is helpful in making quantitative predictions on the basis of estimated relationship among variables.
(5) Policy formulation: - The predictions made on the basis of estimated relationship are used in policy making.
The following are the main limitation of regression:
1) No change in relationship: - Regression analysis is based on the assumption that while computing regression equation; the relationship between variables will not change.
(2) Conditions: - The application of regression analysis is based on certain conditions like, for existence of linear relationship between the variables; exact values are needed for the independent variable.
(3) Spurious relationships: - There may be nonsense and spurious regression relationships. In such case, the regression analysis is of no use.
Q.N.3. Distinguish between correlation and regression.
Ans:-There are some basis difference between correlation and regression:
(1) Nature of relationship: - Correlation explains the degree of relationship, whereas regression explains the nature of the relationship.
(2) Causal relationship: - Correlation does not explain the cause behind the relationship whereas regression studies the cause and effect relationship.
(3) Prediction: - Correlation does not help in making prediction whereas regression enable us to make prediction.
(4) Origin and scale: - Correlation coefficient is independent of the change of origin and scale, whereas regression coefficient is independent of change of origin but not of scale.
Q.N.4. What are the characteristics of regression co-efficients?
Ans: - Characteristics of good regression coefficient:
1. Both regression co-efficients will have the same sign.
2. If one regression co-efficient is above unity, then the other regression co-efficient should be below unity.
3. If both the regression co-efficient are negative, correlation co-efficient should be negative
4. Regression co-efficients are independent of change of origin but not of scale.
Q.N.5. What are the regression lines? Why do we generally have two regression equations?
Ans: - A line of regression by the method of “least square” shows an average relationship between variables under study. This regression line can be drawn graphically or derived algebraically. A line fitted by method of least square is known as the line of best fit. There are two regression lines:-
Regression line of x on y: - Regression line of x on y is used to predict x for a given value of y. The regression equation of x on y is x=a+by.
Regression line of y on x: - Regression line of y on x is used to predict y for a given value of x. The regression equation of y on x is y=a+bx
Two regression lines: - We know that there are two lines of regression: - x on y and y on x. For these lines, the sum of the square of the deviations between the given values and their corresponding estimated values obtained from the line is least as compared to other line. One regression line cannot minimise the sum of squares for both the variables that is why we are getting two regression lines. (We get one regression line when r = +1 and Two regression lines will be at right angles when r = 0.)

0/Post a Comment/Comments

Kindly give your valuable feedback to improve this website.