Statistics Notes

STATISTICS AND ECONOMETRICS

(Complete, Easy & Detailed Notes for UGC NET – Economics)


1. Introduction

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data for decision-making.

Why do economists use statistics?

  • To measure and summarize economic activities (GDP, inflation, unemployment, poverty)

  • To test hypotheses (e.g., Does education increase income?)

  • To explain relationships (e.g., demand & price, interest & investment)

  • To forecast future trends (e.g., stock markets, rainfall, exchange rates)

Econometrics is the application of statistical and mathematical tools to economic data to verify theories and predict outcomes.

Econometrics = Economics + Mathematics + Statistics + Computer science

Hal Varian: Econometrics gives empirical content to economic relationships and helps estimate real-world cause-and-effect using data.


PART A – STATISTICS


2. Types of Data

Type Description Example
Primary Collected first-hand Field surveys
Secondary Already available data RBI, CSO, Census
Cross-section Multiple units at a point in time Household income 2024
Time series Same unit across time GDP yearly
Panel Combination of both NFHS data

3. Measurement Scales

Scale Nature Example
Nominal Labels only Gender
Ordinal Ranking Education level
Interval Equal units, no true zero Temperature
Ratio Absolute zero Income, weight

4. Probability Concepts

Probability = Likelihood that an event occurs.

Rules of Probability

  • 0P(A)1

  • P(S)=1

  • Complement: P(A)=1P(A)

  • Addition: P(AB)=P(A)+P(B)P(AB)

Conditional Probability

P(AB)=P(AB)P(B)

Bayes’ Theorem

P(AB)=P(BA)P(A)P(B)


5. Probability Distributions

Discrete Distributions

Mean Variance
Binomial np np(1p)
Poisson λ λ

Continuous Distributions

Mean Variance
Normal μ σ2

Most economic variables are normally distributed (e.g., heights, test scores, errors).


6. Central Limit Theorem

When sample size is large, the distribution of sample mean tends toward normal, regardless of population distribution.

xˉN(μ,σ2/n)

Basis of hypothesis testing.


7. Descriptive Statistics

Measures of Central Tendency

xˉ=xin

Measures of Dispersion

σ2=(xixˉ)2n,σ=σ2

CV=σxˉ×100

Moments

Used to measure shape of distribution.

  • Skewness → symmetry

  • Kurtosis → peakedness


8. Correlation

r=nxy(x)(y)[nx2(x)2][ny2(y)2]

 1r+1


9. Index Numbers

Types: Wholesale Price Index, CPI, Laspeyres, Paasche

PL=P1Q0P0Q0×100

PART B – ECONOMETRICS


1. Linear Regression Model

Y=β0+β1X+u

OLS Estimators

β1=(xxˉ)(yyˉ)(xxˉ)2,β0=yˉβ1xˉ

Interpretation: Regression gives the best linear predictor of Y given X.


2. BLUE (Gauss-Markov Theorem)

OLS is Best Linear Unbiased Estimator if:

  1. Linear in parameters

  2. Zero mean error

  3. No autocorrelation

  4. Homoscedasticity (constant variance)

  5. No perfect multicollinearity

  6. X is non-stochastic


3. Hypothesis Testing

  • t-test: significance of each coefficient

  • F-test: significance of model

  • : goodness of fit


4. Model Problems

Problem Meaning
Heteroscedasticity non-constant variance
Autocorrelation errors correlated over time
Multicollinearity strong correlation among Xs
Endogeneity correlation between X and u

Solutions

  • White test, Breusch-Pagan, Durbin-Watson

  • GLS, HAC estimators, Cochrane-Orcutt


5. Simultaneous Equation Models

  • Exact, under, over-identified

  • 2SLS, 3SLS, LIML estimators


6. Discrete Choice Models

Model Purpose
Logit binary choices
Probit utility behavior
Tobit censored data

7. Time Series

Components: Trend, Seasonal, Cyclical, Irregular

Stationarity

E(Yt)=μ,Var(Yt)=σ2

ADF Test for unit roots.

ARIMA models

ARIMA(p,d,q)


What This Unit Helps You Achieve

  • Understand data & probability

  • Build and interpret econometric models

  • Perform hypothesis testing

  • Forecast economic variables

  • Distinguish correlation vs causation

👋Subscribe to
ProTeacher.in

Sign up to receive NewsLetters in your inbox.

We don’t spam! Read our privacy policy for more info.