Statistics for Management, Operations, and Operations Research
PART 1: STATISTICS FOR MANAGEMENT
1. Concept and Scope
Statistics is both a science and an art of collecting, classifying, presenting, analyzing, and interpreting numerical data to aid rational decision-making under uncertainty.
It provides quantitative foundations for managerial functions such as planning, control, and forecasting.
Branches of Statistics
-
Descriptive Statistics – summarizing data through tables, charts, and averages.
-
Inferential Statistics – drawing conclusions about populations from samples using probability theory.
Role of Statistics in Management
-
Marketing: Market surveys, consumer behaviour analysis.
-
Finance: Portfolio risk analysis, stock price movements.
-
Production: Quality control, forecasting demand.
-
HR: Wage analysis, performance evaluation.
-
Operations: Scheduling, process optimization.
2. Data and Its Types
A. Based on Source
-
Primary Data: Collected first-hand for a specific study (surveys, interviews).
-
Secondary Data: Collected earlier for another purpose (reports, journals, databases).
B. Based on Nature
-
Qualitative (Attribute): Categorical (e.g., gender, brand preference).
-
Quantitative (Variable): Numeric (e.g., income, profit).
C. Based on Measurement Scale
| Scale | Meaning | Example |
|---|---|---|
| Nominal | Classification only | Gender, religion |
| Ordinal | Rank order | Satisfaction level |
| Interval | Equal intervals, no true zero | Temperature (°C) |
| Ratio | True zero and intervals | Sales, weight |
🟩 3. MEASURES OF CENTRAL TENDENCY
Central tendency expresses the “typical” or “representative” value of a dataset.
A. Arithmetic Mean
For grouped data:
Merits: Simple, algebraically tractable.
Limitations: Affected by extreme values.
B. Median
The middle value when data are arranged in order.
-
Less affected by outliers.
-
Appropriate for skewed data.
C. Mode
Most frequent value.
For grouped data:
Used for qualitative data like brand or color preference.
D. Relationship among Mean, Median, and Mode
Useful for estimating one measure from the other two.
🟩 4. MEASURES OF DISPERSION
Dispersion measures how values deviate from the average — indicating consistency or risk.
| Measure | Formula | Interpretation |
|---|---|---|
| Range | Max − Min | Simple measure of spread |
| Quartile Deviation (Q.D.) | (Q₃ − Q₁) / 2 | Dispersion of middle 50% |
| Mean Deviation (M.D.) | (\frac{\sum | X – \bar{X} |
| Variance (σ²) | Fundamental for inferential statistics | |
| Standard Deviation (σ) | √Variance | Most widely used measure |
| Coefficient of Variation (CV) | Compare variability between datasets |
Example:
Dataset A: Mean = 50, SD = 10 → CV = 20%
Dataset B: Mean = 80, SD = 16 → CV = 20%
→ Both have equal relative variability.
🟩 5. PROBABILITY AND PROBABILITY DISTRIBUTIONS
A. Concept of Probability
Probability quantifies the likelihood of occurrence of an event.
Range: 0 ≤ P(A) ≤ 1
-
P(A) = 1: Certain event
-
P(A) = 0: Impossible event
B. Rules of Probability
-
Addition Rule:
If A and B are mutually exclusive, -
Multiplication Rule:
For independent events, -
Conditional Probability:
C. Probability Distributions
(1) Binomial Distribution
Discrete distribution used for number of successes in n independent trials.
-
Mean = np
-
Variance = np(1−p)
Example: Probability of 3 defective bulbs out of 10 when defect rate = 0.1.
(2) Poisson Distribution
For rare events (e.g., accidents per day).
-
Mean = Variance = m
Used when n → large, p → small, and np = constant.
(3) Normal Distribution
Continuous, bell-shaped curve.
Properties:
-
Symmetrical about mean.
-
68.26% within ±1σ, 95.45% within ±2σ, 99.73% within ±3σ.
Used in hypothesis testing and control charts.
(4) Exponential Distribution
Used to model time between events (e.g., waiting time).
Mean = 1/λ
Variance = 1/λ²
🟩 6. DATA COLLECTION AND QUESTIONNAIRE DESIGN
Data Collection
-
Primary: Through direct observation, survey, or experimentation.
-
Secondary: Government reports, journals, internet sources.
Questionnaire Design
-
Define objectives
-
Select information to collect
-
Choose question type:
-
Open-ended (qualitative insights)
-
Closed-ended (quantitative analysis)
-
-
Logical sequencing (easy → complex)
-
Pilot testing and revision
Common Mistakes: Ambiguous wording, double-barreled questions, poor scaling.
🟩 7. SAMPLING THEORY
A. Basic Concepts
-
Population (Universe): Entire group under study
-
Sample: Representative subset
-
Sampling Unit: Element from which data is collected
B. Steps in Sampling Process
-
Define population
-
Select sampling frame
-
Decide sample size
-
Choose technique
-
Collect and analyze
C. Probability Sampling Techniques
| Method | Description | When to Use |
|---|---|---|
| Simple Random | Equal chance for each unit |
Small, homogeneous population |
| Systematic | Every kth item selected | Sequential data |
| Stratified |
Population divided into strata, then random sample |
Heterogeneous population |
| Cluster |
Dividing into clusters, sampling entire cluster |
Wide geographical dispersion |
D. Non-Probability Sampling Techniques
| Method | Description |
|---|---|
| Convenience | Easy to reach sample (quick but biased) |
| Judgmental |
Based on researcher’s expertise |
| Quota |
Fixed proportion from categories |
| Snowball | Existing respondents recruit new ones |
🟩 8. HYPOTHESIS TESTING
A. Key Definitions
-
Parameter: Numerical summary of population (μ, σ²).
-
Statistic: Calculated from sample (x̄, s²).
Goal: Use sample data to infer about population.
B. Hypothesis Types
-
Null Hypothesis (H₀): No significant difference.
-
Alternative Hypothesis (H₁): Significant difference exists.
Errors:
-
Type I (α): Rejecting true H₀
-
Type II (β): Accepting false H₀
C. Testing Steps
-
Formulate H₀ and H₁
-
Choose significance level (α = 0.05 or 0.01)
-
Choose appropriate test statistic (Z, t, F, χ²)
-
Compute test statistic
-
Compare with critical value
-
Draw conclusion
D. Parametric Tests
| Test | Application | Condition |
|---|---|---|
| Z-Test | Large samples, known σ | n > 30 |
| t-Test | Small samples, unknown σ | n < 30 |
| F-Test | Compare variances | Ratio test |
| Paired t-Test | Before–after comparison | Related samples |
E. Non-Parametric Test
Chi-Square (χ²) Test:
Used for testing independence or goodness of fit.
Example: Relationship between gender and brand preference.
🟩 9. CORRELATION AND REGRESSION
A. Correlation
Measures strength and direction of linear relationship.
Karl Pearson’s Coefficient (r):
Range: -1 to +1
r = +1 → Perfect positive
r = -1 → Perfect negative
B. Rank Correlation (Spearman’s ρ)
Used when data are ordinal (ranks).
C. Regression Analysis
Used to predict value of dependent variable (Y) from independent variable (X).
Simple Linear Regression:
where
Multiple Regression:
🟩 10. OPERATIONS MANAGEMENT
A. Concept
Operations Management deals with conversion of inputs (materials, labour, capital, information) into outputs (goods/services) efficiently.
B. Functions
-
Product design & process selection
-
Plant layout and facility location
-
Capacity planning
-
Scheduling and inventory control
-
Maintenance and quality management
C. Objectives
-
Improve productivity
-
Optimize resources
-
Ensure quality and timely delivery
-
Minimize cost
🟩 11. FACILITY LOCATION AND LAYOUT
Facility Location
The strategic decision of choosing where to situate production or service facilities.
Quantitative Methods:
-
Centre of Gravity Method: minimizes transport cost
-
Break-even Analysis: compares fixed and variable cost by site
Plant Layout
Arrangement of machines, departments, or work areas.
| Type | Features | Example |
|---|---|---|
| Product Layout | Line flow, high volume | Automobile plant |
| Process Layout | Functional grouping | Hospitals |
| Fixed Position |
Product remains stationary |
Shipbuilding |
| Cellular Layout | Hybrid for efficiency | Electronics plant |
🟩 12. ENTERPRISE RESOURCE PLANNING (ERP)
A. Concept
ERP integrates core business functions through a central database.
ERP Modules:
-
Finance
-
HR
-
Production
-
SCM
-
CRM
B. ERP Implementation Steps
-
Project planning
-
Requirement analysis
-
System design & customization
-
Data migration
-
Training
-
Testing & Go-live
Benefits: Integration, transparency, faster reporting.
Challenges: High cost, change resistance, data migration errors.
🟩 13. PRODUCTION SCHEDULING AND CONTROL
A. Loading: Assigning jobs to machines.
B. Scheduling: Determining when and in what sequence jobs are processed.
C. Sequencing: Prioritizing jobs (rules: FCFS, SPT, EDD).
D. Monitoring: Tracking progress, revising schedules.
🟩 14. QUALITY MANAGEMENT
A. Quality Concepts
Quality = fitness for purpose.
Quality management ensures that output meets customer expectations.
B. Statistical Quality Control (SQC)
Uses control charts (mean, range, p-chart, c-chart) to monitor process variation.
C. Total Quality Management (TQM)
An organization-wide philosophy emphasizing continuous improvement and customer satisfaction.
Principles:
-
Customer orientation
-
Continuous improvement (Kaizen)
-
Employee involvement
-
Scientific decision-making
D. Quality Tools
-
Control Charts
-
Fishbone Diagram (Ishikawa)
-
Pareto Analysis (80/20 Rule)
-
Check Sheets, Histograms, Scatter Diagrams
E. Kaizen
Continuous small improvements involving all employees.
F. Benchmarking
Comparing performance with best-in-class organizations.
G. Six Sigma
A disciplined methodology targeting defect reduction to 3.4 defects per million opportunities (DPMO).
Focuses on DMAIC cycle – Define, Measure, Analyze, Improve, Control.
H. ISO 9000 Series
Global quality management system standards (documentation, process control, auditing).
🟩 15. OPERATIONS RESEARCH (OR)
Definition
Operations Research applies scientific and mathematical models to managerial decision-making for optimization of limited resources.
Applications:
-
Production scheduling
-
Inventory control
-
Transportation and distribution
-
Network planning
-
Queuing and service design
A. Transportation Problem
Objective: Minimize total cost of shipping goods.
Methods:
-
Initial solution → North-West Corner, Least Cost, Vogel’s Approximation (VAM).
-
Optimality test → MODI method.
B. Queuing Theory
Studies waiting line systems.
Parameters:
λ = Arrival rate, μ = Service rate
System utilization: ρ = λ / μ
Objective → minimize waiting cost + service cost.
C. Decision Theory
Used when decisions must be made under risk or uncertainty.
Criteria:
-
Maximax (optimistic)
-
Maximin (pessimistic)
-
Minimax regret (Savage)
-
Expected monetary value (probabilistic)
D. PERT / CPM (Project Scheduling)
| Technique | Time Estimate | Nature |
|---|---|---|
| PERT | Probabilistic (a, m, b) | Uncertain projects |
| CPM | Deterministic | Routine projects |
PERT Expected Time:
Variance
Critical Path: Longest path through network; determines project duration.
Slack Time: → available delay time.
