01
Population
Statistical Analysis

The entire set of items or individuals of interest in a study. Its size is denoted by N.

02
Sample

A subset selected from the larger population. Its size is denoted by n.

03
Parameter

A numerical value that describes a characteristic of the entire population. Its sample counterpart is a statistic.

04
Statistic

A numerical value that describes a characteristic of a sample and is used to estimate a population parameter. It is the sample counterpart of a parameter.

05
Random Sample

A sample in which every member of the population has an equal chance of being selected.
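
A minimal sketch of drawing a random sample, assuming a made-up population of 100 numbered members. The population, the sample size, and the seed are illustrative choices, not part of the definition.

```python
import random

# Hypothetical population of N = 100 numbered members (illustrative only).
population = list(range(1, 101))

# Seeded generator, used here only so the sketch is reproducible.
rng = random.Random(42)

# Draw n = 10 members without replacement; each member is equally likely
# to be selected.
sample = rng.sample(population, k=10)
print(sample)
```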

06
Representative Sample

A sample that accurately mirrors the characteristics of the larger population.

07
Variable

A characteristic or attribute that can take on different values or categories. Examples: height, occupation, age.

08
Type of Data

The classification of data based on its nature. There are two types of data - categorical and numerical.

09
Categorical Data

Data that represents categories or labels without inherent numerical value.

10
Numerical Data

Data that represents quantifiable amounts or values. Can be further classified into discrete and continuous.

11
Discrete Data

Numerical data that can only take on specific, distinct values. Opposite of continuous.

12
Continuous Data

Numerical data that can take any value within a range, so its possible values cannot be counted. Examples: height, weight, time. Opposite of discrete.

13
Levels of Measurement

A way to classify data by how it can be measured. The levels fall into two groups - qualitative (nominal and ordinal) and quantitative (interval and ratio).

14
Qualitative Data

One of the two groups of levels of measurement. It covers two levels - nominal and ordinal.

15
Quantitative Data

One of the two groups of levels of measurement. It covers two levels - interval and ratio.

16
Nominal Level of Measurement

Nominal level of measurement refers to variables that describe different categories or names. These categories cannot be put in any specific order.

17
Ordinal Level of Measurement

Ordinal level of measurement refers to variables that describe different categories, and they can be ordered.

18
Ratio Level of Measurement

Ratio level of measurement refers to numerical variables that have a unique and unambiguous (true) zero point, whether the values are whole numbers or fractions. For example, temperature in Kelvin is a ratio variable.

19
Interval Level of Measurement

Interval level of measurement refers to numerical variables that have no unique and unambiguous (true) zero point. Differences between values are meaningful, but ratios are not. For example, temperatures in degrees Celsius and Fahrenheit are interval variables.

20
Frequency Distribution Table

A table showing the frequency of each value or category of a variable.

21
Frequency

The number of times a particular value or category occurs in a dataset.

22
Absolute Frequency

The number of occurrences of a value or category of a variable.

23
Relative Frequency

The share of all observations that take a given value or category: the absolute frequency divided by the total number of observations. Usually expressed as a percentage.

24
Cumulative Frequency

The sum of the relative frequencies of all members in a dataset up to a certain point. The cumulative frequency of all members is 100% or 1.
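
The frequency terms above can be sketched as a small frequency distribution table. This is a minimal illustration on made-up data; the function name `frequency_table` and the sample observations are assumptions introduced here.

```python
from collections import Counter

# Hypothetical categorical observations (illustrative data only).
observations = ["red", "blue", "red", "green", "red", "blue"]

def frequency_table(values):
    """Return rows of (category, absolute, relative, cumulative relative)."""
    counts = Counter(values)
    total = len(values)
    rows = []
    cumulative = 0.0
    # most_common() lists categories by descending frequency, which is also
    # the ordering a Pareto diagram uses.
    for category, absolute in counts.most_common():
        relative = absolute / total
        cumulative += relative
        rows.append((category, absolute, relative, cumulative))
    return rows

table = frequency_table(observations)
for category, absolute, relative, cumulative in table:
    print(f"{category:<6} {absolute:>3} {relative:>7.1%} {cumulative:>7.1%}")
```

The last cumulative value should come out at 100%, up to floating-point rounding.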

25
Pareto Diagram

A type of bar chart where frequencies are shown in descending order. There is an additional line on the chart, showing the cumulative frequency.

26
Histogram

A type of bar chart that represents numerical data. It is divided into intervals (or bins) that are not overlapping and span from the first observation to the last. The intervals (bins) are adjacent - where one stops, the other starts.

27
Cross or Contingency Table

A table in a matrix format that displays the joint frequency distribution of two or more variables.

28
Bins (Histogram)

The intervals that are represented in a histogram.
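
As a rough sketch of how histogram bins work, the function below splits made-up numerical data into equal-width, adjacent, non-overlapping intervals and counts the observations in each. Equal-width bins are one common choice, assumed here; the name `histogram_counts` is hypothetical, and the data must contain at least two distinct values.

```python
def histogram_counts(values, num_bins):
    """Count observations in equal-width bins spanning min(values)..max(values)."""
    lowest, highest = min(values), max(values)
    width = (highest - lowest) / num_bins  # assumes highest > lowest
    counts = [0] * num_bins
    for value in values:
        # The largest observation would land one past the end; put it in the last bin.
        index = min(int((value - lowest) / width), num_bins - 1)
        counts[index] += 1
    # Bin edges: where one bin stops, the next one starts.
    edges = [lowest + i * width for i in range(num_bins + 1)]
    return edges, counts

data = [1, 2, 2, 3, 5, 6, 8, 9, 10, 10]  # hypothetical observations
edges, counts = histogram_counts(data, num_bins=3)
print(edges)
print(counts)
```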

29
Scatter Plot

A plot of two numerical variables. Each observation appears as a point whose coordinates are its values on the two variables.

30
Measures of Central Tendency

Measures that describe the data through a single central or typical value. The most common ones are the mean, median, and mode.

31
Mean

The arithmetic average of all data points in a dataset.

32
Median

The middle number in a dataset sorted in ascending or descending order. With an even number of observations, it is the average of the two middle numbers.

33
Mode

The value that occurs most frequently in the dataset. A dataset can have one mode (unimodal), more than one mode (multimodal), or no mode at all.
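
A short sketch of the three measures of central tendency, using Python's standard `statistics` module on made-up data. The datasets are illustrative assumptions.

```python
import statistics

data = [2, 3, 3, 5, 7, 10]  # hypothetical dataset

mean = statistics.mean(data)        # arithmetic average: 30 / 6
median = statistics.median(data)    # even count: average of the middle pair 3 and 5
modes = statistics.multimode(data)  # list of the most frequent value(s)

print(mean, median, modes)

# A dataset with two equally frequent values has two modes (multimodal).
two_modes = statistics.multimode([1, 1, 2, 2, 3])
print(two_modes)
```

Note that `multimode` returns every value when all of them occur equally often, so detecting "no mode at all" would need an extra check on the counts.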

34
Skewness

A measure that indicates whether the observations in a dataset are concentrated on one side of the distribution.

35
Sample Formula

A formula that is calculated on a sample. The value obtained is a statistic.

36
Population Formula

A formula that is calculated on a population. The value obtained is a parameter.

37
Measures of Variability

Measures that describe the data through the level of dispersion (variability). The most common ones are variance and standard deviation.

38
Variance

Measures the dispersion of the dataset around its mean. It is measured in units squared. Denoted σ² for a population and s² for a sample.

39
Standard Deviation

Measures the dispersion of the dataset around its mean. It is measured in original units. Denoted σ for a population and s for a sample.

40
Coefficient of Variation

Measures the dispersion of the dataset around its mean. The coefficient of variation is unitless. Therefore, it is useful when comparing the dispersion across different datasets that have different units of measurement.
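
The measures of variability above can be sketched with the standard `statistics` module. The data are made up; the population formulas divide by N, while the sample formulas divide by n - 1.

```python
import math
import statistics

data = [4.0, 8.0, 6.0, 5.0, 7.0]  # hypothetical observations, mean 6

# Population formulas (treat the data as the whole population): divide by N.
population_variance = statistics.pvariance(data)  # sigma squared
population_sd = statistics.pstdev(data)           # sigma

# Sample formulas (treat the data as a sample): divide by n - 1.
sample_variance = statistics.variance(data)  # s squared
sample_sd = statistics.stdev(data)           # s

# Coefficient of variation: standard deviation relative to the mean (unitless).
coefficient_of_variation = sample_sd / statistics.mean(data)

print(population_variance, sample_variance)
print(round(sample_sd, 4), round(coefficient_of_variation, 4))
```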

41
Univariate Measure

A measure which refers to a single variable.

42
Multivariate Measure

A measure which refers to multiple variables.

43
Covariance

A statistical measure that quantifies the degree to which two random variables in a dataset change together. Usually, because of its scale of measurement, covariance is not directly interpretable.

44
Linear Correlation Coefficient

A measure of the strength and direction of a linear relationship between two variables. Very useful for direct interpretation, as it takes on values in the interval [-1, 1]. Denoted ρxy for a population and rxy for a sample.

45
Correlation

A statistical measure that describes the extent to which two variables change together. There are several ways to compute it, the most common being the linear correlation coefficient.
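
A minimal sketch of sample covariance and the linear correlation coefficient, written out by hand on made-up data. The helper names are hypothetical. Rescaling one variable changes the covariance but not the correlation, which is why the latter is easier to interpret.

```python
import math

def sample_covariance(x, y):
    """Sample covariance of two equally long sequences (divides by n - 1)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)

def sample_correlation(x, y):
    """Linear correlation coefficient: covariance scaled by both standard deviations."""
    sd_x = math.sqrt(sample_covariance(x, x))
    sd_y = math.sqrt(sample_covariance(y, y))
    return sample_covariance(x, y) / (sd_x * sd_y)

x = [1, 2, 3, 4, 5]              # hypothetical variable
y = [2, 4, 6, 8, 10]             # hypothetical variable, perfectly linear in x
y_rescaled = [v * 100 for v in y]  # same variable in units 100 times smaller

cov_xy = sample_covariance(x, y)
cov_rescaled = sample_covariance(x, y_rescaled)
corr_xy = sample_correlation(x, y)
corr_rescaled = sample_correlation(x, y_rescaled)
print(cov_xy, cov_rescaled, corr_xy, corr_rescaled)
```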

46
Distribution

A function that shows the possible values for a variable and the probability of their occurrence.

47
Bell Curve

A common name for the normal distribution.

48
Normal Distribution

A continuous, symmetric probability distribution that is completely described by its mean and its variance. Also known as the Gaussian distribution or bell curve.

49
Gaussian Distribution

Another name for the normal distribution. Named after the mathematician Carl Friedrich Gauss, who studied it in his work on the Gaussian function.

50
Standard Normal Distribution

A normal distribution with a mean of 0 and a standard deviation of 1.

51
z-statistic

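A statistic that is generally associated with the standard normal distribution. It is obtained by standardizing: subtracting the mean and dividing by the standard deviation (or standard error).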

52
Standardized Variable

A variable which has been standardized using the z-score formula - by first subtracting the mean and then dividing by the standard deviation.
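
Standardization can be sketched as follows, on made-up data. The function name `standardize` is an assumption, and the population standard deviation is used here; a sample standard deviation would work the same way.

```python
import statistics

def standardize(values):
    """Apply the z-score formula: subtract the mean, divide by the standard deviation."""
    mean = statistics.mean(values)
    standard_deviation = statistics.pstdev(values)  # population formula, assumed here
    return [(value - mean) / standard_deviation for value in values]

raw = [10, 20, 30, 40, 50]  # hypothetical observations
standardized = standardize(raw)

# After standardization the variable has mean 0 and standard deviation 1.
print([round(z, 3) for z in standardized])
```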

53
What does the Central Limit Theorem state?

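The sampling distribution of the sample mean will approximate a normal distribution as the sample size increases, regardless of the shape of the population distribution (provided its variance is finite). As a rule of thumb, a sample size of at least 30 is often considered sufficient for the approximation to be reasonable.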

54
Sampling Distribution

The probability distribution of a given statistic (like the mean or variance) based on all possible samples of a fixed size from a population.

55
Standard Error

The standard deviation of the sampling distribution of a statistic. For the sample mean, it equals the standard deviation divided by the square root of the sample size, so larger samples generally have smaller standard errors.
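
The Central Limit Theorem, the sampling distribution, and the standard error can be illustrated with a small simulation. This is a sketch under assumptions chosen here: a skewed exponential population with mean 1 and standard deviation 1, samples of size 30, 2000 repetitions, and a fixed seed for reproducibility.

```python
import math
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

SAMPLE_SIZE = 30    # n, the rule-of-thumb size
NUM_SAMPLES = 2000  # number of repeated samples (arbitrary choice)

# Draw many samples from a skewed population and record each sample mean.
sample_means = []
for _ in range(NUM_SAMPLES):
    sample = [random.expovariate(1.0) for _ in range(SAMPLE_SIZE)]
    sample_means.append(statistics.fmean(sample))

# The sample means approximate the sampling distribution of the mean.
center = statistics.fmean(sample_means)  # should be close to the population mean, 1
spread = statistics.stdev(sample_means)  # empirical standard error

# Theoretical standard error: population standard deviation / sqrt(n).
theoretical_se = 1.0 / math.sqrt(SAMPLE_SIZE)

print(round(center, 3), round(spread, 3), round(theoretical_se, 3))
```

A histogram of `sample_means` should look roughly bell-shaped even though the population itself is strongly skewed.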

56
Estimator

A function or a rule according to which we make estimations.

57
Estimate

The particular value that was estimated through an estimator.

58
Bias

The difference between an estimator's expected value and the true population parameter.

59
Efficiency (Estimators)

Refers to an estimator's variability. An efficient estimator has minimal variability compared to others.

60
Point Estimator

A function or a rule, according to which we make estimations that will result in a single number.

61
Point Estimate

The specific numerical value obtained from a point estimator.

62
Interval Estimator

A function or a rule, according to which we make estimations that will result in an interval.

63
Interval Estimate

The specific interval obtained from an interval estimator.

64
Confidence Interval

A confidence interval is the range within which you expect the population parameter to lie. The confidence attached to it is the level of confidence, 1 - α, where α is the significance level.

65
Reliability Factor

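The critical value (for example, from the z-table or t-table) that multiplies the standard error when constructing a confidence interval: point estimate ± reliability factor × standard error.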

66
Level of Confidence

The probability that the population parameter lies within a given confidence interval. Denoted 1 - α.

67
Critical Value

A threshold value from a statistical table (z, t, F, etc.) associated with a chosen significance level.

68
z-table

A table showing values of the Z-statistic for various probabilities under the standard normal distribution.

69
t-statistic

A statistic that is generally associated with the Student's t-distribution, in the same way the z-statistic is associated with the standard normal distribution.

70
t-table

A table showing t-statistic values for given probabilities and degrees of freedom.

71
Degrees of Freedom

The number of values in a statistical calculation that are free to vary without violating the data's constraints.

72
Margin of Error

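Half the width of a confidence interval: the critical value multiplied by the standard error. At the given confidence level, the true population parameter is likely to lie within the point estimate plus or minus the margin of error. Often expressed as a percentage.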
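
The confidence interval terms above can be tied together in a short sketch. It assumes a z-based interval for a mean with a known population standard deviation; the function name `z_confidence_interval`, the data, and the value 1.5 for the population standard deviation are all illustrative assumptions.

```python
import math
from statistics import NormalDist, fmean

def z_confidence_interval(sample, population_sd, confidence=0.95):
    """z-based confidence interval for the mean, population_sd assumed known."""
    alpha = 1 - confidence  # significance level
    # Critical value of the standard normal distribution at 1 - alpha / 2.
    critical_value = NormalDist().inv_cdf(1 - alpha / 2)
    standard_error = population_sd / math.sqrt(len(sample))
    margin_of_error = critical_value * standard_error
    point_estimate = fmean(sample)
    return point_estimate - margin_of_error, point_estimate + margin_of_error, margin_of_error

sample = [4.0, 8.0, 6.0, 5.0, 7.0, 6.0, 5.0, 7.0, 6.0]  # hypothetical, n = 9, mean 6
lower, upper, margin = z_confidence_interval(sample, population_sd=1.5)
print(round(lower, 3), round(upper, 3), round(margin, 3))
```

When the population standard deviation is unknown, a critical value from the t-distribution would typically replace the z value; the standard library has no t-distribution, so that case is left out of this sketch.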

73
Hypothesis

A testable proposition or assumption about a population parameter.

74
Hypothesis Test

A procedure that uses sample data to decide whether there is enough statistical evidence to reject a hypothesis.

75
Null Hypothesis

A default hypothesis for testing. Whenever we are conducting a test, we are trying to reject the null hypothesis.

76
Alternative Hypothesis

The hypothesis that contradicts the null hypothesis. It represents the researcher's claim.

77
To Accept a Hypothesis

The statistical evidence is not strong enough to reject the hypothesis. Strictly speaking, we fail to reject it rather than show that it is true.

78
To Reject a Hypothesis

The statistical evidence shows that the hypothesis is likely to be false.

79
One-Tailed (One-Sided) Test

A test that examines if a parameter is greater than or less than a specified value. In a one-tailed test, the alternative hypothesis focuses on a difference in one specific direction (higher than or lower than).

80
Two-Tailed (Two-Sided) Test

A test that examines if a parameter is different from a specified value. A two-tailed test considers the possibility of a difference in either direction from the value stated in the null hypothesis.

81
Significance Level

The probability of rejecting the null hypothesis when it is true. Denoted α. You choose the significance level. All else equal, a lower level reduces the chance of a Type I error but increases the chance of a Type II error.

82
Rejection Region

The part of the distribution for which we would reject the null hypothesis.

83
Type I Error (False Positive)

Rejecting a null hypothesis that is true. The probability of committing it is α, the significance level.

84
Type II Error (False Negative)

Accepting a null hypothesis that is false. The probability of committing it is β.

85
Power of the Test

The probability of correctly rejecting a false null hypothesis (the researcher's goal). Denoted by 1 - β.
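
How α, β, and power relate can be sketched for one simple case. The example assumes an upper-tailed z-test of the mean with a known population standard deviation and a made-up true mean; the function name `power_one_tailed_z` and all numbers are illustrative assumptions.

```python
import math
from statistics import NormalDist

def power_one_tailed_z(mu0, mu_true, population_sd, n, alpha=0.05):
    """Power of an upper-tailed z-test of H0: mu = mu0 against H1: mu > mu0."""
    std_normal = NormalDist()
    standard_error = population_sd / math.sqrt(n)
    # Reject H0 when the z-statistic exceeds this critical value.
    critical_value = std_normal.inv_cdf(1 - alpha)
    # How far the true mean sits from mu0, in standard errors.
    shift = (mu_true - mu0) / standard_error
    # beta: probability of not rejecting H0 although the true mean is mu_true.
    beta = std_normal.cdf(critical_value - shift)
    return 1 - beta, beta

power, beta = power_one_tailed_z(mu0=5.0, mu_true=6.0, population_sd=1.5, n=9)
print(round(power, 3), round(beta, 3))
```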

86
z-score

A value indicating how many standard deviations an element is from the mean.

87
μ0

The hypothesized population mean: the value of the mean stated in the null hypothesis.

88
p-value

The smallest significance level at which the null hypothesis can be rejected based on the observed data.
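
A minimal sketch of a two-tailed z-test that reports the p-value. It assumes a known population standard deviation; the function name `two_tailed_z_test` and the numbers are made up for illustration.

```python
import math
from statistics import NormalDist

def two_tailed_z_test(sample_mean, mu0, population_sd, n, alpha=0.05):
    """Return the z-statistic, the p-value, and whether H0: mu = mu0 is rejected."""
    standard_error = population_sd / math.sqrt(n)
    z = (sample_mean - mu0) / standard_error
    # Two-tailed p-value: probability of a z-statistic at least this extreme
    # in either direction, if the null hypothesis is true.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value, p_value < alpha

z, p_value, reject = two_tailed_z_test(sample_mean=6.0, mu0=5.0, population_sd=1.5, n=9)
print(round(z, 2), round(p_value, 4), reject)
```

The null hypothesis is rejected whenever the chosen significance level is above the p-value, which matches the definition of the p-value as the smallest such level.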