تحلیل‌های داده‌محور و بینش‌های عملیاتی | مقالات تخصصی در حوزه تحلیل داده

Ratio level of measurement represents a number that has a unique and unambiguous zero point, no matter if a whole number or a fraction. For example, the temperature in Kelvin is a ratio variable.

Enroll Now

Let's Go!

19

Interval
Level of
Measurement

Statistical Analysis

"

An interval variable represents a number or an interval. There isn't a unique and unambiguous zero point. For example, degrees in Celsius and Fahrenheit are interval variables.

Enroll Now

Let's Go!

20

Frequency
Distribution
Table

Statistical Analysis

"

A table showing the frequency of each variable.

Enroll Now

Let's Go!

21

Frequency

Statistical Analysis

"

The number of times a particular value or category occurs in a dataset.

Enroll Now

Let's Go!

22

Absolute
Frequency

Statistical Analysis

"

Measures the number of occurrences of a variable.

Enroll Now

Let's Go!

23

Relative
Frequency

Statistical Analysis

"

Measures the relative number of occurrences of a variable. Usually, expressed in percentages.

Enroll Now

Let's Go!

24

Cumulative
Frequency

Statistical Analysis

"

The sum of the relative frequencies of all members in a dataset up to a certain point. The cumulative frequency of all members is 100% or 1.

Enroll Now

Let's Go!

25

Pareto
Diagram

Statistical Analysis

"

A type of bar chart where frequencies are shown in descending order. There is an additional line on the chart, showing the cumulative frequency.

Enroll Now

Let's Go!

26

Histogram

Statistical Analysis

"

A type of bar chart that represents numerical data. It is divided into intervals (or bins) that are not overlapping and span from the first observation to the last. The intervals (bins) are adjacent - where one stops, the other starts.

Enroll Now

Let's Go!

27

Cross or
Contingency
Table

Statistical Analysis

"

A table in a matrix format that displays the frequency distribution of the variables.

Enroll Now

Let's Go!

28

Bins
(Histogram)

Statistical Analysis

"

The intervals that are represented in a histogram.

Enroll Now

Let's Go!

29

Scatter
Plot

Statistical Analysis

"

A plot that represents numerical data. Graphically, each observation looks like a point on the scatter plot.

Enroll Now

Let's Go!

30

Measures of
Central
Tendency

Statistical Analysis

"

The arithmetic average of all data points in a dataset.

Enroll Now

Let's Go!

31

Mean

Statistical Analysis

"

A characteristic or attribute that can take on different values or categories. E.g. height, occupation, age etc.

Enroll Now

Let's Go!

32

Median

Statistical Analysis

"

The middle number in a data set sorted in ascending or descending order.

Enroll Now

Let's Go!

33

Mode

Statistical Analysis

"

The value that occurs most frequently in the dataset. A dataset can have one mode (unimodal), more than one mode (multimodal), or no mode at all.

Enroll Now

Let's Go!

34

Skewness

Statistical Analysis

"

A measure which indicates whether the observations in a dataset are concentrated on one side.

Enroll Now

Let's Go!

35

Sample
Formula

Statistical Analysis

"

Sample Formula

A formula that is calculated on a sample. The value obtained is a statistic.

Enroll Now

Let's Go!

36

Population
Formula

Statistical Analysis

"

A formula that is calculated on a population. The value obtained is a parameter.

Enroll Now

Let's Go!

37

Measures of
Variability

Statistical Analysis

"

Measures that describe the data through the level of dispersion (variability). The most common ones are variance and standard deviation.

Enroll Now

Let's Go!

38

Variance

Statistical Analysis

"

Measures the dispersion of the dataset around its mean. It is measured in units squared. Denoted σ2 for a population and s2 for a sample.

Enroll Now

Let's Go!

39

Standard
Deviation

Statistical Analysis

"

Measures the dispersion of the dataset around its mean. It is measured in original units. Denoted σ for a population and s for a sample.

Enroll Now

Let's Go!

40

Coefficient
of
Variation

Statistical Analysis

"

Measures the dispersion of the dataset around its mean. The coefficient of variation is unitless. Therefore, it is useful when comparing the dispersion across different datasets that have different units of measurement.

Enroll Now

Let's Go!

41

Univariate
Measure

Statistical Analysis

"

Univariate measure refers to the summary of a dataset that includes multiple categories of variables.

Enroll Now

Let's Go!

42

Multivariate
Measure

Statistical Analysis

"

A measure which refers to multiple variables.

Enroll Now

Let's Go!

43

Covariance

Statistical Analysis

"

A statistical measure that quantifies the degree to which two random variables in a dataset change together. Usually, because of its scale of measurement, covariance is not directly interpretable.

Enroll Now

Let's Go!

44

Linear
Correlation
Coefficient

Statistical Analysis

"

A measure of of the strength and direction of a linear relationship relationship between two variables. Very useful for direct interpretation as it takes on values from [-1,1]. Denoted ρxy for a population and rxy for a sample.

Enroll Now

Let's Go!

45

Correlation

Statistical Analysis

"

A statistical measure that describes the extent to which two variables change together. There are several ways to compute it, the most common being the linear correlation coefficient.

Enroll Now

Let's Go!

46

Distribution

Statistical Analysis

"

A function that shows the possible values for a variable and the probability of their occurrence.

Enroll Now

Let's Go!

47

Bell
Curve

Statistical Analysis

"

A common name for the normal distribution.

Enroll Now

Let's Go!

48

Normal
Distribution

Statistical Analysis

"

A continuous, symmetric probability distribution that is completely described by its mean and its variance. Also known as the Gaussian distribution or bell curve.

Enroll Now

Let's Go!

49

Gaussian
Distribution

Statistical Analysis

"

The original name of the normal distribution. Named after the famous mathematician Gauss, who was the first to explore it through his work on the Gaussian function.

Enroll Now

Let's Go!

50

Standard
Normal
Distribution

Statistical Analysis

"

A normal distribution with a mean of 0, and a standard deviation of 1

Enroll Now

Let's Go!

51

z-statistic

Statistical Analysis

"

The cumulative frequency of a data value in a frequency distribution.

Enroll Now

Let's Go!

52

Standardized
Variable

Statistical Analysis

"

A variable which has been standardized using the z-score formula - by first subtracting the mean and then dividing by the standard deviation.

Enroll Now

Let's Go!

53

What does
the Central
Limit
Theorem
state?

Statistical Analysis

"

The sampling distribution will approximate a normal distribution as the sample size increases. In general, a sample of at least 30 is often considered sufficient for the theorem to hold.

Enroll Now

Let's Go!

54

Sampling
Distribution

Statistical Analysis

"

The probability distribution of a given statistic (like the mean or variance) based on all possible samples of a fixed size from a population.

Enroll Now

Let's Go!

55

Standard
Error

Statistical Analysis

"

The standard deviation of the sampling distribution, which reflects the variability of sample means. It accounts for the sample size, with larger samples generally having smaller standard errors.

Enroll Now

Let's Go!

56

Estimator

Statistical Analysis

"

Estimations we make according to a function or rule.

Enroll Now

Let's Go!

57

Estimate

Statistical Analysis

"

The particular value that was estimated through an estimator.

Enroll Now

Let's Go!

58

Bias

Statistical Analysis

"

The difference between an estimator's expected value and the true population parameter.

Enroll Now

Let's Go!

59

Efficiency
(Estimators)

Statistical Analysis

"

Refers to an estimator's variability. An efficient estimator has minimal variability compared to others.

Enroll Now

Let's Go!

60

Point
Estimator

Statistical Analysis

"

A function or a rule, according to which we make estimations that will result in a single number.

Enroll Now

Let's Go!

61

Point
Estimate

Statistical Analysis

"

The specific numerical value obtained from a point estimator.

Enroll Now

Let's Go!

62

Interval
Estimator

Statistical Analysis

"

A function or a rule, according to which we make estimations that will result in an interval.

Enroll Now

Let's Go!

63

Interval
Estimate

Statistical Analysis

"

The categorization of data into discrete groups based on their attributes.

Enroll Now

Let's Go!

64

Confidence
Interval

Statistical Analysis

"

A confidence interval is the range within which you expect the population parameter to be. You have a certain probability of it being correct, equal to the significance level.

Enroll Now

Let's Go!

65

Reliability
Factor

Statistical Analysis

"

A singular metric that captures the entire variance of a dataset.

Enroll Now

Let's Go!

66

Level
of
Confidence

Statistical Analysis

"

The probability that the population parameter lies within a given confidence interval. Denoted 1 - α.

Enroll Now

Let's Go!

67

Critical
Value

Statistical Analysis

"

A threshold value from a statistical table (z, t, F, etc.) associated with a chosen significance level.

Enroll Now

Let's Go!

68

z-table

Statistical Analysis

"

A table showing values of the Z-statistic for various probabilities under the standard normal distribution.

Enroll Now

Let's Go!

69

t-statistic

Statistical Analysis

"

A statistic that is generally associated with the Student's T distribution, in the same way the z-statistic is associated with the normal distribution.

Enroll Now

Let's Go!

70

t-table

Statistical Analysis

"

A table showing t-statistic values for given probabilities and degrees of freedom.

Enroll Now

Let's Go!

71

Degrees
of
Freedom

Statistical Analysis

"

The number of values in a statistical calculation that are free to vary without violating the data's constraints.

Enroll Now

Let's Go!

72

Margin
of
Error

Statistical Analysis

"

The range within which the true population parameter is likely to lie, given a specific confidence level. Often expressed as a percentage of the estimate itself.

Enroll Now

Let's Go!

73

Hypothesis

Statistical Analysis

"

A testable proposition or assumption about a population parameter.

Enroll Now

Let's Go!

74

Hypothesis
Test

Statistical Analysis

"

A test that is conducted in order to verify if a hypothesis is true or false.

Enroll Now

Let's Go!

75

Null
Hypothesis

Statistical Analysis

"

A default hypothesis for testing. Whenever we are conducting a test, we are trying to reject the null hypothesis.

Enroll Now

Let's Go!

76

Alternative
Hypothesis

Statistical Analysis

"

The hypothesis that contradicts the null hypothesis. It represents the researcher's claim.

Enroll Now

Let's Go!

77

To Accept a
Hypothesis

Statistical Analysis

"

The statistical evidence shows that the hypothesis is likely to be true.

Enroll Now

Let's Go!

78

To Reject a Hypothesis

Statistical Analysis

"

The statistical evidence shows that the hypothesis is likely to be false.

Enroll Now

Let's Go!

79

One-Tailed
(One-Sided)
Test

Statistical Analysis

"

A test that examines if a parameter is greater than or less than a specified value. In a one-tailed test, the alternative hypothesis focuses on a specific difference (higher than, lower than, or equal to).

Enroll Now

Let's Go!

80

Two-Tailed
(Two-Sided)
Test

Statistical Analysis

"

A test that examines if a value is different (or equal) from a specified value. A two-tailed test considers the possibility of a difference in either direction from the null hypothesis.

Enroll Now

Let's Go!

81

Significance
Level

Statistical Analysis

"

The probability of rejecting the null hypothesis when it's true. Denoted α. You choose the significance level. All else equal, the lower the level, the better the test.

Enroll Now

Let's Go!

82

Rejection
Region

Statistical Analysis

"

The part of the distribution, for which we would reject the null hypothesis.

Enroll Now

Let's Go!

83

Type I Error
(False Positive)

Statistical Analysis