A sampling distribution arises when repeated samples of the same size are drawn from a particular population (distribution) and a statistic (numerical measure of description of sample data, e.g. a mean, variance or proportion) is calculated for each sample
Sampling distributions arise in the context of statistical inference i.e. when statements are made about a population on the basis of random samples drawn from it
If X1, X2, ..., Xn are a random sample of size n drawn from a population (with any distribution) with a population mean μ and variance σ^2, then for a sufficiently large n, the mean of the sample (X-bar) will be approximately normally distributed with a mean μ and a variance σ^2/n
The size of n depends on the distribution of the population: for a normal distribution, the CLT holds for any value of n; for an 'almost' normal distribution, n should be larger than 30; if the distribution is substantially different from normal, a much larger value of n will be needed for the CLT to hold
The basis of many statistical inference methods (hypothesis tests, confidence intervals, statistical models) is formed from the normal distribution, hence such methods require normality (an assumption is that the underlying population is normal)
When the assumption of normality is not met, these methods will not be accurate, and other methods such as non-parametric methods or machine learning methods that do not require normality can be considered
An interval estimate is more appropriate and useful than a point estimate, since a point estimate can differ each time depending on the sample obtained