Skip to main content
← Back to I Definitions

In the middle

What Is Mean?

The mean, often referred to as the arithmetic mean, is a fundamental concept in statistics and finance that represents the central tendency of a dataset. It is calculated by summing all the values in a set and dividing by the total number of values. This measure provides a single value that summarizes the entire distribution of data points, making it a key tool in quantitative analysis and portfolio theory.

History and Origin

The basic concept of the arithmetic mean can be traced back to Babylonian astronomers around 2000 BCE, who used simple averaging for astronomical predictions. Ancient Egyptians also employed weighted averaging in trade calculations20. The Greek mathematician Pythagoras and his followers, around 500 BC, formalized the concept of three mean values: arithmetic, geometric, and harmonic, often in the context of musical theory and geometry18, 19. However, the modern systematic calculation approach using a decimal system was developed by Al-Khwarizmi between 780-850 CE17.

In the modern era, the arithmetic mean gained prominence in the 17th century as a method for combining multiple observations that should ideally be identical but varied due to measurement errors. For instance, it was used to estimate the direction of magnetic north16. The statistician Churchill Eisenhart extensively traced its history, noting that by 1668, the practice of "taking the mean" was sufficiently established to be mentioned casually in the Transactions of the Royal Society15. Later, mathematicians like Gauss and Legendre established the modern mathematical foundation for the mean through methods such as least squares14. In finance, the mean became a cornerstone with the development of modern portfolio theory (MPT) by Harry Markowitz in his 1952 paper, "Portfolio Selection." Markowitz's work, for which he later received a Nobel Memorial Prize in Economic Sciences, established mean-variance analysis as a framework for optimal investment decisions12, 13.

Key Takeaways

  • The mean is calculated by summing all values in a dataset and dividing by the count of values.
  • It provides a single numerical representation of the central tendency of a dataset.
  • The concept of mean has ancient origins, with systematic calculation methods emerging in the Islamic Golden Age.
  • In finance, the mean is a core component of modern portfolio theory, used to represent expected return.
  • While widely used, the mean can be significantly affected by outliers and skewed distributions.

Formula and Calculation

The formula for the arithmetic mean ((\bar{x})) of a set of (n) observations (x_1, x_2, \ldots, x_n) is:

xˉ=i=1nxin\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}

Where:

  • (\sum x_i) represents the sum of all values in the dataset.
  • (n) represents the total number of values in the dataset.

For example, to calculate the mean of investment returns over several periods, one would sum the individual period returns and divide by the number of periods. This helps in assessing the average return of a portfolio or asset.

Interpreting the Mean

The mean serves as a measure of the typical or average value within a dataset. In finance, when discussing the mean, it often refers to the expected return of an investment or portfolio. A higher mean typically indicates a better average performance. However, interpreting the mean requires context. For instance, a high mean return on an investment might be accompanied by high volatility, suggesting greater risk.

It is crucial to consider the distribution of the data when interpreting the mean. For symmetrical distributions, the mean, median, and mode tend to be similar, offering a clear picture of central tendency. However, in skewed distributions, the mean can be pulled towards the longer tail, potentially misrepresenting the typical value. For example, if a dataset contains a few extremely large values (positive outliers), the mean will be higher than most other values, making the average appear inflated10, 11.

Hypothetical Example

Consider an investment portfolio with the following annual returns over five years:

  • Year 1: 10%
  • Year 2: 5%
  • Year 3: 15%
  • Year 4: -2% (a loss)
  • Year 5: 8%

To calculate the mean annual return:

Sum of returns = (10% + 5% + 15% + (-2%) + 8% = 36%)
Number of years = (5)

Mean annual return = (\frac{36%}{5} = 7.2%)

In this hypothetical example, the mean annual return of the portfolio is 7.2%. This figure gives an investor a quick way to understand the historical average performance of their investment portfolio.

Practical Applications

The mean is extensively used across various financial domains:

  • Investment Analysis: Investors and analysts use the mean to calculate the average historical returns of stocks, bonds, mutual funds, or other asset classes. This helps in evaluating past performance and setting expectations for future returns. For example, Reuters often reports the average weekly inflows into global equity funds8, 9.
  • Risk Management: In the context of Modern Portfolio Theory, the mean (expected return) is combined with measures of risk, such as standard deviation, to construct efficient portfolios that offer the highest expected return for a given level of risk.
  • Economic Indicators: Many economic statistics, such as per capita income or average household income, rely on the mean. Government agencies like the Internal Revenue Service (IRS) publish average tax refund amounts, which utilize the mean to provide a general understanding of tax season outcomes5, 6, 7.
  • Financial Modeling: The mean is a critical input in various financial models, including those used for valuation, budgeting, and forecasting. For instance, in discounted cash flow (DCF) analysis, the mean of projected cash flows might be used.
  • Performance Benchmarking: Financial professionals often compare the mean return of a particular investment or fund against a relevant market index or benchmark to assess its relative performance.

Limitations and Criticisms

Despite its widespread use, the mean has several limitations, particularly in financial analysis, where data can often be skewed or contain outliers.

  • Sensitivity to Outliers: The mean is highly sensitive to outliers – extreme values that are significantly different from the rest of the data. A single exceptionally high or low value can disproportionately pull the mean in its direction, making it an unrepresentative measure of central tendency. 3, 4For example, a few very large incomes in a dataset can significantly inflate the average income, making it seem higher than what most people earn.
  • Skewed Distributions: In financial data, returns and other metrics often exhibit skewness, meaning their distribution is not symmetrical. In such cases, the mean may not accurately reflect the typical value. For instance, a distribution of investment returns might have a long tail of negative returns, pulling the mean lower than what an investor might typically experience. When data is skewed, other measures of central tendency, such as the median, might provide a more accurate representation.
    2* Lack of Context for Variability: The mean alone does not provide any information about the spread or variability of the data. Two datasets can have the same mean but vastly different levels of dispersion, indicating different levels of risk or consistency. This is why the mean is often paired with measures like standard deviation to offer a more complete picture of data characteristics. 1For example, an investment with a high average return but also high variability might be considered riskier than one with a slightly lower average return but much less variability.
  • Misleading for Qualitative Data: While primarily used for quantitative data, applying the mean to data that isn't truly quantitative (e.g., categorical ratings) can lead to meaningless results.

Understanding these limitations is crucial for a comprehensive data analysis and helps in choosing the most appropriate statistical measure.

Mean vs. Median

The mean and the median are both measures of central tendency, but they calculate this "center" differently, making each more suitable for different types of datasets, especially in financial contexts.

The mean (arithmetic average) is calculated by summing all data points and dividing by the total number of points. It considers every value in the dataset, which makes it sensitive to extreme values or outliers. In finance, the mean is frequently used for calculating the average return of an asset or portfolio.

The median, on the other hand, is the middle value in a dataset when the data points are arranged in ascending or descending order. If there is an even number of observations, the median is the average of the two middle values. The median is less affected by outliers and skewed distributions, making it a more robust measure of central tendency for datasets that contain extreme values, such as income distributions or housing prices.

The key difference lies in their sensitivity to extreme values. For example, if you are looking at individual income levels in a country, a few extremely wealthy individuals can significantly inflate the mean income, making it appear much higher than what the majority of the population earns. In this scenario, the median income would likely be a more accurate representation of the "typical" income, as it is not pulled by these high income disparities. Both measures provide valuable insights, but choosing between the mean and the median depends on the specific characteristics of the data and the objective of the analysis.

FAQs

What is the primary purpose of calculating the mean?

The primary purpose of calculating the mean is to determine the average or central value of a set of numerical data. It provides a single number that summarizes the entire dataset, making it easier to compare different datasets or track changes over time.

When is the mean a less reliable measure?

The mean is less reliable when the dataset contains outliers (extreme values) or when the data distribution is highly skewed. In such cases, the mean can be pulled away from the true center of the data, providing a misleading representation.

Can the mean be negative?

Yes, the mean can be negative if the sum of the values in the dataset is negative. This is common in finance when calculating the average of losses or negative returns on an investment. For instance, the average loss on a series of losing trades would be a negative mean.

How does the mean relate to risk in investing?

In investing, the mean is often used to represent the expected return of an investment. However, the mean alone doesn't tell the whole story about risk. It's typically paired with measures of dispersion, such as standard deviation, to understand the potential variability or risk associated with that expected return. A higher mean with lower standard deviation generally indicates a more favorable risk-return profile.

Is the mean the same as the average?

In common usage, "mean" and "average" are often used interchangeably to refer to the arithmetic mean. However, in statistics, "average" is a broader term that can also encompass other measures of central tendency, such as the median and mode. When discussing statistical concepts, "arithmetic mean" is preferred for precision.

LINK_POOL

Anchor TextInternal Link Slug
outliersoutlier
expected returnexpected-return
volatilityvolatility
investment portfolioinvestment-portfolio
asset classesasset-classes
standard deviationstandard-deviation
market indexmarket-index
data analysisdata-analysis
riskrisk
assetasset
portfolioportfolio
income disparitiesincome-disparities
medianmedian
average lossaverage-loss
dispersiondispersion
skewed distributionsskewed-distribution
skewnessskewness
average returnaverage-return