• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
Analytic Strategy Partners

Analytic Strategy Partners

Improve your analytic operations and refine your analytic strategy

  • Home
  • Blog
  • Books
  • About
  • Services
  • Contact Us

economics of inferencing

The Economics of Inferencing: Managing the Long Tail of Data

November 15, 2021 by Robert Grossman

The different steps in the inferencing process and the costs involved.

When developing an analytic strategy, it is important to have a good understanding of the economics of inferencing.

Our conceptional model is that we: 1) collect, purchase or otherwise acquire the data, 2) clean, curate and process the data so that it can be used as the inputs to models and to train models, 3) build the model; 4) deploy the model, and 5) operate the model, using the data as model inputs to to produce outputs of interest. See Figure 1.

To simplify the analysis, we will call the outputs of the model “inferences,” and assign a monetary value to each inference.

At the highest level, the economics of inferencing is driven by two equations: The first equation is the value provided by the inferences. This is the simplest equation and since inferences usually have a fixed value or a value determined by the subsequent actions (such as the value of the sale or value of the sale and subsequent sales over the next month). The value is simply the number of inferences x the value of each inference. This the right side of Figure 1.

The second equation is the cost of the inference. This is the sum of five components. See Table 1 and Figure 1. There is the cost to acquire the data, to clean and process the data, build the model, to deploy the model and to operate the model.

The challenge is that the cost to clean and process the data can have a long tail. In other words, to clean 50% of the data is often fairly easy, to clean 75% much hard, to clean 90% much, much harder, and, so, with the costs to clean the last few percent getting higher and higher.

There is an important difference between how for profit-companies offering a product or service and not-for-profit companies that are mission driven can manage the escalating cost associated with the long tail of data curation. For-profit companies can cut the tail at a sensible operating point, while not-for-profit or mission-driven organizations often have to process more of the tail in oder to satisfy their mission. Here are some example of why the long tail of data curation is often important for mission driven organizations:

  • When cleaning up scientific data for machine learning a high model coverage threshold is important, since an important discovery may be in the long tail
  • When identifying threats in national defense, a high model coverage threshold is important since missing just one threat can have devastating consequences
  • When identifying fires and floods, covering the long tail is critical, since early detection of a fire or flood can result in significant savings of property and sometimes savings in lives.

costs of inferencing = 
   cost to acquire the data                    large & variable
   + cost to clean & curate the data           large & variable
   + cost to build the model                   fixed cost
   + cost to deploy the model                  fixed cost
   + cost to operate the model                 variable, but modest
Table 1. Costs of inferencing

For more about the costs of acquiring and curating data, see Chapter 6 of Developing an AI Strategy: a Primer.

Filed Under: Uncategorized Tagged With: economics of AI, economics of inferencing, economics of machine learning, long tail of data, long tail of data curation

Primary Sidebar

Recent Posts

  • Developing an AI Strategy: Four Points of View
  • Ten Books to Motivate and Jump-Start Your AI Strategy
  • A Rubric for Evaluating New Projects that Produce Data
  • How Does No-Code Impact Your Analytic Strategy?
  • The Different Varieties of Advisors & the Difference it Makes

Recent Comments

    Archives

    • May 2022
    • April 2022
    • March 2022
    • February 2022
    • January 2022
    • December 2021
    • November 2021
    • October 2021
    • September 2021
    • August 2021
    • July 2021
    • June 2021
    • May 2021
    • April 2021
    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • March 2020
    • February 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • June 2019
    • May 2019
    • September 2018

    Categories

    • Uncategorized

    Meta

    • Log in
    • Entries feed
    • Comments feed
    • WordPress.org

    Copyright © 2025