• Twitter
  • LinkedIn
(503) 212-0687 | hello@cascadeinsights.com
Cascade Insights
  • Market Research Services
    • B2B Customer Experience Research
      • Buyer Personas
      • Buyer’s Journey Research
      • Key Buying Criteria Research
      • Jobs-To-Be-Done Research
      • User Personas
      • Customer Satisfaction Research
    • B2B Product/Service Research
      • Market Opportunity Research
      • Concept Testing
      • Go-To-Market Research
    • B2B Marketing Enablement Research
      • Data-Driven Marketing Research
      • Message Testing
      • Brand Research
      • Thought Leadership Services
      • Partner Enablement
    • B2B Sales Enablement Research
      • Competitive Landscape Analysis
      • Win-Loss Research
      • Churn Analysis
      • Channel Research
  • Marketing Services
    • Marketing Strategy
    • Messaging
    • Content Marketing
    • Sales Enablement
  • Insights and Perspectives
    • B2B Market Research Blog
    • B2B Marketing Blog
    • B2B Resources
  • About Us
    • Our Story
    • Our Clients
    • Client Testimonials
    • Careers
    • Ethics Policy
    • Privacy Policy
  • Contact Us
  • Search
  • Menu Menu
Big Data Ethics: Math Responsibly

Big Data Ethics: Math Responsibly

March 22, 2017/in B2B Market Research Blog, Blog Posts /by Sean Campbell
Share this entry
  • Share on Facebook
  • Share on Twitter
  • Share on LinkedIn
  • Share by Mail

Big data can be used to create a powerful supporting argument for nearly anything. Wielded foolishly or maliciously, the potential for harm is great. It has never been more important to use math responsibly.

That’s the core premise of “Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy” by Cathy O’Neil. The book demonstrates the importance of looking critically at big data analytics. Especially now that analytics increasingly sway the decisions made by powerful institutions such as business, technology, and government. We highly recommend this book for anyone who leverages big data and/or works in tech.

This article is based off a B2B Revealed episode.

You can listen to the episode or read the article below.

http://traffic.libsyn.com/competitiveintel/Episode143-BigDataEthicsMathWisely.mp3

B2B Revealed – ON: iTunes, Stitcher, and Google Play

Business Loves Big Data

We have access to more data than we’ve ever had before. Companies of all types have embraced big data. Retail, healthcare, education, political campaigns… and tech.

MarTech

Consider the huge amount of data in an average marketing technology stack.  You can:

  • Track clicks on your site.
  • See which site visitors were on before coming to yours.
  • Determine how often visitors came to your site before clicking “add to cart” or filling out a Contact Us Form.
  • View the content visitors download from your site.
  • Gather exponential amounts of data on social media, content analytics, brand engagement, etc.

Product Teams

Think of the amount of data that cloud services product teams have to consider. A typical SaaS product manager will look at:

  • Sign-in frequency.
  • The percentage of users that take a certain action within the product.
  • How often a customer persona uses certain features.
  • Retention stats on customers.
  • Churn metrics.
  • A customer’s lifetime value.
  • And much, much more.

Market Research

By the nature of the job, market researchers also add to the piles of data company leaders have to consider.  Every in-depth interview, focus group, survey, and usability study adds to the pile.

Things To Consider

You probably have a sense of the breadth of data your company collects and relies on to do business.

It’s less likely that you’ve deeply considered the validity of your data.

Are you confident that your company uses big data ethically? How about sensibly?

Let’s explore.

Bad Data

O’Neil argues that there is a dark side to big data collection. To be clear, O’Neil is not advocating stopping all big data efforts. Instead, she asks us to acknowledge that it comes with both good and bad side effects.

O’Neil is more than qualified to comment on such a complex subject. She has a Ph.D. in math from Harvard. O’Neil has been a Barnard College professor, a quant in the financial industry, a data scientist, and other impressive titles. In addition to authoring 2016 National Book Award nominee “Weapons of Math Destruction,” O’Neil wrote “On Being a Data Skeptic” and co-authored “Doing Data Science: Straight Talk from the Frontlines.”

She gives numerous “weapons of math destruction” examples. The stories cross all different types of businesses, from social media to education, travel to finance, and so on. She also tackles how weapons of math destruction are affecting the political process.

Here are a few of the “weapons of math destruction” that she describes.

Proxy Problems

O’Neil explains how insurance agencies use big data analytics to generate price ranges that vary dramatically from one consumer to another. In many cases, prices are based on proxy data like the neighborhood the person drives through or their credit score. Shockingly, proxy data sometimes holds more sway than the individual’s driving record.

This is quite harmful. Consumers are not graded on merit (driving behavior). Further, consumers may have little to no personal control over the proxy data they’re being graded on.

Insurance companies also leverage big data to develop byzantine pricing structures. Such structures are far too complex for most individual consumers to understand.

As O’Neil puts it in the book, “These pricing tiers are based on how much each group can be expected to pay. Consequently, some receive discounts of up to 90 percent off the average rate, while others face an increase of 800.”

This proxy data may have no real impact on the number of insurance claims filed over a driver’s lifetime. This results in a system that is less than fair.

Narrow Newsfeed

Elsewhere, O’Neil covers how Facebook modifies users’ newsfeeds. Unfortunately, this filtering often culminates in dramatically polarized results. This issue has captured the public’s attention with the proliferation of fake news and various platforms filtering social media feeds.

Rankings Run Amok

Perhaps the most compelling example in the book focuses on college rankings.

Since 1983, students and parents have relied on US News & World Report’s college rankings when choosing colleges. The report was designed to evaluate 1,800 colleges and universities throughout the US and rank them for excellence.

However, there was a problem early in the process.

“They had no direct way to quantify how a four-year process affected one single student, much less tens of millions of them,” O’Neil writes.

Sadly, the US News & World Report didn’t have data on the success rates of college students once they graduated. Surprisingly, they didn’t even know how to define success. Would it mean that they made X amount of money over their lifetime, or that they had a good credit rating, or that they somehow contributed to society?

Since they didn’t have more relevant data, US News & World Report relied on proxy data.

As O’Neil puts it, “They looked at SAT Scores, Student-Teacher ratios, and acceptance rates. They analyzed the percentage of incoming freshman who made it to sophomore year and the percentage of those who graduated. They calculated the percentage of living alumni who contributed money to their alma mater, surmising that if they gave a college money, there was a good chance they appreciated the education there. Three-quarters of the ranking would be produced by an algorithm – an opinion formulized in code – that incorporated these proxies. In the other quarter, they would factor in the subjective views of college officials throughout the country.”

Cheating By Proxy

Colleges have been racing to game the rankings ever since.  O’Neil makes this point quite clearly by examining the actions of Baylor, Texas Christian University, Iona College, and other institutions. (Read the book for these scandalous stories.)

For instance, a reviewer of the US News & World Report found that a Saudi university had contacted a host of highly regarded mathematicians and offered them $72,000 to serve as adjunct faculty.

O’Neil writes that the deal “…stipulated that the mathematicians had to work three weeks a year in Saudi Arabia. The university would fly them there in business class and put them up at a five-star hotel. Conceivably, their work in Saudi Arabia added value locally. But the university also required them to change their affiliation on the Thomson Reuters academic citation website, a key reference for the U.S. News rankings. That meant the Saudi university could claim the publications of their new adjunct faculty as its own.”

The Data That’s Missing

Importantly, O’Neil points out that the US News & World Report didn’t include data on tuition and fees in their ranking criteria. The absence of this meaningful data had a huge impact. Rankings didn’t suffer if colleges posed a huge and disproportionate cost to students.

Between 1985 and 2013, college tuition has skyrocketed 500 percent. While some of this can be attributed to other factors, the US News & World Report rankings are at least partly to blame.

Consequently, many students based their college selection on rankings that did not include any criteria for financial success. These rankings may lead students to pick expensive colleges with dubious payoffs. And remain in debt for their education for years to come.

O’Neil states, “…many poisonous assumptions are camouflaged by math and go largely untested and unquestioned.”

Misinformation That Keeps On Misinforming

You may be thinking, “Assumptions get fixed, right? Good programmers fix bad code. Good data scientists fix bad algorithms.”

Not always. Perhaps not even all that often.

In the case of the US News & World Report, there are assumptions that haven’t been challenged by real data in 29 years. The report is still living off proxy data. In other words, that’s one powerful weapon of math destruction.

This book teaches us that proxies are dangerous when not handled correctly.

Proxy Data Isn’t Always Bad.

In the absence of meaningful data, companies are often forced to rely on proxy data.

Many businesses must use proxy data when real data is inaccessible. Perhaps the appropriate research sample is too costly to obtain. In other cases, there is not enough time to gather real data before a decision needs to be made. In such cases, companies turn to proxies.

Is this an inherently bad practice? No. Sometimes it’s quite expedient.

For example, if you want to know the size of a tech company’s employee base, you might look at LinkedIn data. LinkedIn will give the number of employees who work at the company. Obviously, LinkedIn’s data won’t be 100 percent accurate. LinkedIn can only base its estimates on employees with LinkedIn profiles. Luckily, for tech companies, LinkedIn’s employee estimate will likely only be off by a few percentage points. For companies in industries that don’t utilize LinkedIn as much, that margin of error will probably grow. However, LinkedIn data is a reasonable source of proxy data when tech companies don’t publicly release their employee counts.

Here is another example of the reasonable use of proxy data. You might look at search traffic to a competitor’s site to estimate their growth.

Essentially, you just need to recognize when you’re looking at proxy data. Wherever you can, add real data to your evaluations.

Are You Using a Weapon of Math Destruction?

These are warning signs that you’re using proxy data dangerously:

  • Proxy data models influence decision making at scale.
  • Unclear criteria form the basis of accepted data models.
  • There is huge potential for damage to individual consumers, citizens, or businesses.

We Need To Think Hard About The Validity of Our Data.

This book forces us to ask hard questions about big data ethics. We should be asking ourselves:

  • Am I looking at proxy data? If so, am I okay with that?
  • Is the model I’m using fair? Does it need more real or qualitative data?
  • Does the data model I’m using qualify as a weapon of math destruction?
  • Do I need to build a new data model?

To be able to answer these questions, every tech sales, marketing, or product development leader should read this book.

In closing, we’ll let O’Neil have the final word.

She writes, “Data is not going away. Nor are computers—much less mathematics. Predictive models are, increasingly, the tools we will be relying on to run our institutions, deploy our resources, and manage our lives. But as I’ve tried to show throughout this book, these models are constructed not just from data but from the choices we make about which data to pay attention to—and which to leave out. Those choices are not just about logistics, profits, and efficiency. They are fundamentally moral.”


This podcast is brought to you by Cascade Insights. We specialize in market research and marketing for B2B technology companies. 

 

Get in touch

"*" indicates required fields

Name
Cascade Insights will never share your information with third parties. View our privacy policy.
This field is for validation purposes and should be left unchanged.
  • Bio
  • Twitter
  • LinkedIn
  • Latest Posts
Sean Campbell

Sean Campbell

CEO
Sean Campbell is the CEO of Cascade Insights, a competitive intelligence and market research firm for B2B technology companies. A 20-year technology veteran, Sean oversees the health of the firm, drives the company’s thought leadership efforts and maintains relationships with key clients.
Sean Campbell

@sean_campbell

Sean Campbell
Sean Campbell

Latest posts by Sean Campbell (see all)

  • B2B Market Research: Exposing the Truth - August 23, 2022
  • B2B Focus Groups: Virtual for the Win - June 7, 2022
  • B2B Buyer’s Journey: Buyers and Marketers in the Blender - March 8, 2022
  • Bio
  • Twitter
  • LinkedIn
  • Latest Posts
Isabel Gautschi

Isabel Gautschi

Director of Marketing
Isabel Gautschi

@Isa_belG

Isabel Gautschi
Isabel Gautschi

Latest posts by Isabel Gautschi (see all)

  • B2B Thought Leadership: How To Use Data Effectively - January 1, 2021
  • Nearly 80% of Business Activity Is Disrupted By Coronavirus - March 25, 2020
  • “I Have No Idea What Your Product Does”: A Cautionary Tale of Marketing Buzzwords - November 19, 2019
Tags: big data, big data ethics, Cascade Insights, Cathy O'Neil, Isabel Gautschi, market research, marketing, math, product teams, proxy data, research, sales, Sean Campbell, tech, Weapons of Math Destruction
Share this entry
  • Share on Facebook
  • Share on Twitter
  • Share on LinkedIn
  • Share by Mail
You might also like
Marketing Madness: Chasing Viral DreamsMarketing Madness: Chasing Viral Dreams
Riveting Research: It’s Not a Mission Impossible
5 Clues It's Time for B2B Concept Testing5 Clues It’s Time for B2B Concept Testing
Ageism In TechTech Needs More Age Diversity
Message Testing: Because Intent ≠ InterpretationMessage Testing: Because Intent ≠ Interpretation
Welcome to the B2B Revealed Podcast
In-Depth Interviews: The Answer To 'None of The Above'In-Depth Interviews: The Answer To ‘None of The Above’
4 Types of Bad News B2B Brand Research Delivers

GET IN TOUCH

hello@cascadeinsights.com

(503) 898-0004

WHAT WE DO

Market Research Services
Marketing Services
Subscribeto RSS Feed

Blog Categories

  • B2B Buyer Persona Research
  • B2B Channel Market Research
  • B2B Churn Analysis
  • B2B Competitive Landscape Analysis
  • B2B Customer Experience Research
  • B2B Customer Journey Mapping
  • B2B Go-To-Market Research
  • B2B Market Opportunity Research
  • B2B Market Research Blog
  • B2B Market Segmentation Research
  • B2B Marketing Blog
    • Uncategorized
  • B2B Messaging
  • B2B New Product Launch Research
  • B2B Thought Leadership
  • B2B Usability Testing
  • Brand Research
  • Concept Testing
  • Content Marketing
  • Customer Satisfaction Research
  • Data-Driven Marketing Research
  • Jobs-To-Be-Done
  • Key Buying Criteria
  • Marketing Enablement
  • Marketing Strategy
  • Message Testing Research
  • Partner Enablement
  • Product/Service Research
  • Sales Enablement Marketing
  • Sales Enablement Research
  • User Personas
  • Videos
  • Win/Loss Analysis

Written by

Sean Campbell
Sean Campbell
Isa Gautschi
Isabel Gautschi

RECENT POSTS

  1. Win-Loss Analysis: 6 Reasons Why You’re Losing Deals
  2. B2B Market Research: 3 Steps to Recruiting a Superb Sample
  3. Ageism in Tech: The Silent Career Killer
  4. Nearly 80% of Business Activity Is Disrupted By Coronavirus

Connect With Us

503.212.0687

hello@cascadeinsights.com

  • LinkedIn
  • Twitter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

B2B Market Research

  • Customer Experience Research
  • — Buyer Persona Research
  • — Buyer's Journey Research
  • — Key Buying Criteria Research
  • — Jobs-To-Be-Done Research
  • — User Personas
  • — Customer Satisfaction Research

  • B2B Product/Service Research
  • — Market Opportunity Research
  • — Concept Testing
  • — Go-To-Market Research

  • Marketing Enablement Research
  • — B2B Data-Driven Marketing Research
  • — Message Testing
  • — Brand Research
  • — Thought Leadership
  • — Partner Enablement

  • Sales Enablement Research
  • — Competitive Landscape Analysis
  • — Win Loss Analysis
  • — Churn Analysis
  • — Channel Research

B2B Marketing

  • — B2B Marketing Strategy
  • — B2B Messaging Services
  • — B2B Content Marketing
  • — B2B Sales Enablement

About Us

  • — Our Story
  • — Our Clients
  • — Client Testimonials
  • — Careers
  • — Ethics Policy
  • — Privacy Policy

Blogs

  • — B2B Market Research Blog
  • — B2B Marketing Blog
  • — Cascade Insights Blog
Cascade Insights is proud to be a member of the Inc. 5000.
Oregon 100 Best
The Pre-Flight Checklist for Research PresentationsPowerPoint Tips for Market ResearchersWhat LinkedIn ad data reveals about the tech gender gap.The Gender Gap: A Look at 50 Tech Giants
Scroll to top