The Analyst’s Bias

“Beware of the HiPPO in the room.” The risks and dangers of top-down, intuition-based decision making are well known in the business world. Experimentation and data-based decision making have become widely acknowledged as the right way to steer a business.

And for good reason: leading experimenters such as Netflix, Google and Booking show that making decisions based on facts and evidence rather than intuition can lead to exceptional business success.

But what if, in the course of this development, the HiPPO (Highest Paid Person’s Opinion) is no longer the one to be afraid of? What if the person who is supposed to help fight top-down decision making has taken its place?

What if the analyst is the new biasing factor in decision making?


Let me be clear: having personal opinions about new ideas, suggestions and approaches couldn’t be more natural. We all have our cognitive biases. But we cannot ignore this fact just because evidence-based decision making appears to be a safe framework.


Biasing Analytics Results

The analyst’s responsibility is to paint a clear picture of the business’s situation and inform decision-making. And there are plenty of ways in which an analyst can, deliberately or entirely unaware, bias the final decision. I like to split those into conscious and unconscious biases.

Conscious Biases

Let’s have a look at the first kind, where analysts make deliberate decisions that will impact their results. Conscious biases are closely connected to the analyst’s personal opinion, for instance about a new marketing campaign or a new product feature. Whether the analyst believes in a specific idea can significantly impact how the subsequent research is conducted, and whether it is conducted at all. Here are a few potential sources of conscious bias:

“Can you just give us a rough estimate for this particular metric?”


a) Making Guesses. Requests for estimates and opinions are more or less an invitation to introduce biases. Obviously you can’t know all the numbers by heart, and the best thing to do would be to go back to your desk, check the metrics and report them back. But checking every single metric costs too much time. Often enough, we simply trust our intuition, which strongly correlates with our personal opinion about a specific idea. So you make an educated guess about what the number might be. At this point, analysts can already substantially influence whether an initiative is pursued and what everybody’s expectations are. The first number you come up with serves as an anchor: it sets expectations that stakeholders might reference in the future to assess an idea’s potential.

“Traffic is so low on this page, it’s not worth looking further into this.”


b) Giving Personal Opinions. Sometimes we might be tempted to provide no number at all and instead give a personal opinion. While this opinion is (hopefully) based on facts and the analyst’s experience, it can still strongly correlate with one’s subjective opinion about the idea discussed.


c) Depth of Research. After kicking off the research, the question is:


When does an analyst have enough information to give a good recommendation or overview for a particular problem?


Of course, you can always drill deeper into a specific topic to get more evidence to support a decision. Analysts might be inclined to dig deeper into an area to prove or disprove a particular idea they have a strong opinion about. At the same time, we might put less effort into a research question where the outcome is expected to be less exciting or the decision to be made seems pretty trivial anyway.

d) Setting targets. The analytics and experimentation landscape itself invites analysts, and anybody else who operates in it, to introduce biases at some points, be it when setting the significance or power level for an A/B test or when selecting a metric to measure the success of a new feature or campaign. Those are, to a certain degree, subjective decisions the analyst has to make to produce any results. But at the same time, they can have a significant effect on the actual outcome of the research.
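To make the point about significance and power levels a bit more tangible, here is a minimal sketch using the common normal-approximation formula for comparing two conversion rates. The baseline rate of 10%, the hoped-for rate of 12% and the function name `sample_size_per_group` are assumptions picked purely for illustration; they are not taken from a real test.

```python
from math import ceil
from statistics import NormalDist


def sample_size_per_group(p_base, p_variant, alpha=0.05, power=0.80):
    """Approximate users needed per group for a two-sided two-proportion test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value for the significance level
    z_beta = NormalDist().inv_cdf(power)           # quantile for the desired power
    variance = p_base * (1 - p_base) + p_variant * (1 - p_variant)
    effect = p_variant - p_base
    return ceil((z_alpha + z_beta) ** 2 * variance / effect ** 2)


# Same hypothetical experiment, different "subjective" settings.
for alpha, power in [(0.10, 0.80), (0.05, 0.80), (0.05, 0.90), (0.01, 0.80)]:
    n = sample_size_per_group(0.10, 0.12, alpha, power)
    print(f"alpha={alpha:.2f} power={power:.2f} -> {n} users per group")
```

A stricter significance level or a higher power pushes the required sample up, so the very same idea can look "easy to test" or "not worth the traffic" depending on the settings the analyst happens to choose.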



Unconscious Biases

Unconscious biases are not introduced by the analyst’s active decision making. This sort of bias is less connected to a personal opinion about a specific idea or research question, but it can affect the results just as strongly. Examples of biases in this category are:

a) Programming Errors. Be it an error in a SQL query, a wrong logical condition when filtering a pandas DataFrame, or an incorrect regular expression: all these programming errors can occur when we’re trying to get insights from the data in front of us. Unlike syntax errors, this kind of error can go wholly unnoticed when we run our code and can thus have a substantial impact on the results of our analysis.

b) Wrong handling of data. Usually, the data we want to examine to answer a particular research question does not come in a usable format. Before we can use a statistical model to derive insights from our data, we might have to clean it, select and engineer appropriate features, and eventually perform data transformations. All those actions can bias our dataset and thus our decisions in one direction or another.
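As a small illustration of how a cleaning step can shift a result, consider the sketch below. The revenue figures are made up, and the assumption that a missing value means "no purchase" is mine; in a real dataset that meaning would have to be verified first.

```python
from statistics import mean

# Hypothetical per-session revenue; None is ASSUMED to mean "no purchase".
revenue = [30.0, None, None, 50.0, None, 40.0, None, None]

# Cleaning choice 1: drop the missing rows.
dropped = [r for r in revenue if r is not None]

# Cleaning choice 2: treat missing as zero revenue.
filled = [r if r is not None else 0.0 for r in revenue]

mean_dropped = mean(dropped)  # average over purchasing sessions only
mean_filled = mean(filled)    # average over all sessions

print(f"dropping missing rows: {mean_dropped:.2f} per session")
print(f"filling with zero:     {mean_filled:.2f} per session")
```

Both choices are defensible in some context, but they answer different questions, and the reported "revenue per session" differs considerably between them.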


c) Wrong interpretation of data. We might have done everything right when handling and modelling our dataset. But in the end, we can still draw the wrong conclusions from the results in front of us. Classic misinterpretations are confusing correlation with causation or drawing the wrong conclusions about the relationship between two parameters.
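The correlation-versus-causation trap can be sketched with a tiny simulation. Everything here is invented for illustration: a hidden `engagement` variable drives both `feature_use` and `spend`, while feature use has no effect on spend at all. The helper names `pearson` and `residuals` are my own.

```python
import random
from statistics import mean


def pearson(a, b):
    """Pearson correlation coefficient of two equally long sequences."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def residuals(a, z):
    """Remove the linear effect of z from a (simple least-squares fit)."""
    ma, mz = mean(a), mean(z)
    slope = (sum((x - ma) * (w - mz) for x, w in zip(a, z))
             / sum((w - mz) ** 2 for w in z))
    return [x - ma - slope * (w - mz) for x, w in zip(a, z)]


rng = random.Random(42)  # fixed seed so the run is repeatable
n = 5000
engagement = [rng.gauss(0, 1) for _ in range(n)]          # hidden confounder
feature_use = [e + rng.gauss(0, 1) for e in engagement]   # driven by engagement
spend = [e + rng.gauss(0, 1) for e in engagement]         # also driven by engagement

raw_r = pearson(feature_use, spend)
adjusted_r = pearson(residuals(feature_use, engagement),
                     residuals(spend, engagement))

print(f"raw correlation:               {raw_r:.2f}")
print(f"after adjusting for engagement: {adjusted_r:.2f}")
```

The raw correlation looks like evidence that the feature drives spending, yet it largely disappears once the confounder is accounted for. In practice the confounder is often not measured, which is exactly why this misinterpretation is so easy to make.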


So is the analyst to be mistrusted?

Today’s most valuable companies such as Netflix, Amazon and Google show that experimentation and data have to replace intuition as a basis for making decisions.


Hence, trust in the data, and therefore in the analyst’s output, is essential. It’s the analyst’s responsibility to build and maintain that trust. Analysts have to do their best to provide unbiased, informative insights to support decision-making and drive businesses in the right direction.

Therefore, it is imperative to be aware of your own biases and to overcome them where possible.


When you are asked for a rough estimate, or when you are deciding how deep to drill into a specific topic, take a step back. Reflect on your decision and thought process and try to get a neutral perspective on the issue at hand.

To avoid unconscious biases, it helps to adopt some best practices from the world of software engineering: Use unit tests in your queries and notebooks, start pair programming and ask colleagues to review your code and approach.
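Here is a minimal sketch of what such a test could look like for a filter step. The user records, the function names and the expected result are all hypothetical; the point is only that a tiny hand-checked fixture can expose a logic error that runs without any exception. The same kind of precedence mistake can occur in a SQL WHERE clause or a pandas boolean mask.

```python
# Tiny hand-made fixture: small enough to compute the right answer by hand.
users = [
    {"id": 1, "country": "DE", "active": True},
    {"id": 2, "country": "DE", "active": False},
    {"id": 3, "country": "AT", "active": True},
    {"id": 4, "country": "AT", "active": False},
    {"id": 5, "country": "FR", "active": True},
]


def active_de_at_buggy(rows):
    # Intended: active users from DE or AT.
    # Bug: `and` binds tighter than `or`, so every DE user passes the filter.
    return [r["id"] for r in rows
            if r["country"] == "DE" or r["country"] == "AT" and r["active"]]


def active_de_at_fixed(rows):
    # The country check and the activity check are now combined as intended.
    return [r["id"] for r in rows
            if r["country"] in ("DE", "AT") and r["active"]]


def check_filter(filter_fn):
    """Return True if the filter reproduces the hand-computed expectation."""
    return filter_fn(users) == [1, 3]


print("buggy filter passes:", check_filter(active_de_at_buggy))
print("fixed filter passes:", check_filter(active_de_at_fixed))
```

Both versions run without complaint, and on a table with millions of rows the inflated count would be hard to spot by eye. A check like this, kept next to the query or in the notebook, also gives a reviewing colleague something concrete to read.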



Translated from: https://towardsdatascience.com/the-analysts-bias-5c84825c0f48
