实验人员考评指标_了解实验指标

实验人员考评指标

In the first part of my series on experimental design Thinking About Experimental Design, we covered the foundations of an experiment: the goals, the conditions, and the metrics. In this post, we will move away from the initial experimental set up to begin understanding baseline metrics and the nuances of picking appropriate conversion metrics and rates.

在我的实验设计系列的第一部分中,“ 思考实验设计”中 ,我们介绍了实验的基础:目标,条件和度量。 在这篇文章中,我们将脱离最初的实验设置,开始了解基线指标以及选择合适的转化指标和费率的细微差别。

介绍 (Introduction)

I introduced the goals of an experiment a business context, using the example of a lemonade stand: measuring the difference in outcomes (# of cups sold) under controlled conditions (color of cup) during comparable time-frames. In the example, I failed to mention a critical element of any experiment — a hypothesis, a preliminary presumption based on limited evidence. Strictly speaking, the goal of an experiment is to validate or refute ideas put forth by a hypothesis. A reasonable hypothesis for our lemonade stand example might be that “the color of the lemonade cup affects the number of cups sold.”

我以柠檬水摊位为例,介绍了在商业环境中进行实验的目标:在可比较的时间范围内,在受控条件 (杯子的颜色)下测量结果 (所售杯子的数量)的差异 。 在示例中,我没有提到任何实验的关键要素- 假设 ,即基于有限证据的初步假设。 严格来说,实验的目的是验证或驳斥假设提出的想法。 我们的柠檬水摊位示例的合理假设可能是“柠檬水杯的颜色会影响所售杯子的数量”。

从基线开始 (Starting from a Baseline)

If we’re just starting our lemonade stand (or other small business/business line), can you see a problem with immediately designing this particular experiment (or any experiment) to validate a hypothesis? Market research aside, we don’t have any reliable (backed by hard-data) predictions on what to expect for our number of cups sold!

如果我们只是开始我们的柠檬水摊位(或其他小型企业/业务线),您是否会立即设计此特定实验(或任何实验)以验证假设是否存在问题? 除了市场研究,我们对售出的杯子数量还没有可靠的预测(有硬数据支持)!

If we have poor sales the first two weeks with red cups, followed by boosted sales in the next two weeks with blue cups, I wouldn’t be comfortable concluding that blue cups are superior. The sudden change in sales may be due to the fact that people didn’t know about our stand in its opening weeks, and only recently discovered our stand. Remember, the goal of an experiment is to understand the effects of incremental changes. When we introduce a big change (ie. starting our stand or completely revamping our storefront), we introduce instability into our existing business, which will mask and/or distort the effect of our experimental conditions.

如果我们在头两周使用红色杯子的销售情况不佳,然后在接下来的两周使用蓝色杯子的销售量有所增长,那么我不能肯定地说蓝色杯子是上乘的。 销售量的突然变化可能是由于人们在开放的几周内并不了解我们的展位,而只是在最近才发现了我们的展位。 请记住, 实验的目的是了解增量变化的影响。 当我们进行重大更改时(即启动我们的展位或完全改造店面),我们会将不稳定因素引入现有业务,这将掩盖和/或扭曲我们的实验条件的影响。

Before conducting an experiment it’s important to establish a stable baseline that will be used the judge the effects our incremental changes.

在进行实验之前,重要的是要建立一个稳定的基线,该基线将用于判断我们的增量变化的影响。

To make our example more concrete, let’s say we’ve been running our lemonade stand for 1.5 years. We’re relatively well-known in the community, but we’ve run out of ideas on how we can continue growing our business. After prioritizing our business goals and brainstorming relevant conversion metrics and rates*, we’ve decided to analyze the conversion metric:

为了使我们的示例更具体,假设我们已经运行柠檬水摊位1.5年了。 我们在社区中相对知名,但是关于如何继续发展业务的想法已经耗尽。 在确定了业务目标的优先级并集体讨论了相关的转化指标和转化率*之后,我们决定分析转化指标:

# of cups sold / people who stop by our stand.

售出的杯子数/站在我们展位旁的人。

I have generated some dummy data representing our monthly sales for the past year. The relative stability of our business with respect to the conversion rate. Now, when we introduce our experimental condition (changing the color of the cup), we have a reliably predictable baseline conversion rate (3.62% average) with which we can compare our new outcome.

我已经生成了一些虚拟数据,这些数据代表了过去一年的每月销售额。 我们的业务相对于转换率的相对稳定性。 现在,当我们介绍实验条件(改变杯子的颜色)时,我们有了可靠的可预测的基线转化率(平均3.62%),可以与我们比较新的结果。

Image for post
Image by Author
图片作者

考虑转化率的变化 (Thinking about changes in our conversion rate)

At this point, it is easy to forget that our target metric is a conversion rate and begin brainstorming incremental changes that increase # of cups sold. The use of a conversion rate instead of an absolute value requires us to expand our focus from a single metric, to the relationship between two related metrics — from a focus on scale to a focus on scale & efficiency.

在这一点上,很容易忘记我们的目标指标是转化率,并开始集思广益,以增加销售杯数的增量变化。 使用转换率而非绝对值要求我们将重点从单一指标扩展到两个相关指标之间的关系-从关注规模到关注规模和效率。

To increase our conversion rate, we must develop a strategy to increase the number of cups sold at a greater rate than we increase the number of people who stop by our stand. Altering the color or design of the cup may be an interesting business initiative; this is assuming that we believe people stop by our stand mostly for the lemonade and our beautiful stand and not due to the color of the cup.

为了提高转化率,我们必须制定一项战略,以比增加增加在我们展台前停留的人数更多的速度增加杯子的销售数量。 改变杯子的颜色或设计可能是一个有趣的商业尝试; 这是假设我们相信人们主要是为了柠檬水和漂亮的立场而停下来,而不是因为杯子的颜色。

From a business perspective, we often read about using data and experiments to deliver actionable insights. In addition to our baseline rate, it is important to set reasonable target rates under each of our experimental conditions; we do not want to be taking action on any small change in conversion rates. Setting a target conversion rate is as much an art as it is a science, and can be based on a combination of past data and intuitive business sense. In our lemonade example, we might say that we will switch the color of our cups moving forward if our conversion rate is 4.12% over the next couple of months, an increase of .5%.

从业务角度来看,我们经常阅读有关使用数据和实验来提供可行见解的信息。 除了我们的基准速率外,在每个实验条件下设定合理的目标速率也很重要; 我们不希望对转化率的任何小变化采取行动。 设定目标转化率既是一门艺术,也是一门科学,并且可以基于过去的数据和直观的商业意识相结合。 在我们的柠檬水示例中,我们可以说,如果未来几个月我们的转化率为4.12%(增加0.5%),我们将改变杯子的颜色。

结论: (Conclusion:)

To summarize what we have accomplished in our lemonade example:

总结一下我们在柠檬水示例中所取得的成就:

  1. We defined our business goal: increase cups sold

    我们确定了我们的业务目标: 增加杯子销量

2. We defined our conversion metric and conversion rate: cups sold & cups sold / foot traffic

2.我们定义了转化指标和转化率: 售出杯数和售出杯数/人流量

3. We developed a controlled incremental change that will (hypothetically) affect our outcome

3.我们开发了受控的增量更改,该更改将(假设地)影响我们的结果

4. We established a stable baseline for comparison.

4.我们建立了稳定的基线进行比较。

If our conversion metric achieves the target goal, we can conduct business under the new conditions moving forward right? Well, not quite.

如果我们的转化指标实现了目标,那么我们可以在新的条件下开展业务吗? 好吧,不完全是。

So far, you may have realized that I have not introduced any statistics in our experiment! At this point, it may not be entirely clear why we need statistics to validate our hypothesis and complete our experiment. Nonetheless, when dealing with such uncertainty, we want a way of quantifying our decision-making process; we use statistics quantify the strength of our experimental evidence. In the next article of this series, I will begin introducing ideas of basic statistical concepts and test statistics as they apply to our experimental design.

到目前为止,您可能已经意识到我没有在实验中引入任何统计信息! 在这一点上,可能还不清楚,为什么我们需要统计数据来验证我们的假设并完成我们的实验。 但是,在处理此类不确定性时,我们需要一种量化决策过程的方法。 我们使用统计数据来量化我们的实验证据的强度。 在本系列的下一篇文章中,我将开始介绍适用于我们的实验设计的基本统计概念和测试统计概念。

[1]: Monika Wahi. (2020). The Data Science of Experimental Design. LinkedIn Learning. https://www.linkedin.com/learning/the-data-science-of-experimental-design

[1]:莫妮卡·瓦希(Monika Wahi)。 (2020)。 实验设计的数据科学。 LinkedIn学习。 https://www.linkedin.com/learning/the-data-science-of-experimental-design

[2]: Icons & Images. Pexels: https://www.pexels.com/

[2]: 图标和图像。 Pexels: https ://www.pexels.com/

翻译自: https://towardsdatascience.com/understanding-experiment-metrics-ecb0d759f743

实验人员考评指标

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/392414.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

leetcode 188. 买卖股票的最佳时机 IV(dp)

给定一个整数数组 prices ,它的第 i 个元素 prices[i] 是一支给定的股票在第 i 天的价格。 设计一个算法来计算你所能获取的最大利润。你最多可以完成 k 笔交易。 注意:你不能同时参与多笔交易(你必须在再次购买前出售掉之前的股票&#xf…

kotlin编写后台_在Kotlin编写图书馆的提示

kotlin编写后台by Adam Arold亚当阿罗德(Adam Arold) 在Kotlin编写图书馆的提示 (Tips for Writing a Library in Kotlin) Writing a library in Kotlin seems easy but it can get tricky if you want to support multiple platforms. In this article we’ll explore ways f…

1.Swift教程翻译系列——关于Swift

英文版PDF下载地址http://download.csdn.net/detail/tsingheng/7480427 我本来是做JAVA的。可是有一颗折腾的心,苹果公布Swift以后就下载了苹果的开发文档。啃了几天。朦朦胧胧的看了个几乎相同,想静下心看能不能整个翻译出来。我英语一般般,…

核心技术java基础_JAVA核心技术I---JAVA基础知识(集合set)

一:集合了解(一)确定性,互异性,无序性确定性:对任意对象都能判定其是否属于某一个集合互异性:集合内每个元素都是无差异的,注意是内容差异无序性:集合内的顺序无关(二)集合接口HashSet&#xff…

nba数据库统计_NBA板块的价值-从统计学上讲

nba数据库统计The idea is not to block every shot. The idea is to make your opponent believe that you might block every shot. — Bill Russel这个想法不是要阻止每一个镜头。 这个想法是让你的对手相信你可能会阻挡每一个投篮。 —比尔罗素 The block in basketball ha…

leetcode 330. 按要求补齐数组(贪心算法)

给定一个已排序的正整数数组 nums,和一个正整数 n 。从 [1, n] 区间内选取任意个数字补充到 nums 中,使得 [1, n] 区间内的任何数字都可以用 nums 中某几个数字的和来表示。请输出满足上述要求的最少需要补充的数字个数。 示例 1: 输入: nums [1,3], …

【炼数成金 NOSQL引航 三】 Redis使用场景与案例分析

验证redis的主从复制,将实验过程抓图 复制配置文件 更改slave的端口 和相关master配置 主从复制测试 研究在OAuth中的“一次数”nonce有什么用途?怎样使用?以此熟悉OAuth的全流程 nonce ,一个随机的混淆字符串,仅仅被…

SQL Server需要监控哪些计数器 ---指尖流淌

http://www.cnblogs.com/zhijianliutang/p/4174697.html转载于:https://www.cnblogs.com/zengkefu/p/7044095.html

akka 简介_Akka HTTP路由简介

akka 简介by Miguel Lopez由Miguel Lopez Akka HTTP路由简介 (An introduction to Akka HTTP routing) Akka HTTP’s routing DSL might seem complicated at first, but once you get the hang of it you’ll see how powerful it is.Akka HTTP的路由DSL乍一看似乎很复杂&…

leetcode 1046. 最后一块石头的重量(堆)

有一堆石头&#xff0c;每块石头的重量都是正整数。 每一回合&#xff0c;从中选出两块 最重的 石头&#xff0c;然后将它们一起粉碎。假设石头的重量分别为 x 和 y&#xff0c;且 x < y。那么粉碎的可能结果如下&#xff1a; 如果 x y&#xff0c;那么两块石头都会被完全…

java2d方法_Java SunGraphics2D.fillRect方法代码示例

import sun.java2d.SunGraphics2D; //导入方法依赖的package包/类/*** Return a non-accelerated BufferedImage of the requested type with the* indicated subimage of the original image located at 0,0 in the new image.* If a bgColor is supplied, composite the orig…

js建立excel表格_建立Excel足球联赛表格-传统vs动态数组方法

js建立excel表格介绍 (Introduction) I am going to show you the different ways you can build a football league table in Excel. Some of the methods are old school but others utilise Excel’s new capabilities.我将向您展示在Excel中建立足球联赛表格的不同方法。 其…

postman+newman生成html报告

作为测试菜鸟,在学习postmannewman的使用过程中真的是颇费周折......没办法技术太菜,只能多学习. postman的下载安装不多言说,下载地址:https://www.getpostman.com/downloads/ newman的安装过程: 1.首先需要安装node.js,可以去官网下载,地址:https://nodejs.org/en/#download …

java jdk1.9新特性_JDK1.9-新特性

1. Java平台级模块系统该特性使Java9最大的一个特性&#xff0c;Java提供该功能的主要的动机在于&#xff0c;减少内存的开销&#xff0c;JVM启动的时候&#xff0c;至少会有30~60MB的内存加载&#xff0c;主要原因是JVM需要加载rt.jar&#xff0c;不管其中的类是否被classload…

如何在10分钟内让Redux发挥作用

Hi everyone ❤️大家好❤️ For a while now I’ve been hearing my friends and colleagues complaining about how hard it was to get into Redux.一段时间以来&#xff0c;我一直在听我的朋友和同事抱怨进入Redux有多困难。 I run a freeCodeCamp Study Group in the So…

两个链接合并_如何找到两个链接列表的合并点

两个链接合并了解问题 (Understand the Problem) We are given two singly linked lists and we have to find the point at which they merge.我们给了两个单链表&#xff0c;我们必须找到它们合并的点。 [SLL 1] 1--->3--->5 \ …

安装veket到移动硬盘NTFS分区

如果你已经看过《手动安装veket到硬盘》和《简单的将veket安装到U盘的方法》两篇文章并且安装成功的话&#xff0c;说明不适用本文的安装环境&#xff0c;就不用往下看了。 《手动安装veket到硬盘》一文采用grub4dos来引导硬盘上的veket&#xff0c;主要是用来在本机已安装Wind…

简书使用小技巧

1、不同字体  在 设置->基础设置->富文本 模式下可以实现 2、添加图片&#xff0c;让文章更生动 3、添加代码框 &#xff01;注意&#xff1a;设置为Markdown模式后&#xff0c;只对新创建的文章起作用。转载于:https://www.cnblogs.com/HMJ-29/p/7049540.html

掩码 项目编码_每天进行20天的编码项目

掩码 项目编码by Angela He通过何安佳 每天进行20天的编码项目 (A coding project a day for 20 days) 我如何在20天内自学Web开发 (How I taught myself web development in 20 days) It was the first day of winter break for Stanford students. Back at home, I opened a…

java循环一年月份天数和_javawhile循环编写输入某年某月某日,判断这一天是这一年的第几…...

该楼层疑似违规已被系统折叠 隐藏此楼查看此楼public class ZuoYe9 {public static void main(String[] args) {int days0; //存储变量这一年的第几天//1.输入年&#xff0c;月&#xff0c;日Scanner inputnew Scanner(System.in);System.out.println("请输入年份&#xf…