英国脑科学领域_来自英国A级算法崩溃的数据科学家的4课

英国脑科学领域

In the UK, families, educators, and government officials are in an uproar about the effects of a new algorithm for scoring “A-levels,” the advanced level qualifications used to evaluate students’ knowledge of specific subjects in preparation for university study.

在英国,家庭,教育工作者和政府官员对一种新的“ A-levels ”评分算法的效果感到震惊,“ A-levels ”是用于评估学生对特定科目的知识以准备大学学习的高级证书。

A-level courses usually culminate with exams conducted in testing centers. Because of COVID-19, this year’s exams were canceled. In lieu of those exams and their decisive scores, Ofqual, the government agency responsible for scoring students’ work in A-level classes, opted to use a new algorithm to grade students’ work by applying statistical models of school performance from earlier years. Teachers had already graded the students’ work as part of their coursework, but the algorithm overrode those grades, dropping final scores a full letter for 36% of entries and two full letters for 3% of entries. For thousands of students, A’s became B’s, B’s became C’s, and in a few cases, B’s became D’s. Some students failed their classes, because the algorithm determined that some students must fail. Meanwhile, 5% of well-off students attending private schools saw their scores increase.

A级课程通常以在测试中心进行的考试达到高潮。 由于COVID-19,今年的考试被取消了。 代替对这些考试及其决定性成绩的评分,负责对学生在A级课程中的工作进行评分的政府机构Ofqual选择采用一种新算法,通过应用早些年的学校成绩统计模型来对学生的工作进行评分。 老师已经为学生的作业评分,这是他们课程学习的一部分,但是该算法取代了这些成绩, 最终分数降低了36%的满分和2%的3% 。 对于成千上万的学生,A变成B,B变成C,在少数情况下,B变成D。 一些学生的课程不及格,因为该算法确定某些学生必须不及格。 同时,上私立学校的小康学生中有5%的分数有所提高。

The algorithm’s grading scheme affected disadvantaged students the most. As The Guardian notes:

该算法的评分方案对处境不利的学生影响最大。 正如《卫报》所述

“Pupils from disadvantaged backgrounds have been worst hit by the controversial standardisation process used to award A-level grades in England this year, while pupils at private schools benefited the most. Private schools increased the proportion of students achieving top grades — A* and A — twice as much as pupils at comprehensives. . . . Pupils in lower socioeconomic backgrounds were most likely to have the grades proposed by their teachers overruled, while those in wealthier areas were less likely to be downgraded, according to the analysis.”

“来自贫困家庭的学生受到今年用于授予英国A级成绩的有争议的标准化程序的打击最大,而私立学校的学生受益最大。 私立学校增加了达到最高成绩A *和A的学生比例,是综合学生的两倍。 。 。 。 分析认为,社会经济背景较低的学生最有可能推翻老师建议的成绩,而较富裕地区的学生则不太可能被降级。

The lower grades led some universities and medical schools to revoke the acceptance letters. Affected students were crushed.

低年级导致一些大学和医学院撤销了录取通知书。 受影响的学生被压碎了。

Now many universities are reversing those decisions, and the government is performing a “U-turn,” accepting teachers’ grades as the final A-level scores. Still not impressed, a senior Tory MP is now calling for the abolition of Ofqual itself.

现在,许多大学正在扭转这些决定,而政府正在执行“掉头”,接受教师的成绩作为最终的A级成绩。 仍然没有留下深刻印象的是, 一位高级保守党议员现在呼吁取消Ofqual本身 。

数据科学家的经验教训 (Lessons for Data Scientists)

Here are four lessons this debacle offers data scientists and data engineers.

这是这场灾难给数据科学家和数据工程师的四课。

1.如果结果看起来很奇怪,请仔细检查算法。 (1. If the results seem odd, double-check your algorithm.)

If you’re developing an algorithm that lowers results for a significant number of entries — let alone 40% of entries — it’s time to re-evaluate your algorithm, especially if the results affect people in life-altering ways, such as denying them a mortgage or affecting which university they can attend.

如果您正在开发一种算法,该算法会降低大量条目的结果(更不用说40%的条目了),那么该是重新评估算法的时候了,尤其是当结果以改变生活的方式影响人们时,例如拒绝他们抵押或影响他们可以参加的大学。

Again, from The Guardian:

再次,从卫报

“Ofqual instead chose to focus on its own measure of accuracy — whether it was right ‘within a grade’. . . . But as any A-level student will tell you, accuracy ‘within a grade’ is meaningless. Ofqual may mark itself highly if it gives an A student a B, but for that student, the difference is life-changing.”

“ Ofqual而是选择专注于自己的准确性衡量标准-是否“在等级内”是正确的。 。 。 。 但是,正如任何A级学生都会告诉您的那样,“在年级内”的准确性是没有意义的。 如果给A学生一个B,Ofqual可能会给予很高的评价,但是对于那个学生来说,差异是改变人生的。”

Keep your eyes open for shifts in data patterns, and understand what constitutes a significant change for the data science use case you’re working on.

睁大眼睛注意数据模式的变化,并了解什么构成了您正在研究的数据科学用例的重大变化。

2.如果结果似乎有偏差,请对算法进行三遍检查。 (2. If the results seem biased, triple-check your algorithm.)

It’s one thing to produce unexpected results. It’s another thing to produce unexpected results that favor the wealthy and disadvantage everyone else. There’s a growing concern among data scientists and the public about the effects of bias in data science algorithms. If results are not only unexpected but clearly biased against an economic or racial cohort, the algorithm should be re-examined and corrected.

产生意想不到的结果是一回事。 产生意想不到的结果,有利于其他所有人的富人和弱势,这是另一回事。 数据科学家和公众对数据科学算法中偏差的影响越来越关注。 如果结果不仅出乎意料,而且明显不利于经济或种族,则应重新检查和纠正该算法。

3.只要有可能,请寻求专家的帮助。 (3. Whenever possible, get help from experts.)

In April 2020, the Royal Statistical Society (RSS), a charity that promotes statistics for the common good, offered Ofqual the assistance of two of its fellows: Guy Nason, professor of statistics at Imperial College London, and Paula Williamson, professor of medical statistics at the University of Liverpool. But Ofqual would accept their assistance only if they agreed to sign a five-year non-disclosure agreement. The professors understandably refused, so Ofqual ended up applying its scoring algorithm without their guidance.

2020年4月,为促进公共利益而促进统计的慈善机构皇家统计学会 (RSS)向Ofqual 提供了两个研究员的协助 :伦敦帝国理工学院统计学教授Guy Nason和医学教授Paula Williamson利物浦大学的统计数据。 但是,只有当他们同意签署为期五年的保密协议时,Ofqual才会接受他们的帮助。 教授们拒绝了,这是可以理解的,因此Ofqual最终在他们的指导下应用了其评分算法。

Many projects can benefit from a fresh perspective and outside expertise. If you can get second or third opinions, do so.

许多项目可以从崭新的视角和外部专业知识中受益。 如果您可以获得第二或第三意见,请这样做。

4.要透明。 (4. Be transparent.)

It’s troubling that a government agency would try to keep its grading algorithm secret — especially when that algorithm determines which students will end up attending which universities. One can’t help but wonder if Ofqual realized that its algorithm was biased and wished to conceal the details.

令人不安的是,政府机构将试图保密其评分算法,尤其是当该算法确定哪些学生最终将进入哪所大学时。 人们不禁要问,Ofqual是否意识到其算法有偏见,并希望隐瞒细节。

If data scientists want the public to trust the results of their algorithm, then it’s best to be open about how that algorithm works.

如果数据科学家希望公众信任其算法的结果,那么最好对算法的工作方式持开放态度。

As the leadership of the RSS wrote in a letter to the Office of Statistics Regulation on August 14, 2020:

正如RSS的领导在2020年8月14日给统计局的信中写道:

“One issue underpinning trustworthiness of statistics is their quality and accuracy, which is why we have summarised some of our technical concerns. But another element in trustworthiness is the transparency with which the statistics have been set out and considered, and the extent to which they meet public need.”

“统计数据可信赖性的一个问题是统计数据的质量和准确性,这就是我们总结一些技术问题的原因。 但是可信性的另一个要素是,统计数据的制定和考虑的透明度以及满足公众需求的程度。”

Transparency matters. People need to be able understand how criteria are evaluated and decisions are made. Critically, transparent discussions of algorithms should take place before analytical results are shared with the public. Transparency should help guide decision-making, not excuse it.

透明度很重要。 人们需要能够理解如何评估标准和制定决策。 至关重要的是,在与公众分享分析结果之前,应该对算法进行透明的讨论。 透明应该帮助指导决策,而不是原谅。

透明,公平和道德的重要性 (The Importance of Transparency, Fairness, and Ethics)

Ultimately, data science involves more than statistics. It also requires ethics, an open mind, and a clear understanding of the results that algorithms can have on people’s lives.

最终,数据科学不仅涉及统计。 它还需要道德,开放的心态以及对算法可能对人们的生活产生的影响的清晰理解。

Let’s close with these words from the RSS:

让我们从RSS中的这些词结束:

The use of statistics for public good is based only partly on technical statistical issues. Some statistics are technically bad, wrong or worse than others because of the way that data are gathered, or the statistical modelling that takes place. But in many cases, statistics or statistical models are inadequate for the weight being put on them in decision-making, or embed various other judgements that need to be clear. . . . So while we continue to have concerns about various technical decisions made by the qualification regulators, we also believe that having an more open discussion about this well before individual results were announced would have resulted in more trust in, and more trustworthy, statistical choices, in part because there would have been greater understanding of the underlying principles being applied and more detailed justifications of them.”

为公共利益使用统计信息仅部分基于技术统计问题。 由于收集数据的方式或进行的统计建模,某些统计在技术上比其他统计差,错或差。 但是在许多情况下,统计数据或统计模型不足以在决策中施加重担,或者嵌入各种其他需要明确的判断。 。 。 。 因此,尽管我们继续对资格认证监管机构做出的各种技术决策表示担忧,但我们也相信,在宣布单个结果之前就此问题进行更加公开的讨论,将会使人们更加信任,更值得信赖的统计选择。部分是因为人们将对所应用的基本原理有更深入的了解,并对其有更详细的论据。”

They point out that fairness is more than a matter of statistics:

他们指出,公平不仅仅是统计问题:

“‘Fairness’ is not of course a statistical concept. Different and reasonable people will have different judgements about what is ‘fair’, both in general and about this particular issue. . . . But a statistical procedure should be capable of being judged as ‘fair’ or ‘reasonable’ in advance of its being used or knowing which individuals may be affected.”

“公平”当然不是一个统计概念。 总体而言,对于这个“特殊”问题,不同的,合理的人会有不同的判断。 。 。 。 但是统计程序应该能够在被使用或知道哪些人可能受到影响之前被判断为“公平”或“合理”。

The importance of attention to detail, of openness to expert opinion, of transparency, and of a keen sense of what’s fair and how data science results affect real people — these are the lessons that data scientists can take away from the UK’s A-levels debacle.

注重细节,保持专家意见的开放性,透明性以及对公平事物以及数据科学成果如何影响真实人的敏锐感知的重要性,这些都是数据科学家可以从英国A级考试崩溃中吸取的教训。

On this occasion, even the manner of testing itself proves to be educational.

在这种情况下,甚至测试方式本身也被证明具有教育意义。

翻译自: https://medium.com/data-culpa/four-lessons-for-data-scientists-from-the-uks-a-levels-algorithm-debacle-e0e7ea41bd59

英国脑科学领域

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/392544.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

MVC发布后项目存在于根目录中的子目录中时的css与js、图片路径问题

加载固定资源js与css <script src"Url.Content("~/Scripts/js/jquery.min.js")" type"text/javascript"></script> <link href"Url.Content("~/Content/css/shop.css")" rel"stylesheet" type&quo…

telegram 机器人_学习使用Python在Telegram中构建您的第一个机器人

telegram 机器人Imagine this, there is a message bot that will send you a random cute dog image whenever you want, sounds cool right? Let’s make one!想象一下&#xff0c;有一个消息机器人可以随时随地向您发送随机的可爱狗图像&#xff0c;听起来很酷吧&#xff1…

判断输入的字符串是否为回文_刷题之路(九)--判断数字是否回文

Palindrome Number问题简介&#xff1a;判断输入数字是否是回文,不是返回0,负数返回0举例:1:输入: 121输出: true2:输入: -121输出: false解释: 回文为121-&#xff0c;所以负数都不符合3:输入: 10输出: false解释: 倒序为01&#xff0c;不符合要求解法一&#xff1a;这道题比较…

python + selenium 搭建环境步骤

介绍在windows下&#xff0c;selenium python的安装以及配置。1、首先要下载必要的安装工具。 下载python&#xff0c;我安装的python3.0版本,根据你自己的需要安装下载setuptools下载pip(python的安装包管理工具) 配置系统的环境变量 python,需要配置2个环境变量C:\Users\AppD…

VirtualBox 虚拟机复制

本文简单讲两种情况下的复制方式 1 跨电脑复制 2 同一virtrul box下 虚拟机复制 ---------------------------------------------- 1 跨电脑复制 a虚拟机 是老的虚拟机 b虚拟机 是新的虚拟机 新虚拟机b 新建&#xff0c; 点击下一步会生成 相应的文件夹 找到老虚拟机a的 vdi 文…

javascript实用库_编写实用JavaScript的实用指南

javascript实用库by Nadeesha Cabral通过Nadeesha Cabral 编写实用JavaScript的实用指南 (A practical guide to writing more functional JavaScript) Functional programming is great. With the introduction of React, more and more JavaScript front-end code is being …

数据库数据过长避免_为什么要避免使用商业数据科学平台

数据库数据过长避免让我们从一个类比开始 (Lets start with an analogy) Stick with me, I promise it’s relevant.坚持下去&#xff0c;我保证这很重要。 If your selling vegetables in a grocery store your business value lies in your loyal customers and your positi…

mysql case快捷方法_MySQL case when使用方法实例解析

首先我们创建数据库表&#xff1a; CREATE TABLE t_demo (id int(32) NOT NULL,name varchar(255) DEFAULT NULL,age int(2) DEFAULT NULL,num int(3) DEFAULT NULL,PRIMARY KEY (id)) ENGINEInnoDB DEFAULT CHARSETutf8;插入数据&#xff1a;INSERT INTO t_demo VALUES (1, 张…

【~~~】POJ-1006

很简单的一道题目&#xff0c;但是引出了很多知识点。 这是一道中国剩余问题&#xff0c;先贴一下1006的代码。 #include "stdio.h" #define MAX 21252 int main() { int p , e , i , d , n 1 , days 0; while(1) { scanf("%d %d %d %d",&p,&e,&…

Java快速扫盲指南

文章转自&#xff1a;https://segmentfault.com/a/1190000004817465#articleHeader22 JDK&#xff0c;JRE和 JVM 的区别 JVM&#xff1a;java 虚拟机&#xff0c;负责将编译产生的字节码转换为特定机器代码&#xff0c;实现一次编译多处执行&#xff1b; JRE&#xff1a;java运…

xcode扩展_如何将Xcode插件转换为Xcode扩展名

xcode扩展by Khoa Pham通过Khoa Pham 如何将Xcode插件转换为Xcode扩展名 (How to convert your Xcode plugins to Xcode extensions) Xcode is an indispensable IDE for iOS and macOS developers. From the early days, the ability to build and install custom plugins ha…

leetcode 861. 翻转矩阵后的得分(贪心算法)

有一个二维矩阵 A 其中每个元素的值为 0 或 1 。 移动是指选择任一行或列&#xff0c;并转换该行或列中的每一个值&#xff1a;将所有 0 都更改为 1&#xff0c;将所有 1 都更改为 0。 在做出任意次数的移动后&#xff0c;将该矩阵的每一行都按照二进制数来解释&#xff0c;矩…

数据分析团队的价值_您的数据科学团队的价值

数据分析团队的价值This is the first article in a 2-part series!!这是分两部分的系列文章中的第一篇&#xff01; 组织数据科学 (Organisational Data Science) Few would argue against the importance of data in today’s highly competitive corporate world. The tech…

mysql 保留5位小数_小猿圈分享-MySQL保留几位小数的4种方法

今天小猿圈给大家分享的是MySQL使用中4种保留小数的方法&#xff0c;希望可以帮助到大家&#xff0c;让大家的工作更加方便。1 round(x,d)用于数据x的四舍五入, round(x) ,其实就是round(x,0),也就是默认d为0&#xff1b;这里有个值得注意的地方是&#xff0c;d可以是负数&…

leetcode 842. 将数组拆分成斐波那契序列(回溯算法)

给定一个数字字符串 S&#xff0c;比如 S “123456579”&#xff0c;我们可以将它分成斐波那契式的序列 [123, 456, 579]。 形式上&#xff0c;斐波那契式序列是一个非负整数列表 F&#xff0c;且满足&#xff1a; 0 < F[i] < 2^31 - 1&#xff0c;&#xff08;也就是…

博主简介

面向各层次&#xff08;从中学到博士&#xff09;提供GIS和Python GIS案例实验实习培训&#xff0c;以解决问题为导向&#xff0c;以项目实战为主线&#xff0c;以科学研究为思维&#xff0c;不讲概念&#xff0c;不局限理论&#xff0c;简单照做&#xff0c;即学即会。 研究背…

自定义Toast 很简单就可以达到一些对话框的效果 使用起来很方便

自定义一个layout布局 通过toast.setView 设置布局弹出一些警示框 等一些不会改变的提示框 很方便public class CustomToast {public static void showUSBToast(Context context) {//加载Toast布局 View toastRoot LayoutInflater.from(context).inflate(R.layout.toas…

微信小程序阻止冒泡点击_微信小程序bindtap事件与冒泡阻止详解

bindtap就是点击事件在.wxml文件绑定:cilck here在一个组件的属性上添加bindtap并赋予一个值(一个函数名)当点击该组件时, 会触发相应的函数执行在后台.js文件中定义tapMessage函数://index.jsPage({data: {mo: Hello World!!,userid : 1234,},// 定义函数tapMessage: function…

同情机器人_同情心如何帮助您建立更好的工作文化

同情机器人Empathy is one of those things that can help in any part of life whether it’s your family, friends, that special person and even also at work. Understanding what empathy is and how it effects people took me long time. I struggle with human inter…

数据库课程设计结论_结论

数据库课程设计结论When writing about learning or breaking into data science, I always advise building projects.在撰写有关学习或涉足数据科学的文章时&#xff0c;我总是建议构建项目。 It is the best way to learn as well as showcase your skills.这是学习和展示技…