bi数据分析师_BI工程师和数据分析师的5个格式塔原则

bi数据分析师

Image for post
Image by Author
图片作者

将美丽融入数据 (Putting the Beauty in Data)

Have you ever been ravished by Vizzes on Tableau Public that look like only magic could be in play to display so much data in such a pleasing way?

您是否曾经被Tableau Public上的Vizzes迷住了,看起来只有魔术才能发挥作用,以这种令人愉悦的方式显示如此多的数据?

Most of those Vizzes are a result of exploiting how humans perceive visual landscapes (parts of a whole). The principles in play here are called the Gestalt principles. Gestalt in German means “Pattern” or “Configuration”. The Gestalt principles aim to make the process of visual design aesthetically pleasing as well as the data easier to consume.

这些Vizzes中的大多数是利用人类如何看待视觉景观(整体的一部分)的结果。 这里发挥作用的原则称为格式塔原则。 格式塔在德国的意思是“样式”或“配置”。 格式塔原理旨在使视觉设计过程更加美观,并使数据更易于使用。

There are numerous Gestalt principles, but I will be demonstrating the 6 most common ones, using Tableau’s most frequently used ‘Sample Superstore’ dataset. Though these principles are being explained individually, often more than one Gestalt principle is used in a single visualisation. However, a single principle tends to dominate.

格式塔(Gestalt)原理有很多,但我将使用Tableau最常用的“样本超市”(Sample Superstore)数据集演示6种最常见的原理。 尽管对这些原理进行了单独说明,但在一次可视化中通常会使用多个格式塔原理。 但是,一个原则倾向于占主导地位。

1.邻近 (1. Proximity)

The principle of proximity states that objects that are placed close to each other are perceived as more related as compared to the ones that are placed far apart

接近原理指出,相较于相隔较远的物体,彼此靠近的物体被认为更相关

Image for post
Proximity Principle (Image by Author)
接近原则(作者提供)

In the above visualisation, profit realised by the store is presented by region and by month. Using the principle of proximity, we see profits of all the months in East region as related to each other. Similarly for Central, West and South regions.

在上面的可视化中,商店实现的利润按地区和月份显示。 使用接近原理,我们看到东部地区所有月份的利润彼此相关。 中部,西部和南部地区也是如此。

2.相似性 (2. Similarity)

The principle of similarity states that objects that are similar to each other are perceived as more related that the dissimilar ones

相似性原理指出,彼此相似的对象比不相似的对象更相关

Here, similarity means similarity by shape, colour, size, font, texture, etc.

在这里,相似性是指形状,颜色,大小,字体,纹理等之间的相似性。

Image for post
Similarity of colour and size (Image by Author)
颜色和尺寸的相似性(作者提供的图片)

In the above graph, which depicts the profit realised by the store each month from 2011 to 2013, and by product category, the concept of similarity plays out well. The circles belonging to a single colour pertain to a single region, which is the use of the Similarity Principle. The area of the circle depicts the relative magnitude of the profit in each group for each month.

上图描绘了商店从2011年到2013年每个月实现的利润,并按产品类别描述了相似性的概念。 属于单一颜色的圆属于一个区域,这是使用相似原理的。 圆圈区域表示每个月每个组的利润的相对大小。

3.关闭 (3. Closure)

The principle of closure states that the human mind tends to perceive a collection of multiple elements as a components of a whole

封闭原则指出,人类的思维倾向于将多个元素的集合视为整体的组成部分

Image for post
Closure Principle (Image by Author)
封闭原则(作者提供)

The above Viz shows the profit earned by region and by month. The arrangement of the components is in such a way that it easy to perceive four polygons (West and East as rectangles, South and Central as squares). Though each of the larger polygons are made of numerous individual blocks.

上面的Viz显示了按地区和按月获得的利润。 组件的排列方式很容易感知四个多边形(西和东为矩形,南和中为方形)。 虽然每个较大的多边形均由许多单独的块组成。

4.连续性 (4. Continuity)

The principle of continuity states that elements arrange on a line or curve are perceived to be more related as compared to those that are not on it

连续性原则指出,与不位于线上或曲线上的元素相比,排列在直线或曲线上的元素被认为更相关

Image for post
Image for post
Sense of Continuity (Image by Author)
连续感(作者提供)

The two pictures above demonstrate this principle. The data that is being plotted is the total profit realised by the store each month from 2011 to 2013. The graph on the left plots the data as disconnected dots. This seems to be difficult to understand. Has the profit grown or reduced month over month?

上面的两张图片展示了这一原理。 正在绘制的数据是商店从2011年到2013年每个月实现的总利润。左侧的图形将数据绘制为不连续的点。 这似乎很难理解。 利润是否逐月增加或减少?

The graph on the right makes it far more comprehensible. The same data is plotted as a line connecting the data points. The connectivity gives a better perspective of the month on month profit trend.

右图使它更容易理解。 相同的数据绘制为连​​接数据点的线。 连接性使您可以更好地了解月度利润趋势。

5.并行性 (5. Parallelism)

The principle of parallelism states that elements arranged parallel to each other are perceived to be more related as compared to those that aren’t parallel

并行性原则指出,与不平行的元素相比,彼此平行排列的元素被认为更相关

Image for post
Parallel and Non-Parallel Trends (Image by Author)
并行和非并行趋势(作者提供的图像)

This might seem like the easiest principle to demonstrate. The lines that are parallel to each other show a similar trend in profits while the non-parallel ones show dissimilar trends.

这似乎是最容易证明的原理。 彼此平行的线显示出相似的利润趋势,而非平行线则显示出不同趋势。

大! 但是我们如何使用这些原则? (Great! But how do we use these principles?)

It would be a miracle to fit all of them into one single Viz and managing to make it look appealing. Finding the right number of principles to use is best left to the artist (now that you are working on appealing to the visual senses, you’re an Artist!)

将所有这些安装到单个Viz中并设法使其具有吸引力将是一个奇迹。 找到合适数量的使用原则最好由艺术家决定(既然您正在努力吸引视觉感官,那么您就是艺术家!)

However, using the Gestalt principles each time you ideate your dashboards or vizzes might help in making them aesthetically enticing as well as easy to consume.

但是,每次构思仪表盘或Vizz时都使用格式塔原则,可能有助于使它们具有美观的吸引力并易于使用。

翻译自: https://medium.com/@analyticsfortheimpatient/5-gestalt-principles-for-bi-engineers-and-data-analysts-69033502a209

bi数据分析师

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/389599.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

BSOJ 2423 -- 【PA2014】Final Zarowki

Description 有n个房间和n盏灯,你需要在每个房间里放入一盏灯。每盏灯都有一定功率,每间房间都需要不少于一定功率的灯泡才可以完全照亮。 你可以去附近的商店换新灯泡,商店里所有正整数功率的灯泡都有售。但由于背包空间有限,你…

WPF绑定资源文件错误(error in binding resource string with a view in wpf)

报错:无法将“***Properties.Resources.***”StaticExtension 值解析为枚举、静态字段或静态属性 解决办法:尝试右键单击在Visual Studio解决方案资源管理器的资源文件,并选择属性选项,然后设置自定义工具属性 PublicResXFile cod…

因果推论第六章

因果推论 (Causal Inference) This is the sixth post on the series we work our way through “Causal Inference In Statistics” a nice Primer co-authored by Judea Pearl himself.这是本系列的第六篇文章,我们将通过Judea Pearl本人与他人合着的《引诱统计学…

如何优化网站加载时间

一、背景 我们要监测网站的加载情况,可以使用 window.performance 来简单的检测。 window.performance 是W3C性能小组引入的新的API,目前IE9以上的浏览器都支持。一个performance对象的完整结构如下图所示: memory字段代表JavaScript对内存的…

熊猫数据集_处理熊猫数据框中的列表值

熊猫数据集Have you ever dealt with a dataset that required you to work with list values? If so, you will understand how painful this can be. If you have not, you better prepare for it.您是否曾经处理过需要使用列表值的数据集? 如果是这样&#xff0…

旋转变换(一)旋转矩阵

1. 简介 计算机图形学中的应用非常广泛的变换是一种称为仿射变换的特殊变换,在仿射变换中的基本变换包括平移、旋转、缩放、剪切这几种。本文以及接下来的几篇文章重点介绍一下关于旋转的变换,包括二维旋转变换、三维旋转变换以及它的一些表达方式&#…

数据预处理 泰坦尼克号_了解泰坦尼克号数据集的数据预处理

数据预处理 泰坦尼克号什么是数据预处理? (What is Data Pre-Processing?) We know from my last blog that data preprocessing is a data mining technique that involves transforming raw data into an understandable format. Real-world data is often incom…

Pytorch中DNN入门思想及实现

DNN全连接层(线性层) 计算公式: y w * x b W和b是参与训练的参数 W的维度决定了隐含层输出的维度,一般称为隐单元个数(hidden size) b是偏差值(本文没考虑) 举例: 输…

IDEA去除mapper.xml文件中的sql语句的背景色

2019独角兽企业重金招聘Python工程师标准>>> IDEA版本 2017.3 mapper.xml文件中的sql语句,总是黄色一大片,看起来不舒服。 按如下设置进行设置即可 此时设置完还有点背景色 再进行一个设置 Ok,完美解决 转载于:https://my.oschina.net/u/3939…

vc6.0 绘制散点图_vc有关散点图的一切

vc6.0 绘制散点图Scatterplots are one of the most popular visualization techniques in the world. Its purposes are recognizing clusters and correlations in ‘pairs’ of variables. There are many variations of scatter plots. We will look at some of them.散点图…

Pytorch中RNN入门思想及实现

RNN循环神经网络 整体思想: 将整个序列划分成多个时间步,将每一个时间步的信息依次输入模型,同时将模型输出的结果传给下一个时间步,也就是说后面的结果受前面输入的影响。 RNN的实现公式: 个人思路: 首…

小扎不哭!FB又陷数据泄露风波,9000万用户受影响

对小扎来说,又是多灾多难的一个月。 继不久前Twitter曝出修补了一个可能造成数以百万计用户私密消息被共享给第三方开发人员的漏洞,连累Facebook股价跟着短线跳水之后,9月28日,Facebook又双叒叕曝出因安全漏洞遭到黑客攻击&#…

在衡量欧洲的政治意识形态时,调查规模的微小变化可能会很重要

(Related post: On a scale from 1 to 10, how much do the numbers used in survey scales really matter?)(相关文章: 从1到10的量表,调查量表中使用的数字到底有多重要? ) At Pew Research Center, survey questions about respondents’…

Pytorch中CNN入门思想及实现

CNN卷积神经网络 基础概念: 以卷积操作为基础的网络结构,每个卷积核可以看成一个特征提取器。 思想: 每次观察数据的一部分,如图,在整个矩阵中只观察黄色部分33的矩阵,将这【33】矩阵(点乘)权重得到特…

事件映射 消息映射_映射幻影收费站

事件映射 消息映射When I was a child, I had a voracious appetite for books. I was constantly visiting the library and picking new volumes to read, but one I always came back to was The Phantom Tollbooth, written by Norton Juster and illustrated by Jules Fei…

前端代码调试常用

转载于:https://www.cnblogs.com/tabCtrlShift/p/9076752.html

Pytorch中BN层入门思想及实现

批归一化层-BN层(Batch Normalization) 作用及影响: 直接作用:对输入BN层的张量进行数值归一化,使其成为均值为零,方差为一的张量。 带来影响: 1.使得网络更加稳定,结果不容易受到…

匿名内部类和匿名类_匿名schanonymous

匿名内部类和匿名类Everybody loves a fad. You can pinpoint someone’s generation better than carbon dating by asking them what their favorite toys and gadgets were as a kid. Tamagotchi and pogs? You were born around 1988, weren’t you? Coleco Electronic Q…

Pytorch框架中SGD&Adam优化器以及BP反向传播入门思想及实现

因为这章内容比较多,分开来叙述,前面先讲理论后面是讲代码。最重要的是代码部分,结合代码去理解思想。 SGD优化器 思想: 根据梯度,控制调整权重的幅度 公式: 权重(新) 权重(旧) - 学习率 梯度 Adam…

朱晔和你聊Spring系列S1E3:Spring咖啡罐里的豆子

标题中的咖啡罐指的是Spring容器,容器里装的当然就是被称作Bean的豆子。本文我们会以一个最基本的例子来熟悉Spring的容器管理和扩展点。阅读PDF版本 为什么要让容器来管理对象? 首先我们来聊聊这个问题,为什么我们要用Spring来管理对象&…