The Demise of Matlab in Data Science


Opinion

My personal interest in Data Science dates back to 2011. I was learning more about economics and wanted to test some of the ‘classic’ theories. Whilst many of them held their ground at a micro level, many others were pure fiction. Much of what you are taught, such as savings and investment being directly related, or supply and demand being directly related to price, just didn’t hold true.

To be conclusive in my research, though, I had to be sure that any conclusion I drew was unadulterated and came from the data. It’s all well and good that some notable academic from some notable university tells us about a theory, but is it actually true? Is it true everywhere?

Collecting data wasn’t too hard, but from there I had to teach myself programming. Python was free, easy to use, and the ‘new thing’ that people said I should learn to ‘future-proof’ my knowledge. However, after learning it and persuading a company to let me join its graduate scheme, I began to use Matlab at work.

Ever since, I’ve had this conflict: Matlab feels like a better language to work in, but Python has always been my, you could say, ‘mother tongue’.

This article explains why, after almost 10 years of experience in both languages, I feel that Python absolutely dominates Matlab in Data Science, and why new Data Scientists should focus solely on Python.

Python and Matlab are similar and different at the same time. Matlab was created by a private company as a closed platform with a high price tag. Python, on the other hand, was created with ‘openness’ in mind, to be simple and easy to use for all general tasks.

Matlab got a head start in the popularity contest, as it was released in 1984. Although I wasn’t around back then, the various iterations of the language lent themselves well to the discipline of mathematics, because vectors and multi-dimensional matrices are super simple to use in Matlab. Python only gained a comparable feature later, with Numpy (which was still kind of annoying), but Pandas has made using Python infinitely easier. Given that change, does my thesis hold true?
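To make the comparison concrete, here is a minimal sketch of what matrix work looks like in Python today. The matrix values and the row and column labels are made up for illustration, and the Matlab equivalents in the comments are only rough analogies.

```python
import numpy as np
import pandas as pd

# A 2x3 matrix and a length-3 vector (illustrative values only).
A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])
v = np.array([1.0, 0.0, -1.0])

# Matrix-vector product, roughly A * v in Matlab (with v as a column vector).
product = A @ v

# Element-wise arithmetic needs no explicit loop, roughly A .* 2 in Matlab.
doubled = A * 2

# Pandas wraps the same array with labelled rows and columns.
df = pd.DataFrame(A, columns=["x", "y", "z"], index=["first", "second"])
column_means = df.mean()  # one mean per labelled column

print(product)
print(column_means)
```

The labelled columns are arguably where Pandas goes beyond a plain matrix: an operation such as `df.mean()` is expressed in terms of column names rather than positional indices.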

Was Pandas the reason why Python began to overtake Matlab?

We know that Pandas started out as an internal library at AQR Capital, written by Wes McKinney. Looking at Stack Overflow Trends, we can see that from 2012 onwards the percentage of questions tagged Pandas began to increase, with a sharp inflection point in 2015.

Now notice that since 2013/2014, the number of questions about Python as a language began to increase as well. Makes sense, right? Pandas is a Python library, so naturally the two are related, however …

…since 2015, Matlab has been on a downward spiral. The proportion of questions tagged Matlab has been falling steadily. Why is that?

Matlab is Expensive and Inflexible: students cannot afford it

Matlab is a closed platform where every new library has to be paid for before you can use its built-in functions, so things can become very expensive very quickly.

It’s pretty crazy, but if you practise Machine Learning in Matlab (as I have before), you either have to shell out for a number of libraries (which depend on each other) or build everything yourself from scratch.

This isn’t necessarily a problem for those in industry with a big budget, but if you’re playing with the latest deep neural network, or looking to create GPT-4, you’ll need several libraries and will pay through the nose, whereas in Python you can use Tensorflow and Numpy for free.

Also, assuming you want to put any code that you’ve built into production, be prepared to build some form of Matlab wrapper or to code the final version in a different language. And yes, I’ve actually had to do that.

Matlab and Java do work together, but why bother learning two languages and building some weird symbiotic system when you can simply learn Python and have an entire tech stack in one language? It’s so much easier, so much more flexible and so much more cost-effective.

Matlab has little online support

Also, if the cost isn’t enough to deter you, the amount of online support in Matlab land is far lower than you would expect. Matlab does deserve some respect in that its Help functionality is better than that of most languages and usually solves most of the problems you face. But if you need additional help on the internet, Python has roughly 100x more questions on Stack Overflow than Matlab does. That means far fewer people to help you out, and yes, I’ve been stuck with a question that no one could solve before.

There’s no free lunch, but for all the reasons above, the use case for Matlab in Statistics and Machine Learning has shrunk considerably since Python got good with Pandas. I could go on all day about how much more useful Python is than Matlab, but I’ll save you the earache.

But at the same time, Python is free. It’s verbose, and it has succeeded in many places where Matlab fell short. I thoroughly encourage all future Data Scientists and Statisticians to let Python be their mother tongue.

It’ll be the best decision you make.

Thanks for reading! If you have any messages, please let me know!

Translated from: https://medium.com/swlh/the-demise-of-matlab-in-data-science-bfb74e42dc8e
