大数据技术 学习之旅_为什么聚焦是您数据科学之旅的关键

大数据技术 学习之旅

David Robinson, a data scientist, has said the following quotes:

数据科学家David Robinson曾说过以下话:

“When you’ve written the same code 3 times, write a function.”

“当您编写了3次相同的代码时,请编写一个函数。”

“When you’ve given the same in-person advice 3 times, write a blog post.”

“当您两次给出相同的面对面建议时,请写一篇博客文章。”

The first quote is something you should adopt soon, but the focus (literally) for this post is the second quote. I wrote an article recently sharing some tips from my data science journey. In this article, I want to share with you the overall theme that I have been giving advice on since that post, focus.

第一个引号是您应该很快采用的,但是本文的重点(从字面上看)是第二个引号。 我最近写了一篇文章, 分享了我在数据科学历程中的一些技巧 。 在本文中,我想与您分享自从发表这篇文章以来,我一直在提供建议的总体主题。

为什么重点很重要? (Why focus is important?)

Image for post
Photo by Nicolas Picard on Unsplash
Nicolas Picard在Unsplash上拍摄的照片

If you were to follow the strands on this spider web, you could end up in many different intersection points.

如果您要跟踪此蜘蛛网上的子线,则可能会遇到许多不同的交点。

Image for post
Photo by Deb Dowd on Unsplash
Deb Dowd在Unsplash上拍摄的照片

You could also take multiple paths to the same intersection point. But there is an optimal path. A shorter path. This is true of the data science field also. Just the number of subfields alone is vast. Even more so if you include the subject knowledge you need for projects if they are not in the same domain. If can quickly feel overwhelming…

您也可以采用多条路径到达相同的交点。 但是有一条最佳的道路。 更短的路径。 数据科学领域也是如此。 仅子域的数量是巨大的。 更重要的是,如果您包含的主题知识不属于同一领域,那么您就需要这些项目。 如果可以很快感到不知所措...

Image for post
Photo by Christian Erfurt on Unsplash
克里斯蒂安·爱尔福特在Unsplash上的照片

It took me 2.5 years to land my data science role. If you haven’t read the prior article, here is some quick background on my situation:

我花了2.5年的时间才能获得数据科学职位。 如果您还没有阅读上一篇文章,请快速了解我的情况:

  1. I am a husband and father to a toddler.

    我是一个小孩的丈夫和父亲。
  2. I was a high school teacher with an hour commute in each direction by car.

    我是一名高中老师,每个方向的通勤时间均为一个小时。
  3. I only had an hour or so a day dedicated to data science since my wife supported me in this career change.

    自从妻子支持我从事这项职业以来,我只有一个小时左右的时间致力于数据科学。

I didn’t focus at the beginning. I started with an overview foundation since I didn’t have much of a programming background. I would still recommend this if you have no background in math and/or coding. The problem came afterward when everything about the field was so fascinating I leaped at everything I could interact with. But it prevented me from mastering anything, leading me into that classic saying…

一开始我没有集中精力。 我从概述基础开始,因为我没有太多的编程背景。 如果您没有数学和/或编码的背景,我仍然会建议这样做。 随后,当有关该领域的所有事情都如此吸引人时,我就跳下了我可以与之互动的一切的问题。 但这阻止了我精通一切,使我陷入了那句经典的话……

“Jack of all trades, master of none.”

“万事通,无精打采。”

Eventually, I felt incredibly overwhelmed. From that, there was a time when I shut down and didn’t practice anything for a few weeks.

最终,我感到难以置信。 从那时起,有一段时间我关闭了并且几周没有练习任何东西。

Image for post
Photo by Ben Weber on Unsplash
本·韦伯在Unsplash上的照片

那么如何避免我的错误呢? (So how can you avoid my mistake?)

There are a couple of approaches you could take and I should have considered sooner:

您可以采取几种方法,我应该早点考虑:

  1. Focus on a particular branch of data science such as natural language processing or data visualization.

    专注于数据科学的特定分支,例如自然语言处理或数据可视化。
  2. Focus on a domain and sculpt your data science skills around projects in that domain.

    专注于某个领域,并围绕该领域的项目雕刻您的数据科学技能。

After I got some help to get out of my rut, I took the second approach. Leveraging my educational background, I focused on solving problems related to the education field from the perspective of a teacher. This led me to:

在获得帮助以摆脱困境后,我采取了第二种方法。 利用我的教育背景,我专注于从老师的角度解决与教育领域有关的问题。 这导致我:

  1. Influencing a hiring decision based on the academic needs of students.

    根据学生的学术需求影响招聘决定。
  2. Created an overview of my school’s performance in a concise report.

    在简明的报告中概述了我学校的表现。
  3. Using a Bayesian version of a T-test to determine if my review lesson improved the student’s understanding and by how much.

    使用贝叶斯T检验确定我的复习课是否提高了学生的理解力以及提高了多少。

  4. Analyzing state exam questions to guide curriculum decisions.

    分析州考试题以指导课程决策。

These projects I put on my LinkedIn profile. They got the attention of people I did not expect. It got the attention of the outside school consultant who ended up providing a lot of future help. It got the attention of a Facebook recruiter for a related data science/education position with a starting salary above $130,000. Discussing my experience with these projects got me past the first round of interviews easily.

这些项目我放在我的LinkedIn个人资料中。 他们引起了我意料之外的人们的注意。 引起了外部学校顾问的注意,他们最终提供了很多未来的帮助。 它吸引了一位Facebook招聘人员的注意,该招聘人员的相关数据科学/教育职位的起薪超过13万美元。 讨论我在这些项目中的经验使我轻松通过了第一轮采访。

My rate of getting interviews and getting further in the rounds soon improved since I became more focused. Again, given my situation, it wasn’t the fastest, but it was a vast improvement compared to my previous rate. Each interview improved how I presented myself. Until eventually…

自从我变得更加专注之后,我获得面试和进一步进步的速度很快就提高了。 同样,鉴于我的情况,它不是最快的,但是与我以前的速度相比,这是一个巨大的进步。 每次采访都改善了我的自我介绍。 直到最后……

Image for post
Photo by bruce mars on Unsplash
布鲁斯· 玛斯 ( Bruce mars)在Unsplash上拍摄的照片

I succeeded! I landed my dream role and broke into the data science field!

我成功了! 我找到了自己梦dream以求的角色,并闯入了数据科学领域!

At the time of writing this, it has been just shy of three months since this new career started and it has been incredible! The people I work with are amazing, I get constant feedback, my work is having an immediate and/or future impact, and I am getting praised for it (as a teacher you don’t get that often so it is important to me…and also I am a kid at heart).

在撰写本文时,距这个新职业生涯还不到三个月,这简直令人难以置信! 与我共事的人很棒,我得到不断的反馈,我的工作具有立竿见影和/或未来的影响,我为此而受到赞誉(作为老师,您很少得到这样的帮助,所以对我来说很重要……)而且我还是个内心的孩子)。

If you are still hunting for your career just know it isn’t impossible. You can do it! Just focus on what you want to do in this field as soon as possible. If you are still experimenting a bit that is ok. But I would recommend doing it quickly if possible. If you are a parent or have a similar situation to me do know it will take longer, but you will get there.

如果您仍在寻找自己的职业,那就知道那并非不可能。 你能行的! 请尽快专注于您要在该领域中要做的事情。 如果您仍在尝试,那还可以。 但是我建议尽可能快地这样做。 如果您是父母或与我有类似的情况,请知道这将花费更长的时间,但是您会到达那里。

When you do get there, you will reflect on your journey up to that point. You will review the good and bad of it all. Finally, you will turn toward the future of your new career, and be amped to get started!

当您到达那里时,您将反思到那时的旅程。 您将回顾所有优点和缺点。 最终,您将转向新职业的未来,并为入门做好准备!

Image for post
Attentie Attentie on Attentie Attentie在UnsplashUnsplash拍摄

Thanks for reading! If you found this post helpful and you haven’t checked out some of the tips from my journey, you can read about them below:

谢谢阅读! 如果您发现这篇文章很有帮助,但还没有从我的旅程中找到一些技巧,则可以在下面阅读有关它们的信息:

Also if you are entering the field with a math background and feel you need help organizing a learning plan, check out my recommendations in this article below:

另外,如果您以数学背景进入该领域,并且认为需要帮助组织学习计划,请在下面的本文中查看我的建议:

You can follow me here or connect with me on Linkedin and Twitter. Open to DM’s on Twitter.

您可以在这里关注我,也可以通过Linkedin和Twitter与我联系。 在Twitter上打开DM。

Until next time,

直到下一次,

John DeJesus

约翰·德耶稣

翻译自: https://towardsdatascience.com/why-focus-is-key-for-your-data-science-journey-b62715b2a1c

大数据技术 学习之旅

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/387897.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

无监督学习 k-means_无监督学习-第4部分

无监督学习 k-means有关深层学习的FAU讲义 (FAU LECTURE NOTES ON DEEP LEARNING) These are the lecture notes for FAU’s YouTube Lecture “Deep Learning”. This is a full transcript of the lecture video & matching slides. We hope, you enjoy this as much as …

vCenter 升级错误 VCSServiceManager 1603

近日,看到了VMware发布的vCenter 6.7 Update 1b的更新消息。其中有一条比较震撼。有误删所有VM的概率,这种BUG谁也承受不起。Removing a virtual machine folder from the inventory by using the vSphere Client might delete all virtual machinesIn t…

day28 socketserver

1. socketserver 多线程用的 例 import socket import timeclientsocket.socket() client.connect(("127.0.0.1",9000))while 1:cmdinput("请输入指令")client.send(cmd.encode("utf-8"))from_server_msgclient.recv(1024).decode("utf…

车牌识别思路

本文源自我之前花了2天时间做的一个简单的车牌识别系统。那个项目,时间太紧,样本也有限,达不到对方要求的95%识别率(主要对于车牌来说,D,0,O,I,1等等太相似了。然后,汉字…

深度学习算法原理_用于对象检测的深度学习算法的基本原理

深度学习算法原理You just got a new drone and you want it to be super smart! Maybe it should detect whether workers are properly wearing their helmets or how big the cracks on a factory rooftop are.您刚刚拥有一架新无人机,并希望它变得超级聪明&…

【python】numpy库linspace相同间隔采样 详解

linspace可以用来实现相同间隔的采样; numpy.linspace(start,stop,num50,endpointTrue,retstepFalse, dtypeNone) 返回num均匀分布的样本,在[start, stop]。 Parameters(参数): start : scalar(标量) The starting value of the sequence(序列的起始点)…

Spring整合JMS——基于ActiveMQ实现(一)

Spring整合JMS——基于ActiveMQ实现(一) 1.1 JMS简介 JMS的全称是Java Message Service,即Java消息服务。它主要用于在生产者和消费者之间进行消息传递,生产者负责产生消息,而消费者负责接收消息。把它应用到实际的…

CentOS7+CDH5.14.0安装全流程记录,图文详解全程实测-8CDH5安装和集群配置

Cloudera Manager Server和Agent都启动以后,就可以进行CDH5的安装配置了。 准备文件 从 http://archive.cloudera.com/cdh5/parcels/中下载CDH5.14.0的相关文件 把CDH5需要的安装文件放到主节点上,新建目录为/opt/cloudera/parcel-repo把我们之前下载的…

node.js安装部署测试

(一)安装配置: 1:从nodejs.org下载需要的版本 2:直接安装,默认设置 ,默认安装在c:\program files\nodejs下。 3:更改npm安装模块的默认目录 (默认目录在安装目录下的node…

社群系统ThinkSNS+ V2.2-V2.3升级教程

WARNING本升级指南仅适用于 2.2 版本升级至 2.3 版本,如果你并非 2.2 版本,请查看其他升级指南,Plus 程序不允许跨版本升级!#更新代码预计耗时: 2 小时这是你自我操作的步骤,确认将你的 2.2 版本代码升级到…

activemq部署安装

一、架构和技术介绍 1、简介 ActiveMQ 是Apache出品,最流行的,能力强劲的开源消息总线。完全支持JMS1.1和J2EE 1.4规范的 JMS Provider实现 2、activemq的特性 1. 多种语言和协议编写客户端。语言: Java, C, C, C#, Ruby, Perl, Python, PHP。应用协议: …

主串与模式串的匹配

主串与模式串的匹配 (1)BF算法: BF算法比较简单直观,其匹配原理是主串S.ch[i]和模式串T.ch[j]比较,若相等,则i和j分别指示串中的下一个位置,继续比较后续字符,若不相等,从…

什么是 DDoS 攻击?

欢迎访问网易云社区,了解更多网易技术产品运营经验。 全称Distributed Denial of Service,中文意思为“分布式拒绝服务”,就是利用大量合法的分布式服务器对目标发送请求,从而导致正常合法用户无法获得服务。通俗点讲就是利用网络…

nginx 并发过十万

一般来说nginx 配置文件中对优化比较有作用的为以下几项: worker_processes 8; nginx 进程数,建议按照cpu 数目来指定,一般为它的倍数。 worker_cpu_affinity 00000001 00000010 00000100 00001000 00010000 00100000 01000000 10000000; 为每…

神经网络使用情景

神经网络使用情景 人脸/图像识别语音搜索文本到语音(转录)垃圾邮件筛选(异常情况探测)欺诈探测推荐系统(客户关系管理、广告技术、避免用户流失)回归分析 为何选择Deeplearning4j? …

GitHub常用命令及使用

GitHub使用介绍 摘要: 常用命令: git init 新建一个空的仓库git status 查看状态git add . 添加文件git commit -m 注释 提交添加的文件并备注说明git remote add origin gitgithub.com:jinzhaogit/git.git 连接远程仓库git push -u origin master 将本地…

deeplearning4j

deeplearning4j 是基于java的深度学习库,当然,它有许多特点,但暂时还没学那么深入,所以就不做介绍了 需要学习dl4j,无从下手,就想着先看看官网的examples,于是,下载了examples程序&a…

推理编程_答案集编程的知识表示和推理

推理编程Read about the difference between declarative and imperative programming and learn from code examples (Answer Set Programming, Python and C).了解声明式和命令式编程之间的区别,并从代码示例(答案集编程,Python和C)中学习。 介绍 (In…

python安装包

由于Google、YouTube等大型公司的推广,Python编程语言越来越受欢迎,很多编程爱好者,也将Python做为了首先的编程语言。 今天我们就来讲一下,学习的第一步,安装Python IDLE编辑器,也它的调试和使用。 第一步…

104 权限 sudo 解压缩

主要内容:https://www.cnblogs.com/pyyu/articles/9355477.html 1 查看系统版本信息: #查看系统版本信息 cat /etc/redhat-release CentOS Linux release 7.4.1708 (Core) #查看内核版本号 uname -r 3.10.0-693.el7.x86_64 #查看系统多少位 uname -m x86_64 #查看内核所有信息…