fastai学习:01_intro Questionnaire

fastAI Questionnaire
感觉还挺多的,怪不得说每一课要额外8小时进行学习。

1.Do you need these for deep learning?
Lots of math T / F
Lots of data T / F
Lots of expensive computers T / F
A PhD T / F
F F F F

2.Name five areas where deep learning is now the best in the world.
自然语言学习(NLP),计算机视觉(CV),医学(Medicine),生物(Biology),图像生成(Image generation),推荐系统(Recommendation systems)

3.What was the name of the first device that was based on the principle of the artificial neuron?
The Mark I Perceptron

4.Based on the book of the same name, what are the requirements for parallel distributed processing (PDP)?
A set of processing units
A state of activation
An output function for each unit
A pattern of connectivity among units
A propagation rule for propagating patterns of activities through the network of connectivities
An activation rule for combining the inputs impinging on a unit with the current state of that unit to produce an output for the unit
A learning rule whereby patterns of connectivity are modified by experience
An environment within which the system must operate

5.What were the two theoretical misunderstandings that held back the field of neural networks?
In the 1980’s most models were built with a second layer of neurons, thus avoiding the problem that had been identified by Minsky and Papert (this was their “pattern of connectivity among units,” to use the framework above). And indeed, neural networks were widely used during the '80s and '90s for real, practical projects. However, again a misunderstanding of the theoretical issues held back the field. In theory, adding just one extra layer of neurons was enough to allow any mathematical function to be approximated with these neural networks, but in practice such networks were often too big and too slow to be useful.

6.What is a GPU?
Graphics Processing Unit (GPU): Also known as a graphics card. A special kind of processor in your computer that can handle thousands of single tasks at the same time, especially designed for displaying 3D environments on a computer for playing games. These same basic tasks are very similar to what neural networks do, such that GPUs can run neural networks hundreds of times faster than regular CPUs. All modern computers contain a GPU, but few contain the right kind of GPU necessary for deep learning.

7.Open a notebook and execute a cell containing: 1+1. What happens?
2

8.Follow through each cell of the stripped version of the notebook for this chapter. Before executing each cell, guess what will happen.
完成

9.Complete the Jupyter Notebook online appendix.
没找到哪里有appendix

10.Why is it hard to use a traditional computer program to recognize images in a photo?
传统意义上来说,我们一般写一个函数,从输出得到结果,这种方式无法应用于图像识别

11.What did Samuel mean by “weight assignment”?
Weights are just variables, and a weight assignment is a particular choice of values for those variables. The program’s inputs are values that it processes in order to produce its results—for instance, taking image pixels as inputs, and returning the classification “dog” as a result. The program’s weight assignments are other values that define how the program will operate.

12.What term do we normally use in deep learning for what Samuel called “weights”?
model parameters

13.Draw a picture that summarizes Samuel’s view of a machine learning model.

14.Why is it hard to understand why a deep learning model makes a particular prediction?
一般程序写好了之后,重复运行应该得到同样的结果,但是深度学习的结果可能存在不同。

15.What is the name of the theorem that shows that a neural network can solve any mathematical problem to any level of accuracy?
万能近似定理(the universal approximation theorem)

16.What do you need in order to train a model?
A model cannot be created without data.
A model can only learn to operate on the patterns seen in the input data used to train it.
This learning approach only creates predictions, not recommended actions.
It’s not enough to just have examples of input data; we need labels for that data too

17.How could a feedback loop impact the rollout of a predictive policing model?
A predictive policing model is created based on where arrests have been made in the past. In practice, this is not actually predicting crime, but rather predicting arrests, and is therefore partially simply reflecting biases in existing policing processes.
Law enforcement officers then might use that model to decide where to focus their police activity, resulting in increased arrests in those areas.
Data on these additional arrests would then be fed back in to retrain future versions of the model.

18.Do we always have to use 224×224-pixel images with the cat recognition model?
不是,提高分辨率可能会增加精度,但是会增加计算量

19.What is the difference between classification and regression?
Classification and regression have very specific meanings in machine learning. These are the two main types of model that we will be investigating in this book. A classification model is one which attempts to predict a class, or category. That is, it’s predicting from a number of discrete possibilities, such as “dog” or “cat.” A regression model is one which attempts to predict one or more numeric quantities, such as a temperature or a location.

20.What is a validation set? What is a test set? Why do we need them?
The validation set is the portion of the dataset that is not used for training the model, but for evaluating the model during training, in order to prevent overfitting. This ensures that the model performance is not due to “cheating” or memorization of the dataset, but rather because it learns the appropriate features to use for prediction. However, it is possible that we overfit the validation data as well. This is because the human modeler is also part of the training process, adjusting hyperparameters and training procedures according to the validation performance. Therefore, another unseen portion of the dataset, the test set, is used for final evaluation of the model. This splitting of the dataset is necessary to ensure that the model generalizes to unseen data.

21.What will fastai do if you don’t provide a validation set?
fastai会自动创建,默认为20%

22.Can we always use a random sample for a validation set? Why or why not?
不能,对于有些类型的数据,随机选择可能并不好,比如有时间相关性的数据,也许分段会更好。

23.What is overfitting? Provide an example.
过拟合,很好地拟合了已有数据,但是缺乏一般性,比如用高阶函数拟合二次函数

24.What is a metric? How does it differ from “loss”?
A metric is a function that measures the quality of the model’s predictions using the validation set, and will be printed at the end of each epoch.
The concept of a metric may remind you of loss, but there is an important distinction. The entire purpose of loss is to define a “measure of performance” that the training system can use to update weights automatically. In other words, a good choice for loss is a choice that is easy for stochastic gradient descent to use. But a metric is defined for human consumption, so a good metric is one that is easy for you to understand, and that hews as closely as possible to what you want the model to do. At times, you might decide that the loss function is a suitable metric, but that is not necessarily the case.

25.How can pretrained models help?
A model that has weights that have already been trained on some other dataset is called a pretrained model. You should nearly always use a pretrained model, because it means that your model, before you’ve even shown it any of your data, is already very capable. And, as you’ll see, in a deep learning model many of these capabilities are things you’ll need, almost regardless of the details of your project. For instance, parts of pretrained models will handle edge, gradient, and color detection, which are needed for many tasks.
预训练模型可以较少计算量,提高效率,从一个较高的起点开始训练

26.What is the “head” of a model?
The head of a model is the part that is newly added to be specific to the new dataset. An epoch is one complete pass through the dataset. After calling fit, the results after each epoch are printed, showing the epoch number, the training and validation set losses (the “measure of performance” used for training the model), and any metrics you’ve requested (error rate, in this case).

27.What kinds of features do the early layers of a CNN find? How about the later layers?
early layers:represent diagonal, horizontal, and vertical edges, as well as various different gradients.
later layers:the features are now able to identify and match with higher-level semantic components, such as car wheels, text, and flower petals.

28.Are image models only useful for photos?
No.An image recognizer can, as its name suggests, only recognize images. But a lot of things can be represented as images, which means that an image recogniser can learn to complete many tasks.

29.What is an “architecture”?
The template of the model that we’re trying to fit; the actual mathematical function that we’re passing the input data and parameters to

30.What is segmentation?
Creating a model that can recognize the content of every individual pixel in an image is called segmentation.

31.What is y_range used for? When do we need it?
用于预测连续型的

32.What are “hyperparameters”?
In realistic scenarios we rarely build a model just by training its weight parameters once. Instead, we are likely to explore many versions of a model through various modeling choices regarding network architecture, learning rates, data augmentation strategies, and other factors we will discuss in upcoming chapters. Many of these choices can be described as choices of hyperparameters. The word reflects that they are parameters about parameters, since they are the higher-level choices that govern the meaning of the weight parameters.

33.What’s the best way to avoid failures when using AI in an organization
Make sure a training, validation, and testing set is defined properly in order to evaluate the model in an appropriate manner.
Try out a simple baseline, which future models should hopefully beat. Or even this simple baseline may be enough in some cases.

反复看了几遍,又看了看书,感觉讲得真的很容易听懂,尤其是讲发展历史讲得很好,问题的设置感觉也很合理,和那种动不动就证明求导过程的舒服多了。
最后一题找了半天没找到,最后论坛搜索抄了一下。

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/550498.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

fastai学习——第二个问题

第二节课需要使用bing image search api获取bing图片搜索中的熊图片,此时发现获取api需要注册azure,卡在绑定卡上很久,想了想还要去弄一张带visa的卡,还是算了,就用猫狗大战数据集实验吧,按照与学习视频中类…

fastai学习:02_production Questionnaire

1.Where do text models currently have a major deficiency? Deep learning is currently not good at generating correct responses! We don’t currently have a reliable way to, for instance, combine a knowledge base of medical information with a deep learning m…

fastai学习:04_mnist_basics Questionnaire

1.How is a grayscale image represented on a computer? How about a color image? 灰度图:单通道,0-256 彩色图:三通道RGB或HSV,0-256 2.How are the files and folders in the MNIST_SAMPLE dataset structured? Why? 分为…

fastai学习:05_pet_breeds Questionnaire

1.Why do we first resize to a large size on the CPU, and then to a smaller size on the GPU? 首先,在训练模型时,我们希望能够将图片的尺寸统一,整理为张量,传入GPU,我们还希望最大限度地减少执行不同增强计算的…

fastai学习:06_multicat Questionnarie

1.How could multi-label classification improve the usability of the bear classifier? 可以对不存在的熊进行分类 2.How do we encode the dependent variable in a multi-label classification problem? One-hot encoding: Using a vector of zeros, with a one in each…

【论文阅读笔记】Detecting Camouflaged Object in Frequency Domain

1.论文介绍 Detecting Camouflaged Object in Frequency Domain 基于频域的视频目标检测 2022年发表于CVPR [Paper] [Code] 2.摘要 隐藏目标检测(COD)旨在识别完美嵌入其环境中的目标,在医学,艺术和农业等领域有各种下游应用。…

ubuntu中使用firefox浏览器播放bilibili的h5网页视频

安装好系统后,直接firefox打开bilibili显示没有flash插件 找了一圈没有发现自动播放h5的选项 搜索了一下发现可能是需要解码器 sudo apt-get install ubuntu-restricted-extras就能看了

ubuntu挂起唤醒后十几秒钟就自动熄屏一次

昨天晚上笔记本没关机,ubuntu挂起一晚上,今天早上打开电脑,发现每过十几秒钟就自动熄屏一次,重启之后好了,不知道什么原因 搜索了一下说可能是DPMS的问题,用xset -dpms可以关闭电源管理选项 但是本来的设置…

python3 上传文件到目标机器_Python3 +服务器搭建私人云盘,再也不怕限速了

先来看看效果电脑访问手机访问Windows版本搭建(1).首先你需要在你的电脑上或者服务器上安装Python3.X。(2).然后通过如下指令来安装updog库,网上有很多关于updog的介绍,我这里就不详细说pip3 install updog(3).静静的等他安装完成,然后执行以…

Ubuntu下绘图软件krita64位无中文问题

ubuntu20 sudo apt install krita-l10n 就有了 参考:https://bbs.deepin.org/post/181669

tableau度量值计算_Tableau图表界面组成介绍

声明:内容来源拉勾教育数据分析训练营课程视频1 Tableau工作表基本界面基础概念:维度、度量、聚合、粒度。维度: 维度包含定量值(例如名称、日期或地理数据),可以使用维度进行分类、分段以及揭示数据中的详细信息。维度影响视图中的详细级别。…

小强升职记思维导图_你学会用 “思维导图” 学英语了吗?

今天我们来讲讲目前比较火爆的“思维导图学习法”。思维导图又叫“MIND MAP”,是英国人托尼博赞发明的一种思维工具。托尼博赞本人在心理学、语言学、数学以及科学方向均获得过学位,而且他还创造了世界脑力奥林匹克运动。虽然大师已逝,但是这…

ubuntu下创建软件图标和直接点文件打开

ubuntu中有一些从github上下载的软件或者是appimage软件,能够使用,但是不在应用程序中显示,也不能直接点文件来打开程序 以cajviewer为例子,下下来是CAJViewer-x86_64-buildubuntu1604-210401.AppImage 打开目录/usr/share/appli…

hive币涨幅空间大吗_自动消防水炮只能安装在大空间场所吗

在大家不了解或者不清楚自动消防水炮的时候,经过一些厂家解释或者了解产品后,都知道是一种能够自动跟踪定位火焰并在短时内灭火的喷水系统,而且适用于安装在一些高大空间场所中,那么这是不是意味着,只能在大空间场所安…

可以直接考甲级吗_函授本科可以考四级吗

函授本科是可以考英语四级的。但必须经过学生所在学校同意,才可以在本校报名参加考试。函授本科可以考四级吗目前来说不管是函授大专还是本科,是可以考英语四级的,但应经所报考的学校同意,可在成人高考报考学校报名参加考试。函授本科用处大不大?函授本…

duration转为时间戳_Flink Table APIamp;SQL编程指南之时间属性(3)

Flink总共有三种时间语义:Processing time(处理时间)、Event time(事件时间)以及Ingestion time(摄入时间)。关于这些时间语义的具体解释,可以参考另一篇文章Flink的时间与watermarks详解。本文主要讲解Flink Table API & SQL中基于时间的算子如何定…

旅游系统_旅游标识系统,必须真的“旅游化”

标识是为游客传递路线,指明景点位置、起安全警示作用以及传达公园发展理念的标识(牌)或标识物,是公园的重要组成部分,有助于旅游者顺利完成游览过程,获得满意的旅游体验。好的完善的标识系统,可以起到画龙点睛的作用&a…

如何在linux下启动和关闭oracle服务

1.前言 确保我们能够访问oracle数据库包含两部分,一个是oracle实例,一个是监听,两个同时开启,我们才能正常的使用数据库,因此我们在关闭和启动oracle服务时,也需要同时操作实例和监听。能够操作linux的工具…

exfat为什么不适合机械硬盘_为什么有人说小排量车不适合跑高速,多少排量的车适合?...

阅读本文前,请您先点击上面的蓝色字体“梅赛德斯丶Benz”,再点击“关注”,这样您就可以继续免费收到祝福了。每天都有分享,完全是免费订阅,请放心关注。 哈喽,小伙伴们关注“梅塞德斯丶Benz”每…

调用第三方接口的几种请求方式

第一种方式: String url4"https://www.showmebug.com/open_api/v1/interviews"; jsonnew JSONObject(); json.put("candidate_name", "张三");//传递的参数 MediaType mediaType MediaType.parse("application/json;charsetut…