1
00:00:02,840 --> 00:00:07,040
If you’ve been ignoring neural nets cuz you think they’re too hard to understand
2
00:00:07,040 --> 00:00:11,040
or you think you don’t need them…boy do I have a treat for you!
3
00:00:11,040 --> 00:00:16,080
In this video you’ll learn about neural nets without any of the math or code –
4
00:00:16,080 --> 00:00:19,200
just an intro to what they are and how they work.
5
00:00:19,200 --> 00:00:23,280
My hope is that you’ll get an idea for why they’re such an important tool.
6
00:00:23,280 --> 00:00:24,560
Let’s get started.
7
00:00:24,560 --> 00:00:25,920
8
00:00:25,920 --> 00:00:30,360
The first thing you need to know is that deep learning is about neural networks.
9
00:00:30,360 --> 00:00:34,360
The structure of a neural network is like any other kind of network;
10
00:00:34,360 --> 00:00:38,160
there is an interconnected web of nodes, which are called neurons,
11
00:00:38,160 --> 00:00:40,560
and the edges that join them together.
12
00:00:40,560 --> 00:00:44,500
A neural network's main function is to receive a set of inputs,
13
00:00:44,500 --> 00:00:47,580
perform progressively complex calculations,
14
00:00:47,580 --> 00:00:50,380
and then use the output to solve a problem.
15
00:00:50,380 --> 00:00:53,800
Neural networks are used for lots of different applications,
16
00:00:53,800 --> 00:00:57,140
but in this series we will focus on classification.
17
00:00:57,140 --> 00:01:01,520
If you wanna learn about neural nets in a bit more detail, including the math,
18
00:01:01,520 --> 00:01:08,360
my two favourite resources are Michael Nielsen's book, and Andrew Ng's class.
19
00:01:08,360 --> 00:01:15,100
Before we talk more about neural networks, I’m gonna give you a quick overview of the problem of classification.
20
00:01:15,100 --> 00:01:19,000
Classification is the process of categorizing a group of objects,
21
00:01:19,000 --> 00:01:23,160
while only using some basic data features that describe them.
22
00:01:23,160 --> 00:01:26,040
There are lots of classifiers available today -
23
00:01:26,040 --> 00:01:32,340
like Logistic Regression, Support Vector Machines, Naive Bayes, and of course, neural networks.
24
00:01:32,340 --> 00:01:38,240
The firing of a classifier, or activation as its commonly called, produces a score.
25
00:01:38,240 --> 00:01:42,400
For example, say you needed to predict if a patient is sick or healthy,
26
00:01:42,400 --> 00:01:46,720
and all you have are their height, weight, and body temperature.
27
00:01:46,720 --> 00:01:53,240
The classifier would receive this data about the patient, process it, and fire out a confidence score.
28
00:01:53,240 --> 00:02:01,900
A high score would mean a high confidence that the patient is sick, and a low score would suggest that they are healthy.
29
00:02:01,900 --> 00:02:05,960
Neural nets are used for classification tasks where an object can fall
30
00:02:05,960 --> 00:02:09,080
into one of at least two different categories.
31
00:02:09,080 --> 00:02:11,820
Unlike other networks like a social network,
32
00:02:11,820 --> 00:02:15,120
a neural network is highly structured and comes in layers.
33
00:02:15,120 --> 00:02:17,300
The first layer is the input layer,
34
00:02:17,300 --> 00:02:19,320
the final layer is the output layer,
35
00:02:19,320 --> 00:02:23,040
and all layers in between are referred to as hidden layers.
36
00:02:23,040 --> 00:02:28,560
A neural net can be viewed as the result of spinning classifiers together in a layered web.
37
00:02:28,560 --> 00:02:33,920
This is because each node in the hidden and output layers has its own classifier.
38
00:02:33,920 --> 00:02:35,760
Take that node for example -
39
00:02:35,760 --> 00:02:39,440
it gets its inputs from the input layer, and activates.
40
00:02:39,440 --> 00:02:45,040
Its score is then passed on as input to the next hidden layer for further activation.
41
00:02:45,040 --> 00:02:45,940
So,
42
00:02:45,940 --> 00:02:50,100
let’s see how this plays out end to end across the entire network.
43
00:02:50,100 --> 00:02:53,020
A set of inputs is passed to the first hidden layer,
44
00:02:53,020 --> 00:02:56,720
the activations from that layer are passed to the next layer and so on,
45
00:02:56,720 --> 00:02:58,960
until you reach the output layer,
46
00:02:58,960 --> 00:03:04,480
where the results of the classification are determined by the scores at each node.
47
00:03:04,480 --> 00:03:07,120
This happens for each set of inputs.
48
00:03:07,120 --> 00:03:09,920
Here's another one...
49
00:03:09,920 --> 00:03:11,460
like so.
50
00:03:11,460 --> 00:03:16,500
This series of events starting from the input where each activation is sent to the next layer,
51
00:03:16,500 --> 00:03:18,840
and then the next, a