[Google_Bootcamp_Day15]

Updated: November 15, 2020

ML strategy 4

Transfer Learning

Transfer learning refers to using the neural network knowledge for another application.

Example: Cat recognition - radiology diagnosis

The following neural network is trained for cat recognition, but we want to adapt it for radiology diagnosis. The neural network will learn about the structure and the nature of images. This initial phase of training on image recognition is called pre-training, since it will pre-initialize the weights of the neural network. Updating all the weights afterwards is called fine-tuning.

Guideline

Delete last layer of neural network
Delete weights feeding into the last output layer of the neural network
Create a new set of randomly initialized weights for the last layer only (if you have large dataset, then you can use more layers to re-train)
Re-train on new dataset (x,y)

When to use transfer learning (Task A -> Task B)

Task A and B have the same input x
A lot more data for Task A than Task B
Low level features from Task A could be helpful for Task B

Multi-Task Learning

Multi-task learning refers to having one neural network do simultaneously several tasks.

Example: Simplified autonomous vehicle

ex1 multi_task

The vehicle has to detect simultaneously several things: pedestrians, cars, road signs, traffic lights, etc. We could have trained four separate neural networks, instead of train one to do four tasks. However, in this case, the performance of the system is better when one neural network is trained to do four tasks than training four separate neural networks since some of the earlier features in the neural network could be shared between the different types of objects.

When to use multi-task learning

Training on a set of tasks that could benefit from having shared lower-level features
Usually : Amount of data you have for each task is quite similar
Can train a big enough neural network to do well on all tasks

End-to-End deep learning

End-to-End deep learning is the simplification of a processing or learning systems into one neural network.

Example: Speech recognition model

end_to_End

End-to-End deep learning cannot be used for every problem since it needs a lot of labeled data. It is used mainly in audio transcripts, image captures, image synthesis, machine translation, steering in self-driving cars, etc.

Whether to use end-to-end learning

Pros
- Let the data speak
- Less hand-designing of components needed
Cons
- May need large amount of data
- Excludes potentially useful hand-designed components
Key Question before applying end-to-end learning
- Do you have sufficient data to learn a function of the complexity needed to map X to Y?

[Source] https://www.coursera.org/learn/machine-learning-projects

Share on

Twitter Facebook LinkedIn

Jeongho Shin (Leo)

[Google_Bootcamp_Day15]

Transfer Learning

Guideline

When to use transfer learning (Task A -> Task B)

Multi-Task Learning

When to use multi-task learning

End-to-End deep learning

Whether to use end-to-end learning

Share on

Leave a comment

You may also enjoy

[Network] Transport Layer 2

[Network] Transport Layer 1

[Web Backend] SQL Basic / JDBC

[Data Structure] Linked_List