TensorFlow 2.0學習筆記(7)：遷移學習

Yanwei Liu

25 min readMay 2, 2020

本文程式碼參考自：

輕鬆學會 Google TensorFlow 2.0 人工智慧深度學習實作開發書中之範例程式

購買連結

taipeitechmmslab/MMSLAB-TF2

Permalink Dismiss GitHub is home to over 50 million developers working together to host and review code, manage…

github.com

遷移學習：一種技巧，利用已使用大量dataset訓練好的模型。在進行遷移學習時，我們必須更改預訓練模型的last layer，使其類別數量與要處理的數據集一樣。

凍結參數：將預訓練模型的變量設為不可訓練。凍結參數後，只有最後分類層級的變量會接受訓練，而預訓練模型的其他層級中的變量將保持不變。

feature_extractor.trainable = False'''
Freeze the variables in the feature extractor layer, so that the training only modifies the final classifier layer.
'''

TensorFlow Hub and Transfer Learning

本Colab程式碼只要有三個學習目標：

[1] Use a TensorFlow Hub MobileNet for prediction

[2] Use a TensorFlow Hub models for the Cats vs. Dogs dataset

[3] Do simple transfer learning with TensorFlow Hub

Google Colaboratory

Edit description

colab.research.google.com

# -*- coding: utf-8 -*-
"""l06c01_tensorflow_hub_and_transfer_learning.ipynbAutomatically generated by Colaboratory.Original file is located at
    https://colab.research.google.com/github/tensorflow/examples/blob/master/courses/udacity_intro_to_tensorflow_for_deep_learning/l06c01_tensorflow_hub_and_transfer_learning.ipynb##### Copyright 2019 The TensorFlow Authors.
"""#@title Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# https://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.# TensorFlow Hub and Transfer Learning[TensorFlow Hub](http://tensorflow.org/hub) is an online repository of already trained TensorFlow models that you can use.
These models can either be used as is, or they can be used for Transfer Learning.Transfer learning is a process where you take an existing trained model, and extend it to do additional work. This involves leaving the bulk of the model unchanged, while adding and retraining the final layers, in order to get a different set of possible outputs.In this Colab we will do both.Here, you can see all the models available in [TensorFlow Module Hub](https://tfhub.dev/).## Concepts that will be covered in this Colab1. Use a TensorFlow Hub model for prediction.
2. Use a TensorFlow Hub model for Dogs vs. Cats dataset.
3. Do simple transfer learning with TensorFlow Hub.Before starting this Colab, you should reset the Colab environment by selecting `Runtime -> Reset all runtimes...` from menu above.# ImportsThis Colab will require us to use some things which are not yet in official releases of TensorFlow. So below, we're first installing a nightly version of TensorFlow as well as TensorFlow Hub.This will switch your installation of TensorFlow in Colab to this TensorFlow version. Once you are finished with this Colab, you should switch batch to the latest stable release of TensorFlow by selecting `Runtime -> Reset all runtimes...` in the menus above. This will reset the Colab environment to its original state.Some normal imports we've seen before. The new one is importing tensorflow_hub which was installed above, and which this Colab will make heavy use of.
"""import tensorflow as tfimport matplotlib.pylab as pltimport tensorflow_hub as hub
import tensorflow_datasets as tfdsfrom tensorflow.keras import layersimport logging
logger = tf.get_logger()
logger.setLevel(logging.ERROR)"""# Part 1: Use a TensorFlow Hub MobileNet for predictionIn this part of the Colab, we'll take a trained model, load it into to Keras, and try it out.The model that we'll use is MobileNet v2 (but any model from [tf2 compatible image classifier URL from tfhub.dev](https://tfhub.dev/s?q=tf2&module-type=image-classification) would work).## Download the classifierDownload the MobileNet model and create a Keras model from it.
MobileNet is expecting images of 224 $\times$ 224 pixels, in 3 color channels (RGB).
"""CLASSIFIER_URL ="https://tfhub.dev/google/tf2-preview/mobilenet_v2/classification/2"
IMAGE_RES = 224model = tf.keras.Sequential([
    hub.KerasLayer(CLASSIFIER_URL, input_shape=(IMAGE_RES, IMAGE_RES, 3))
])"""## Run it on a single imageMobileNet has been trained on the ImageNet dataset. ImageNet has 1000 different output classes, and one of them is military uniforms.
Let's get an image containing a military uniform that is not part of ImageNet, and see if our model can predict that it is a military uniform.
"""import numpy as np
import PIL.Image as Imagegrace_hopper = tf.keras.utils.get_file('image.jpg','https://storage.googleapis.com/download.tensorflow.org/example_images/grace_hopper.jpg')
grace_hopper = Image.open(grace_hopper).resize((IMAGE_RES, IMAGE_RES))
grace_hoppergrace_hopper = np.array(grace_hopper)/255.0
grace_hopper.shape"""Remember, models always want a batch of images to process. So here, we add a batch dimension, and pass the image to the model for prediction."""result = model.predict(grace_hopper[np.newaxis, ...])
result.shape"""The result is a 1001 element vector of logits, rating the probability of each class for the image.So the top class ID can be found with argmax. But how can we know what class this actually is and in particular if that class ID in the ImageNet dataset denotes a military uniform or something else?
"""predicted_class = np.argmax(result[0], axis=-1)
predicted_class"""## Decode the predictionsTo see what our predicted_class is in the ImageNet dataset, download the ImageNet labels and fetch the row that the model predicted.
"""labels_path = tf.keras.utils.get_file('ImageNetLabels.txt','https://storage.googleapis.com/download.tensorflow.org/data/ImageNetLabels.txt')
imagenet_labels = np.array(open(labels_path).read().splitlines())plt.imshow(grace_hopper)
plt.axis('off')
predicted_class_name = imagenet_labels[predicted_class]
_ = plt.title("Prediction: " + predicted_class_name.title())"""Bingo. Our model correctly predicted military uniform!# Part 2: Use a TensorFlow Hub models for the Cats vs. Dogs datasetNow we'll use the full MobileNet model and see how it can perform on the Dogs vs. Cats dataset.## DatasetWe can use TensorFlow Datasets to load the Dogs vs Cats dataset.
"""(train_examples, validation_examples), info = tfds.load(
    'cats_vs_dogs', 
    with_info=True, 
    as_supervised=True, 
    split=['train[:80%]', 'train[80%:]'],
)num_examples = info.splits['train'].num_examples
num_classes = info.features['label'].num_classes"""The images in the Dogs vs. Cats dataset are not all the same size."""for i, example_image in enumerate(train_examples.take(3)):
  print("Image {} shape: {}".format(i+1, example_image[0].shape))"""So we need to reformat all images to the resolution expected by MobileNet (224, 224).The `.repeat()` and `steps_per_epoch` here is not required, but saves ~15s per epoch, since the shuffle-buffer only has to cold-start once.
"""def format_image(image, label):
  image = tf.image.resize(image, (IMAGE_RES, IMAGE_RES))/255.0
  return image, labelBATCH_SIZE = 32train_batches      = train_examples.shuffle(num_examples//4).map(format_image).batch(BATCH_SIZE).prefetch(1)
validation_batches = validation_examples.map(format_image).batch(BATCH_SIZE).prefetch(1)"""## Run the classifier on a batch of imagesRemember our `model` object is still the full MobileNet model trained on ImageNet, so it has 1000 possible output classes.
ImageNet has a lot of dogs and cats in it, so let's see if it can predict the images in our Dogs vs. Cats dataset.
"""image_batch, label_batch = next(iter(train_batches.take(1)))
image_batch = image_batch.numpy()
label_batch = label_batch.numpy()result_batch = model.predict(image_batch)predicted_class_names = imagenet_labels[np.argmax(result_batch, axis=-1)]
predicted_class_names"""The labels seem to match names of Dogs and Cats. Let's now plot the images from our Dogs vs Cats dataset and put the ImageNet labels next to them."""plt.figure(figsize=(10,9))
for n in range(30):
  plt.subplot(6,5,n+1)
  plt.subplots_adjust(hspace = 0.3)
  plt.imshow(image_batch[n])
  plt.title(predicted_class_names[n])
  plt.axis('off')
_ = plt.suptitle("ImageNet predictions")"""# Part 3: Do simple transfer learning with TensorFlow HubLet's now use TensorFlow Hub to do Transfer Learning.With transfer learning we reuse parts of an already trained model and change the final layer, or several layers, of the model, and then retrain those layers on our own dataset.In addition to complete models, TensorFlow Hub also distributes models without the last classification layer. These can be used to easily do transfer learning. We will continue using MobileNet v2 because in later parts of this course, we will take this model and deploy it on a mobile device using [TensorFlow Lite](https://www.tensorflow.org/lite). Any [image feature vector URL from tfhub.dev](https://tfhub.dev/s?module-type=image-feature-vector&q=tf2) would work here.We'll also continue to use the Dogs vs Cats dataset, so we will be able to compare the performance of this model against the ones we created from scratch earlier.Note that we're calling the partial model from TensorFlow Hub (without the final classification layer) a `feature_extractor`. The reasoning for this term is that it will take the input all the way to a layer containing a number of features. So it has done the bulk of the work in identifying the content of an image, except for creating the final probability distribution. That is, it has extracted the features of the image.
"""URL = "https://tfhub.dev/google/tf2-preview/mobilenet_v2/feature_vector/2"
feature_extractor = hub.KerasLayer(URL,
                                   input_shape=(IMAGE_RES, IMAGE_RES,3))"""Let's run a batch of images through this, and see the final shape. 32 is the number of images, and 1280 is the number of neurons in the last layer of the partial model from TensorFlow Hub."""feature_batch = feature_extractor(image_batch)
print(feature_batch.shape)"""Freeze the variables in the feature extractor layer, so that the training only modifies the final classifier layer."""feature_extractor.trainable = False"""## Attach a classification headNow wrap the hub layer in a `tf.keras.Sequential` model, and add a new classification layer.
"""model = tf.keras.Sequential([
  feature_extractor,
  layers.Dense(2)
])model.summary()"""## Train the modelWe now train this model like any other, by first calling `compile` followed by `fit`.
"""model.compile(
  optimizer='adam',
  loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
  metrics=['accuracy'])EPOCHS = 6
history = model.fit(train_batches,
                    epochs=EPOCHS,
                    validation_data=validation_batches)"""You can see we get ~97% validation accuracy, which is absolutely awesome. This is a huge improvement over the model we created in the previous lesson, where we were able to get ~83% accuracy. The reason for this difference is that MobileNet was carefully designed over a long time by experts, then trained on a massive dataset (ImageNet).Although not equivalent to TensorFlow Hub, you can check out how to create MobileNet in Keras [here](https://github.com/keras-team/keras-applications/blob/master/keras_applications/mobilenet.py).Let's plot the training and validation accuracy/loss graphs.
"""acc = history.history['accuracy']
val_acc = history.history['val_accuracy']loss = history.history['loss']
val_loss = history.history['val_loss']epochs_range = range(EPOCHS)plt.figure(figsize=(8, 8))
plt.subplot(1, 2, 1)
plt.plot(epochs_range, acc, label='Training Accuracy')
plt.plot(epochs_range, val_acc, label='Validation Accuracy')
plt.legend(loc='lower right')
plt.title('Training and Validation Accuracy')plt.subplot(1, 2, 2)
plt.plot(epochs_range, loss, label='Training Loss')
plt.plot(epochs_range, val_loss, label='Validation Loss')
plt.legend(loc='upper right')
plt.title('Training and Validation Loss')
plt.show()"""What is a bit curious here is that validation performance is better than training performance, right from the start to the end of execution.One reason for this is that validation performance is measured at the end of the epoch, but training performance is the average values across the epoch.The bigger reason though is that we're reusing a large part of MobileNet which is already trained on Dogs and Cats images. While doing training, the network is still performing image augmentation on the training images, but not on the validation dataset. This means the training images may be harder to classify compared to the normal images in the validation dataset.## Check the predictionsTo redo the plot from before, first get the ordered list of class names.
"""class_names = np.array(info.features['label'].names)
class_names"""Run the image batch through the model and convert the indices to class names."""predicted_batch = model.predict(image_batch)
predicted_batch = tf.squeeze(predicted_batch).numpy()
predicted_ids = np.argmax(predicted_batch, axis=-1)
predicted_class_names = class_names[predicted_ids]
predicted_class_names"""Let's look at the true labels and predicted ones."""print("Labels: ", label_batch)
print("Predicted labels: ", predicted_ids)plt.figure(figsize=(10,9))
for n in range(30):
  plt.subplot(6,5,n+1)
  plt.subplots_adjust(hspace = 0.3)
  plt.imshow(image_batch[n])
  color = "blue" if predicted_ids[n] == label_batch[n] else "red"
  plt.title(predicted_class_names[n].title(), color=color)
  plt.axis('off')
_ = plt.suptitle("Model predictions (blue: correct, red: incorrect)")