amygdala
diff --git a/INSTALL.md b/INSTALL.md
Lines changed: 76 additions & 11 deletions
diff --git a/workshop_image/Dockerfile b/workshop_image/Dockerfile
Lines changed: 8 additions & 2 deletions
diff --git a/workshop_sections/transfer_learning/README.md b/workshop_sections/transfer_learning/README.md
Lines changed: 11 additions & 155 deletions
@@ -1,6 +1,22 @@
 
+
 # Installation instructions for the TensorFlow workshop
 
+  - [Docker-based installation](#docker-based-installation)
+    - [Download the container image](#download-the-container-image)
+    - [Create a directory to hold data files needed by the workshop](#create-a-directory-to-hold-data-files-needed-by-the-workshop)
+    - [Run the container](#run-the-container)
+    - [Restarting the container later](#restarting-the-container-later)
+  - [Virtual environment-based installation](#virtual-environment-based-installation)
+    - [Install Conda + Python 2.7 to use as your local virtual environment](#install-conda--python-27-to-use-as-your-local-virtual-environment)
+    - [Install TensorFlow into a virtual environment](#install-tensorflow-into-a-virtual-environment)
+    - [Install some Python packages](#install-some-python-packages)
+    - [Install the Google Cloud SDK](#install-the-google-cloud-sdk)
+    - [Cloud ML setup](#cloud-ml-setup)
+    - [Cloud ML SDK installation (for 'transfer learning' preprocessing)](#cloud-ml-sdk-installation-for-transfer-learning-preprocessing)
+  - [Set up some data files used in the examples](#set-up-some-data-files-used-in-the-examples)
+  - [Optional: Clone/Download the TensorFlow repo from GitHub](#optional-clonedownload-the-tensorflow-repo-from-github)
+
 You can set up for the workshop in two different, mutually-exclusive ways:
 
 - [Running in a docker container](#docker-based-installation).
@@ -17,7 +33,7 @@ To use it, you'll need to have [Docker installed](https://docs.docker.com/engine
 Once Docker is installed and running, download the workshop image:
 
 ```sh
-$ docker pull gcr.io/google-samples/tf-workshop:v4
+$ docker pull gcr.io/google-samples/tf-workshop:v5
 ```
 
 [Here's the Dockerfile](https://github.com/amygdala/tensorflow-workshop/tree/master/workshop_image) used to build this image.
@@ -32,7 +48,7 @@ Once you've downloaded the container image, you can run it like this:
 
 ```sh
 $ docker run -v `pwd`/workshop-data:/root/tensorflow-workshop-master/workshop-data -it \
-    -p 6006:6006 -p 8888:8888 gcr.io/google-samples/tf-workshop:v4
+    -p 6006:6006 -p 8888:8888 gcr.io/google-samples/tf-workshop:v5
 ```
 
 Edit the path to the directory you're mounting as appropriate. The first component of the `-v` arg is the local directory, and the second component is where you want to mount it in your running container.
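
A dry-run sketch of the `-v` mapping described above: it builds the mount argument from an absolute local path and prints the resulting `docker run` command rather than executing it. The `DATA_DIR`, `CONTAINER_DIR`, and `MOUNT_ARG` variable names, and the default location under the current directory, are illustrative assumptions and not part of the workshop instructions.

```shell
# Local directory to share with the container (assumed default: ./workshop-data).
DATA_DIR="${DATA_DIR:-$PWD/workshop-data}"
mkdir -p "$DATA_DIR"

# Where the directory appears inside the container (path taken from the command above).
CONTAINER_DIR=/root/tensorflow-workshop-master/workshop-data

# The -v argument has the form <local directory>:<container directory>.
MOUNT_ARG="$DATA_DIR:$CONTAINER_DIR"

# Print the command for review instead of running it.
echo docker run -v "$MOUNT_ARG" -it -p 6006:6006 -p 8888:8888 gcr.io/google-samples/tf-workshop:v5
```

Once the printed command looks right, it can be pasted into the terminal as-is. Using an absolute path avoids depending on the directory the command happens to be run from.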
@@ -63,7 +79,8 @@ $ docker exec -it <container_id> bash
 
 We highly recommend that you use a virtual environment for your TensorFlow installation rather than a direct install onto your machine.  The instructions below walk you through a `conda` install, but a `virtualenv` environment will work as well.
 
-The instructions specify using Python 2.7, but Python 3.x will work for everything but the "Cloud ML" sections of the workshop.
+Note: The 'preprocessing' stage in the [Cloud ML transfer learning](workshop_sections/transfer_learning/cloudml)
+example requires installation of the Cloud ML SDK, which requires Python 2.7. For everything else, Python 3 should work.
 
 ### Install Conda + Python 2.7 to use as your local virtual environment
 
@@ -73,11 +90,11 @@ Follow the instructions [here](https://www.continuum.io/downloads).  The [minico
 
 ### Install TensorFlow into a virtual environment
 
-Follow the instructions [on the TensorFlow site](https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#anaconda-installation) to create a Conda environment with Python 2.7, *activate* it, and then use [conda-forge](https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#using-conda) to install TensorFlow within it.
+Follow the instructions [on the TensorFlow site](https://www.tensorflow.org/get_started/os_setup#anaconda_installation) to create a Conda environment with Python 2.7, *activate* it, and then install TensorFlow within it.
 
-**Note**: as of this writing, `conda-forge` installs TensorFlow 0.11. That is fine for this workshop. If you'd prefer to install using pip, follow the ["using pip" section](https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#using-pip) instead.
+**Note**: Install TensorFlow version 0.12.
 
-If you'd prefer to use virtualenv, see [these instructions](https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#virtualenv-installation) instead.
+If you'd prefer to use virtualenv, see [these instructions](https://www.tensorflow.org/get_started/os_setup#virtualenv_installation) instead.
 
 Remember to activate your environment in all the terminal windows you use during this workshop.
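
One way to confirm that the active environment has the expected TensorFlow version is a quick check like the sketch below. The `is_workshop_tf_version` helper is hypothetical (it is not part of the workshop repo); it only tests whether a version string starts with `0.12`, the version these instructions call for.

```shell
# Hypothetical helper: succeeds if the given version string is a 0.12 release.
is_workshop_tf_version() {
  case "$1" in
    0.12|0.12.*) return 0 ;;
    *) return 1 ;;
  esac
}

# Ask the active Python for its TensorFlow version; empty if the import fails.
installed="$(python -c 'import tensorflow as tf; print(tf.__version__)' 2>/dev/null || true)"

if is_workshop_tf_version "$installed"; then
  echo "TensorFlow $installed found"
else
  echo "Expected TensorFlow 0.12, found: ${installed:-none}"
fi
```

If the check reports `none`, the most likely cause is that the virtual environment has not been activated in the current terminal window.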
 
@@ -112,17 +129,65 @@ gcloud components install beta
 
 This gives you access to the `gcloud beta ml` commands.
 
+### Cloud ML setup
 
-## [Optional: Get Started With Google Cloud Machine Learning](#cloud-ml-setup)
-
-Follow the following instructions in order:
-
-NOTE: You DO NOT need to follow the "Setting up your Environment" section
+Follow the instructions below to create a project, initialize it for Cloud ML, and set up a storage bucket to use for the workshop examples.
 
 * [Setting Up Your GCP Project](https://cloud.google.com/ml/docs/how-tos/getting-set-up#setting_up_your_google_cloud_project )
 * [Initializing Cloud ML for your project](https://cloud.google.com/ml/docs/how-tos/getting-set-up#initializing_your_product_name_short_project)
 * [Setting up your Cloud Storage Bucket](https://cloud.google.com/ml/docs/how-tos/getting-set-up#setting_up_your_cloud_storage_bucket)
 
+### Cloud ML SDK installation (for 'transfer learning' preprocessing)
+
+The Cloud ML SDK is needed to run the 'preprocessing' stage in the [Cloud ML transfer
+learning](workshop_sections/transfer_learning/cloudml) example. It requires Python 2.7 to install. It's possible to
+skip this part of setup for most of the exercises.
+
+To install the SDK, follow the setup instructions
+[on this page](https://cloud.google.com/ml/docs/how-tos/getting-set-up).
+(If you've followed the instructions above, you will have already done some of these steps. **Install TensorFlow version 0.12**, not 0.11, as described in [this section](#install-tensorflow-into-a-virtual-environment).)
+
+**Note**: if you have issues with the pip install of `python-snappy`, and are running in a conda virtual environment, try `conda install python-snappy` instead.
+
+You don't need to download the Cloud ML samples or docs for this workshop, though you may find it useful to grab them
+anyway.
+
+## Set up some data files used in the examples
+
+### Transfer learning example
+
+Because workshop time is limited, we've saved a set of
+[TFRecords](https://www.tensorflow.org/api_docs/python/python_io/)
+generated as part of the [Cloud ML transfer learning](workshop_sections/transfer_learning/cloudml)
+example. Copy them now to your own bucket as follows.
+
+Copy a zip of the generated records to some directory on your local machine:
+
+```shell
+gsutil cp gs://oscon-tf-workshop-materials/transfer_learning/cloudml/hugs_preproc_tfrecords.zip .
+```
+
+and then expand the zip:
+
+```shell
+unzip hugs_preproc_tfrecords.zip
+```
+
+Set the `BUCKET` variable to point to your GCS bucket (replacing `your-bucket-name` with the actual name):
+
+```shell
+BUCKET=gs://your-bucket-name
+```
+
+Then set the `GCS_PATH` variable as follows, and copy the unzipped records to a `preproc` directory under that path:
+
+```shell
+GCS_PATH=$BUCKET/hugs_preproc_tfrecords
+gsutil cp -r hugs_preproc_tfrecords/ $GCS_PATH/preproc
+```
+
+Once you've done this, you can delete the local zip and `hugs_preproc_tfrecords` directory.
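
To make the resulting layout explicit, the sketch below composes the destination path from the two variables and, if `gsutil` is available, lists what was copied. `PREPROC_PATH` is a name introduced here for illustration, and `your-bucket-name` remains a placeholder to replace.

```shell
BUCKET=gs://your-bucket-name          # placeholder: use your own bucket
GCS_PATH=$BUCKET/hugs_preproc_tfrecords
PREPROC_PATH=$GCS_PATH/preproc        # illustrative name introduced for this sketch

echo "Records are expected under: $PREPROC_PATH"

# List the copied records if gsutil is installed; report instead of failing otherwise.
if command -v gsutil >/dev/null 2>&1; then
  gsutil ls "$PREPROC_PATH" || echo "Could not list $PREPROC_PATH (check the bucket name and your credentials)"
else
  echo "gsutil not found; install the Google Cloud SDK first"
fi
```

Running a listing like this before deleting the local files is a cheap way to catch a mistyped bucket name.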
+
 ## Optional: Clone/Download the TensorFlow repo from GitHub
 
 We'll be looking at some examples based on code in the tensorflow repo. While it's not necessary, you might want to clone or download it [here](https://github.com/tensorflow/tensorflow), or grab the latest release [here](https://github.com/tensorflow/tensorflow/releases).
 
@@ -15,10 +15,16 @@ FROM gcr.io/tensorflow/tensorflow:latest-devel
 
 RUN pip install --upgrade pip
 RUN apt-get update
+RUN apt-get install -y unzip python-dev python-pip zlib1g-dev libjpeg-dev libblas-dev
+RUN apt-get install -y liblapack-dev libatlas-base-dev libsnappy-dev libyaml-dev gfortran
 RUN apt-get install -y python-scipy
-RUN pip install sklearn nltk pillow
-RUN python -c "import nltk; nltk.download('punkt')"
 
+RUN pip install sklearn nltk pillow setuptools
+RUN pip install flask google-api-python-client
+RUN pip install pandas python-snappy scipy scikit-learn requests uritemplate
+RUN pip install --upgrade --force-reinstall https://storage.googleapis.com/cloud-ml/sdk/cloudml.latest.tar.gz
+
+# RUN python -c "import nltk; nltk.download('punkt')"
 
 RUN curl https://dl.google.com/dl/cloudsdk/channels/rapid/downloads/google-cloud-sdk-132.0.0-linux-x86_64.tar.gz | tar xvz
 RUN ./google-cloud-sdk/install.sh -q
 
@@ -1,159 +1,15 @@
 
-# Transfer learning
+This directory contains two examples of transfer learning using the "Inception V3" image classification model.
 
-  - [Introduction](#introduction)
-  - [1. Take a look at the the Inception v3 model](#1-take-a-look-at-the-the-inception-v3-model)
-  - [Data sets](#data-sets)
-    - [The "hugs/no-hugs" data set](#the-hugsno-hugs-data-set)
-    - [(Or, you can use the Flowers data set if you want)](#or-you-can-use-the-flowers-data-set-if-you-want)
-    - [Pre-generated 'bottleneck' values for both example datasets](#pre-generated-bottleneck-values-for-both-example-datasets)
-  - [2. Run a training session and use the model for prediction](#2-run-a-training-session-and-use-the-model-for-prediction)
-    - [Train the model](#train-the-model)
-    - [Do prediction using your learned model in an ipython notebook](#do-prediction-using-your-learned-model-in-an-ipython-notebook)
-  - [3. A Custom Esimator for the transfer learning model](#3-a-custom-esimator-for-the-transfer-learning-model)
-  - [Named Scopes and TensorBoard Summary information](#named-scopes-and-tensorboard-summary-information)
-  - [4. Exercise: Building the Custom Estimator's model graph](#4-exercise-building-the-custom-estimators-model-graph)
+The [cloudml](cloudml) example shows how to use [Cloud Dataflow](https://cloud.google.com/dataflow/) ([Apache
+Beam](https://beam.apache.org/)) to do image preprocessing, then train and serve your model on Cloud ML.  It supports
+distributed training on Cloud ML.
+It is based on the example [here](https://github.com/GoogleCloudPlatform/cloudml-samples/tree/master/flowers), with
+some additional modifications to make it easy to use other image sets, and a prediction web server that demos how to
+use the Cloud ML API for prediction once your trained model is serving.
 
+The [TF_Estimator](TF_Estimator) example takes a similar approach, but is not packaged to run on Cloud ML. It also
+shows an example of using a custom [`Estimator`](https://www.tensorflow.org/api_docs/python/contrib.learn/estimators).
 
-## Introduction
-
-This lab shows how we can use an existing model to do *transfer learning* -- effectively bootstrapping an existing model to reduce the effort needed to learn something new.
-
-Specifically, we will take an 'Inception' v3 architecture model trained on ImageNet images, and using its penultimate "bottleneck" layer, train a new top layer that can recognize other classes of images.
-We'll see that our new top layer does not need to be very complex, and that we don't need to do much training of this new model, to get good results for our new image classifications.
-
-The core of the approach is the same as that used in [this TensorFlow example](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/image_retraining), but here we will use a custom [Estimator](https://www.tensorflow.org/versions/r0.11/api_docs/python/contrib.learn.html#estimators) (and train on a different set of photos).
-
-## 1. Take a look at the the Inception v3 model
-
-We can use the `view_inception_model.ipynb` Jupyter notebook to take a look at the structure of the Inception model before we start working with it.
-
-First, download the inception model from:
-http://download.tensorflow.org/models/image/imagenet/inception-2015-12-05.tgz , extract it,
-and copy the model file `classify_image_graph_def.pb` into `/tmp/imagenet` (you may need to first create the directory).  This is where our python scripts will look for it, so we're saving a later download by putting it in the same place.
-
-Then, start a jupyter server in this directory.  For convenience, run it in a new terminal window. (Don't forget to activate your virtual environment first as necessary).
-
-```sh
-$ jupyter notebook
-```
-
-Load and run the `view_inception_model.ipynb` notebook.  Poke around the model graph a bit.
-
-<a href="https://storage.googleapis.com/oscon-tf-workshop-materials/images/incpv3.png" target="_blank"><img src="https://storage.googleapis.com/oscon-tf-workshop-materials/images/incpv3.png" width="500"/></a>
-
-See if you can find the 'DecodeJpeg/contents:0' and 'pool_3/_reshape:0' nodes-- these will be our input and 'bottleneck' nodes, respectively, for the transfer learning.
-
-<a href="https://storage.googleapis.com/oscon-tf-workshop-materials/images/incpv3_pool_3_reshape.png" target="_blank"><img src="https://storage.googleapis.com/oscon-tf-workshop-materials/images/incpv3_pool_3_reshape.png" width="500"/></a>
-
-Note: If you should want to write the model graph to a text file to browse it that way, you can use
-the `tf.train.write_graph()` method. See [`mnist_hidden.py`](../mnist_series/the_hard_way/mnist_hidden.py) for
-a (commented-out) example of how to call it.
-
-## Data sets
-
-We've provided training images for you, but if you want to play around further, you can use any image datasets you like.  The training script simply assumes you have a top-level directory containing class-named subdirectories, each containing images for that class.  It then infers the classes to be learned from the directory structure.
-
-### The "hugs/no-hugs" data set
-
-For this exercise, we'll use a training set of images that have been sorted into two categories -- whether or not one would want to hug the object in the photo.
-(Thanks to Julia Ferraioli for this dataset).
-
-This dataset does not have a large number of images, but as we will see, prediction on new images still works surprisingly well.  This shows the power of 'bootstrapping' the pre-trained Inception model.
-
-
-```sh
-$ curl -O https://storage.googleapis.com/oscon-tf-workshop-materials/transfer_learning/hugs_photos.zip
-$ unzip hugs_photos.zip
-```
-
-### (Or, you can use the Flowers data set if you want)
-
-If you want to do flower classification instead, as with the original tutorial, you can find the data here:
-
-```sh
-$ curl -O http://download.tensorflow.org/example_images/flower_photos.tgz
-$ tar xzf flower_photos.tgz
-```
-
-
-### Pre-generated 'bottleneck' values for both example datasets
-
-When you run the transfer learning training, you'll first need to generate "bottleneck values" for the images, using the Inception v3 model. (We'll take a look at how that works).
-If this process is too time-consuming for the workshop context, you can download the pre-calculated bottleneck files for both the data sets above:
-
-- https://storage.googleapis.com/oscon-tf-workshop-materials/transfer_learning/bottlenecks_hugs.zip
-- https://storage.googleapis.com/oscon-tf-workshop-materials/transfer_learning/bottlenecks_flowers.zip
-
-## 2. Run a training session and use the model for prediction
-
-Let's start by training our new model and using the results to make predictions.
-
-### Train the model
-
-```sh
-$ python transfer_learning.py --image_dir=hugs_photos --bottleneck_dir=bottlenecks_hugs
-```
-
-**Note the name of the model directory** that is created by the script.
-
-### Do prediction using your learned model in an ipython notebook
-
-Start up a jupyter server in this directory as necessary.  Select the `transfer_learning_prediction.ipynb` notebook in the listing that comes up.
-
-Find this line:
-```
-MODEL_DIR = '/tmp/tfmodels/img_classify/your-model-dir'
-```
-
-and edit it to point to the model directory used for your training run.
-
-Then, run the notebook.  
-You should see some predictions made for the images in the `prediction_images` directory!
-
-If you like, you can try adding additional images to that directory, and rerunning the last part of the notebook to find out whether they're more huggable than not.
-
-## 3. A Custom Esimator for the transfer learning model
-
-Before we jump into the coding part of the lab, we'll take a look at `transfer_learning_skeleton.py`.
-It has the scaffolding in place for building a custom Estimator to do the transfer learning.  
-We'll look at how the `fit()`, `evaluate()`, and `predict()` methods are being used.
-
-We'll also take a look at how the Inception model is being loaded and accessed.
-
-## Named Scopes and TensorBoard Summary information
-
-Note that this code includes some examples of use of `tf.name_scope()` when defining nodes, particularly
-in the `add_final_training_ops()` function. You'll be able to spot these scope names when you look at the model graph in TensorBoard.
-We saw use of `tf.name_scope` earlier in ['mnist_hidden.py'](../mnist_series/the_hard_way/mnist_hidden.py) as well.
-
-The code in `add_final_training_ops()` also includes some examples of defining summary information for TensorBoard (we saw a simple example of doing this in ['mnist_hidden.py'](../mnist_series/the_hard_way/mnist_hidden.py) also).
-
-However, here, as we're wrapping things in an Estimator, we don't need to add an explicit `tf.merge_summary` op-- it will do that for us.
-
-
-## 4. Exercise: Building the Custom Estimator's model graph
-
-Start with [`transfer_learning_skeleton.py`](transfer_learning.py), and complete the `_make_model`
-function definition. This function builds the model graph for the custom estimator.
-
-As noted above, the Inception model graph is doing the heavy lifting here. We will just train a new
-top layer to identify our new classes: that is, we will just add a new softmax and fully-connected
-layer.  The input to this layer is the generated "bottleneck" values. The `add_final_training_ops`
-function defines this layer, then defines the loss function and the training op.
-
-Then, the `add_evaluation_step` function adds an op to evaluate the accuracy of the results. Add
-'loss' and 'accuracy' metrics to the prediction_dict, as per the `METRICS` dict below
-`make_model_fn` in the code, which we will then pass to the Estimator's `evaluate()` method.
-
-Then, add support for generating prediction value(s).
-See if you can figure out how to derive the index of the highest-value entry in the result
-vector, and store that value at the `"index"` key in the `prediction_dict`. As a hint, take a look
-at the ops used in `add_evaluation_step()`.
-
-As shown in the skeleton of `_make_model`, be sure to return the prediction dict, the loss, and the
-training op.  This info sets up the Estimator to handle calls to its `fit()`, `evaluate()`, and
-`predict()` methods.
-
-
-If you get stuck, you can take a peek at `transfer_learning.py`, but try not to do that too soon.
+The list of image sources for the images used in the "hugs/no-hugs" training is here:
+https://storage.googleapis.com/oscon-tf-workshop-materials/transfer_learning/hugs_photos_sources.csv