This notebook illustrates time-series prediction with a recurrent neural network, built with the tf.estimator API and trained on Cloud ML Engine. We start by simulating some time-series data: essentially a set of sinusoids with random amplitudes and frequencies.
In [1]:
import os
PROJECT = 'cloud-training-demos' # REPLACE WITH YOUR PROJECT ID
BUCKET = 'cloud-training-demos-ml' # REPLACE WITH YOUR BUCKET NAME
REGION = 'us-central1' # REPLACE WITH YOUR BUCKET REGION e.g. us-central1
os.environ['TFVERSION'] = '1.8' # Tensorflow version
In [2]:
# for bash
os.environ['PROJECT'] = PROJECT
os.environ['BUCKET'] = BUCKET
os.environ['REGION'] = REGION
In [3]:
%%bash
gcloud config set project $PROJECT
gcloud config set compute/region $REGION
In [26]:
import tensorflow as tf
print(tf.__version__)
In [4]:
import numpy as np
import seaborn as sns
import pandas as pd
SEQ_LEN = 10
def create_time_series():
  freq = (np.random.random() * 0.5) + 0.1  # 0.1 to 0.6
  ampl = np.random.random() + 0.5          # 0.5 to 1.5
  x = np.sin(np.arange(0, SEQ_LEN) * freq) * ampl
  return x

for i in range(0, 5):
  sns.tsplot( create_time_series() );  # 5 series
In [5]:
def to_csv(filename, N):
  with open(filename, 'w') as ofp:
    for lineno in range(0, N):
      seq = create_time_series()
      line = ",".join(map(str, seq))
      ofp.write(line + '\n')

to_csv('train.csv', 1000)  # 1000 sequences
to_csv('valid.csv', 50)
In [29]:
!head -5 train.csv valid.csv
Here, we are trying to predict the tenth value of a timeseries from its first nine values.
We need several TensorFlow packages, plus shutil (used later to wipe the output directory between runs).
In [6]:
import tensorflow as tf
import shutil
import tensorflow.contrib.metrics as metrics
import tensorflow.contrib.rnn as rnn
Our CSV file structure is quite simple -- a bunch of floating point numbers (note the type of DEFAULTS). We ask for the data to be read BATCH_SIZE sequences at a time. The Estimator API (tf.estimator) wants the features returned as a dict. We'll just call this timeseries column 'rawdata'.
Our CSV file sequences consist of 10 numbers. We'll assume that 9 of them are inputs and we need to predict the last one.
In [31]:
DEFAULTS = [[0.0] for x in range(0, SEQ_LEN)]
BATCH_SIZE = 20
TIMESERIES_COL = 'rawdata'
# In each sequence, column indexes 0 to N_INPUTS - 1 are features, and column indexes N_INPUTS to SEQ_LEN - 1 are labels
N_OUTPUTS = 1
N_INPUTS = SEQ_LEN - N_OUTPUTS
Reading data using the Estimator API in tf.estimator requires an input_fn. This input_fn needs to return a dict of features and the corresponding labels.
So, we read the CSV file. Each line is read as a single scalar string Tensor. We then decode the CSV; at this point, all_data will contain a list of scalar Tensors -- there will be SEQ_LEN of them.
We split this list of SEQ_LEN tensors into a list of N_INPUTS Tensors and a list of N_OUTPUTS Tensors, and stack each list along the first dimension to get a vector Tensor. We then put the inputs into a dict and call it features. The other vector is the ground truth, so labels.
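To make this concrete, here is a minimal sketch of the decode step applied to a single hard-coded line (the ten values are arbitrary and not from our generated data):
In [ ]:
# Illustration only: decode one made-up CSV line and split it into features/labels
line = tf.constant("0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9,1.0")
all_data = tf.decode_csv(line, record_defaults = DEFAULTS)  # list of SEQ_LEN scalar tensors
inputs = tf.stack(all_data[:N_INPUTS], axis = 0)            # vector Tensor of shape (9,)
labels = tf.stack(all_data[N_INPUTS:], axis = 0)            # vector Tensor of shape (1,)
with tf.Session() as sess:
  print(sess.run({TIMESERIES_COL: inputs, 'labels': labels}))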
In [32]:
# Read data and convert to needed format
def read_dataset(filename, mode, batch_size = 512):
  def _input_fn():
    # Provide the ability to decode a CSV
    def decode_csv(line):
      # all_data is a list of scalar tensors
      all_data = tf.decode_csv(line, record_defaults = DEFAULTS)
      inputs = all_data[:len(all_data) - N_OUTPUTS]  # first N_INPUTS values
      labels = all_data[len(all_data) - N_OUTPUTS:]  # last N_OUTPUTS values

      # Convert each list of rank R tensors to one rank R+1 tensor
      inputs = tf.stack(inputs, axis = 0)
      labels = tf.stack(labels, axis = 0)

      # Convert input R+1 tensor into a feature dictionary of one R+1 tensor
      features = {TIMESERIES_COL: inputs}
      return features, labels

    # Create list of files that match pattern
    file_list = tf.gfile.Glob(filename)

    # Create dataset from file list
    dataset = tf.data.TextLineDataset(file_list).map(decode_csv)

    if mode == tf.estimator.ModeKeys.TRAIN:
      num_epochs = None  # loop indefinitely
      dataset = dataset.shuffle(buffer_size = 10 * batch_size)
    else:
      num_epochs = 1  # end-of-input after this

    dataset = dataset.repeat(num_epochs).batch(batch_size)
    iterator = dataset.make_one_shot_iterator()
    batch_features, batch_labels = iterator.get_next()
    return batch_features, batch_labels
  return _input_fn
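If you want to sanity-check the input function before wiring it into an Estimator, you can pull a single batch in a session. This is optional and just a sketch; it assumes train.csv was written by the earlier cell.
In [ ]:
# Optional sanity check: pull one batch from the input_fn and inspect its shapes
check_features, check_labels = read_dataset('train.csv', tf.estimator.ModeKeys.EVAL, batch_size = 5)()
with tf.Session() as sess:
  f, l = sess.run([check_features[TIMESERIES_COL], check_labels])
  print(f.shape, l.shape)  # expect (5, 9) and (5, 1)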
A recurrent neural network consists of one or more (possibly stacked) LSTM cells.
The RNN emits one output per input time step, so here it has 9 outputs. We use only the last output, but rather than use it directly, we multiply it by a set of weights to get the actual predictions. This allows for a degree of scaling between inputs and predictions if necessary (we don't really need it in this problem).
You have two tasks to complete in the cell below (see the TODO markers).
In [33]:
LSTM_SIZE = 3  # number of hidden units (state size) in each of the LSTM cells

# Create the inference model
def simple_rnn(features, labels, mode):
  # 0. Reformat input shape to become a sequence: a list of N_INPUTS tensors, each [batch_size, 1]
  x = tf.split(features[TIMESERIES_COL], N_INPUTS, 1)

  # 1. Configure the RNN
  lstm_cell = rnn.BasicLSTMCell(LSTM_SIZE, forget_bias = 1.0)
  outputs, _ = rnn.static_rnn(lstm_cell, x, dtype = tf.float32)

  # Slice to keep only the last cell of the RNN; shape [batch_size, LSTM_SIZE]
  outputs = outputs[-1]

  # Output is result of linear activation of last layer of RNN
  weight = tf.get_variable("weight", initializer = tf.initializers.random_normal, shape = [LSTM_SIZE, N_OUTPUTS])
  bias = tf.get_variable("bias", initializer = tf.initializers.random_normal, shape = [N_OUTPUTS])
  predictions = tf.matmul(outputs, weight) + bias  # shape [batch_size, N_OUTPUTS]

  # 2. Loss function, training/eval ops
  # TODO: Implement training/eval ops for training, evaluation and prediction

  # 3. Create predictions
  predictions_dict = {"predicted": predictions}

  # 4. Create export outputs
  export_outputs = {"predict_export_outputs": tf.estimator.export.PredictOutput(outputs = predictions)}

  # 5. Return an EstimatorSpec
  return None  # TODO
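For reference, here is one way the two TODOs above could be completed. This is a hedged sketch, not necessarily the official lab solution: the mean squared error loss, the SGD optimizer with learning rate 0.01, and the name simple_rnn_completed are illustrative choices only.
In [ ]:
# Illustrative sketch of a completed model function (choices of loss/optimizer are assumptions)
def simple_rnn_completed(features, labels, mode):
  # 0. Reformat input shape to become a sequence
  x = tf.split(features[TIMESERIES_COL], N_INPUTS, 1)

  # 1. Configure the RNN
  lstm_cell = rnn.BasicLSTMCell(LSTM_SIZE, forget_bias = 1.0)
  outputs, _ = rnn.static_rnn(lstm_cell, x, dtype = tf.float32)
  outputs = outputs[-1]
  weight = tf.get_variable("weight", initializer = tf.initializers.random_normal, shape = [LSTM_SIZE, N_OUTPUTS])
  bias = tf.get_variable("bias", initializer = tf.initializers.random_normal, shape = [N_OUTPUTS])
  predictions = tf.matmul(outputs, weight) + bias

  # 2. Loss function, training/eval ops (only defined when labels are available)
  loss = train_op = eval_metric_ops = None
  if mode == tf.estimator.ModeKeys.TRAIN or mode == tf.estimator.ModeKeys.EVAL:
    loss = tf.losses.mean_squared_error(labels, predictions)
    train_op = tf.contrib.layers.optimize_loss(
        loss = loss,
        global_step = tf.train.get_global_step(),
        learning_rate = 0.01,
        optimizer = "SGD")
    eval_metric_ops = {
        "rmse": tf.metrics.root_mean_squared_error(labels, predictions)
    }

  # 3. Create predictions
  predictions_dict = {"predicted": predictions}

  # 4. Create export outputs
  export_outputs = {"predict_export_outputs": tf.estimator.export.PredictOutput(outputs = predictions)}

  # 5. Return an EstimatorSpec
  return tf.estimator.EstimatorSpec(
      mode = mode,
      predictions = predictions_dict,
      loss = loss,
      train_op = train_op,
      eval_metric_ops = eval_metric_ops,
      export_outputs = export_outputs)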
Distributed training is launched using an Estimator. The key line here is that we use tf.estimator.Estimator rather than, say, tf.estimator.DNNRegressor. This allows us to provide a model_fn, which will be our RNN defined above. Note also that we specify a serving_input_fn -- this is how we parse the input data provided to us at prediction time.
You have one task to complete: instantiate an estimator using the model function we defined previously.
In [34]:
# Create functions to read in respective datasets
def get_train():
  return read_dataset(filename = 'train.csv', mode = tf.estimator.ModeKeys.TRAIN, batch_size = 512)

def get_valid():
  return read_dataset(filename = 'valid.csv', mode = tf.estimator.ModeKeys.EVAL, batch_size = 512)
In [35]:
# Create serving input function
def serving_input_fn():
  feature_placeholders = {
      TIMESERIES_COL: tf.placeholder(tf.float32, [None, N_INPUTS])
  }

  features = {
      key: tf.expand_dims(tensor, -1)
      for key, tensor in feature_placeholders.items()
  }
  features[TIMESERIES_COL] = tf.squeeze(features[TIMESERIES_COL], axis = [2])

  return tf.estimator.export.ServingInputReceiver(features, feature_placeholders)
In [36]:
# Create custom estimator's train and evaluate function
def train_and_evaluate(output_dir):
  # TODO: Instantiate an estimator using our model function
  estimator = #

  train_spec = tf.estimator.TrainSpec(input_fn = get_train(),
                                      max_steps = 1000)

  exporter = tf.estimator.LatestExporter('exporter', serving_input_fn)

  eval_spec = tf.estimator.EvalSpec(input_fn = get_valid(),
                                    steps = None,
                                    exporters = exporter)

  tf.estimator.train_and_evaluate(estimator, train_spec, eval_spec)
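One way to fill in the TODO above is sketched below. The line belongs inside train_and_evaluate, where output_dir is the function's argument; passing our model_fn is the key point.
In [ ]:
# Sketch of the missing line inside train_and_evaluate (illustrative)
estimator = tf.estimator.Estimator(model_fn = simple_rnn,
                                   model_dir = output_dir)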
In [37]:
# Run the model
shutil.rmtree('outputdir', ignore_errors = True) # start fresh each time
train_and_evaluate('outputdir')
To train this on Cloud ML Engine, we take the code in this notebook and make a standalone Python module.
In [14]:
%%bash
# Run module as-is
export parent_dir=$(dirname $(pwd))
echo $parent_dir
rm -rf outputdir
export PYTHONPATH=${PYTHONPATH}:$parent_dir/simplernn
python -m trainer.task \
--train_data_paths="${PWD}/train.csv*" \
--eval_data_paths="${PWD}/valid.csv*" \
--output_dir=outputdir \
--job-dir=./tmp
Try out online prediction. This is how the REST API will work after you train on Cloud ML Engine.
In [39]:
%%writefile test.json
{"rawdata_input": [0,0.214,0.406,0.558,0.655,0.687,0.65,0.549,0.393]}
In [40]:
# local predict doesn't work with Python 3 yet.
# %%bash
# MODEL_DIR=$(ls ./outputdir/export/exporter/)
# gcloud ml-engine local predict --model-dir=./outputdir/export/exporter/$MODEL_DIR --json-instances=test.json
Now to train on Cloud ML Engine.
In [41]:
%%bash
# Run module on Cloud ML Engine
OUTDIR=gs://${BUCKET}/simplernn/model_trained
JOBNAME=simplernn_$(date -u +%y%m%d_%H%M%S)
gsutil -m rm -rf $OUTDIR
gcloud ml-engine jobs submit training $JOBNAME \
--region=$REGION \
--module-name=trainer.task \
--package-path=$(dirname $(pwd))/simplernn/trainer \
--job-dir=$OUTDIR \
--staging-bucket=gs://$BUCKET \
--scale-tier=BASIC \
--runtime-version=1.4 \
-- \
--train_data_paths="gs://${BUCKET}/train.csv*" \
--eval_data_paths="gs://${BUCKET}/valid.csv*" \
--output_dir=$OUTDIR
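Once the Cloud ML Engine job above finishes, the exported SavedModel can be deployed and called for online prediction roughly as follows. This is a sketch only: the model name simplernn and version v1 are hypothetical placeholders, and the export path assumes the exporter named 'exporter' wrote into $OUTDIR/export/exporter/.
In [ ]:
%%bash
# Illustrative sketch: deploy the exported model and call online prediction
MODEL_NAME=simplernn   # hypothetical model name
MODEL_VERSION=v1       # hypothetical version name
MODEL_LOCATION=$(gsutil ls gs://${BUCKET}/simplernn/model_trained/export/exporter/ | tail -1)
gcloud ml-engine models create ${MODEL_NAME} --regions $REGION
gcloud ml-engine versions create ${MODEL_VERSION} --model ${MODEL_NAME} --origin ${MODEL_LOCATION} --runtime-version=1.4
gcloud ml-engine predict --model=${MODEL_NAME} --version=${MODEL_VERSION} --json-instances=test.json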
Variant: creating short sequences from a very long sequence. The function below breaks a long sequence into overlapping windows of a fixed look-back length.
In [42]:
import tensorflow as tf
import numpy as np
def breakup(sess, x, lookback_len):
  N = sess.run(tf.size(x))
  windows = [tf.slice(x, [b], [lookback_len]) for b in range(0, N - lookback_len)]
  windows = tf.stack(windows)
  return windows

x = tf.constant(np.arange(1, 11, dtype = np.float32))
with tf.Session() as sess:
  print('input=', x.eval())
  seqx = breakup(sess, x, 5)
  print('output=', seqx.eval())
In [43]:
def make_keras_estimator(output_dir):
  from tensorflow import keras
  model = keras.models.Sequential()
  # NOTE: TIMESERIES_INPUT_LAYER is not defined in this notebook; it comes from the trainer package
  model.add(keras.layers.Dense(32, input_shape = (N_INPUTS,), name = TIMESERIES_INPUT_LAYER))
  model.add(keras.layers.Activation('relu'))
  model.add(keras.layers.Dense(1))
  model.compile(loss = 'mean_squared_error',
                optimizer = 'adam',
                metrics = ['mae', 'mape'])  # mean absolute [percentage] error
  return keras.estimator.model_to_estimator(model)
In [ ]:
%%bash
# Run module as-is
echo $PWD
rm -rf outputdir
export parent_dir=$(dirname $(pwd))
export PYTHONPATH=${PYTHONPATH}:$parent_dir/simplernn
python -m trainer.task \
--train_data_paths="${PWD}/train.csv*" \
--eval_data_paths="${PWD}/valid.csv*" \
--output_dir=${PWD}/outputdir \
--job-dir=./tmp --keras
Copyright 2017 Google Inc. Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License