In this notebook, you play around with the TensorFlow Python API.
In [2]:
import tensorflow as tf
import numpy as np
print(tf.__version__)
First, let's try doing this using numpy, the Python numeric package. numpy code is immediately evaluated.
In [3]:
a = np.array([5, 3, 8])
b = np.array([3, -1, 2])
c = np.add(a, b)
print(c)
The equivalent code in TensorFlow consists of two steps:
In [4]:
a = tf.constant([5, 3, 8])
b = tf.constant([3, -1, 2])
c = tf.add(a, b)
print(c)
c is an Op ("Add") that returns a tensor of shape (3,) and holds int32. The shape is inferred from the computation graph.
Try the following in the cell above:
In [5]:
with tf.Session() as sess:
result = sess.run(c)
print(result)
Same graph, but without hardcoding inputs at build stage
In [6]:
a = tf.placeholder(dtype=tf.int32, shape=(None,)) # batchsize x scalar
b = tf.placeholder(dtype=tf.int32, shape=(None,))
c = tf.add(a, b)
with tf.Session() as sess:
result = sess.run(c, feed_dict={
a: [3, 4, 5],
b: [-1, 2, 3]
})
print(result)
The area of triangle whose three side lengths are $(a, b, c)$ is $\sqrt{s(s-a)(s-b)(s-c)}$ where $s=\frac{a+b+c}{2}$
Look up the available operations at: https://www.tensorflow.org/api_docs/python/tf.
You'll need the tf.sqrt()
operation. Remember tf.add()
, tf.subtract()
and tf.multiply()
are overloaded with the +,- and * operators respectively.
You should get: 6.278497
In [8]:
def compute_area(sides):
#TODO: Write TensorFlow code to compute area of a triangle
# given its side lengths
return area
with tf.Session() as sess:
area = compute_area(tf.constant([5.0, 3.0, 7.1]))
result = sess.run(area)
print(result)
Extend your code to be able to compute the area for several triangles at once.
You should get: [6.278497 4.709139]
In [7]:
def compute_area(sides):
#TODO: Write TensorFlow code to compute area of a
# SET of triangles given by their side lengths
return list_of_areas
with tf.Session() as sess:
# pass in two triangles
area = compute_area(tf.constant([
[5.0, 3.0, 7.1],
[2.3, 4.1, 4.8]
]))
result = sess.run(area)
print(result)
More common is to define the input to a program as a placeholder and then to feed in the inputs. The difference between the code below and the code above is whether the "area" graph is coded up with the input values or whether the "area" graph is coded up with a placeholder through which inputs will be passed in at run-time.
In [ ]:
with tf.Session() as sess:
#TODO: Rather than feeding the side values as a constant,
# use a placeholder and fill it using feed_dict instead.
result = sess.run(...)
print(result)
tf.eager allows you to avoid the build-then-run stages. However, most production code will follow the lazy evaluation paradigm because the lazy evaluation paradigm is what allows for multi-device support and distribution.
One thing you could do is to develop using tf.eager and then comment out the eager execution and add in the session management code.
You will need to restart your session to try this out.
In [ ]:
import tensorflow as tf
tf.enable_eager_execution()
#TODO: Using your non-placeholder solution,
# try it now using tf.eager by removing the session
Use TensorFlow to find the roots of a fourth-degree polynomial using Halley's Method. The five coefficients (i.e. $a_0$ to $a_4$) of
$f(x) = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + a_4 x^4$
will be fed into the program, as will the initial guess $x_0$. Your program will start from that initial guess and then iterate one step using the formula:
If you got the above easily, try iterating indefinitely until the change between $x_n$ and $x_{n+1}$ is less than some specified tolerance. Hint: Use tf.while_loop
Copyright 2017 Google Inc. Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License