内容索引

该小结主要介绍了NumPy数组的基本操作。

子目1中，介绍创建和索引数组，数据类型，dtype类，自定义异构数据类型。

子目2中，介绍数组的索引和切片，主要是对[]运算符的操作。

子目3中，介绍如何改变数组的维度，分别介绍了ravel函数、flatten函数、transpose函数、resize函数、reshape函数的用法。



In [1]:

    
%pylab inline









    



Populating the interactive namespace from numpy and matplotlib

ndarray是一个多维数组对象，该对象由实际的数据、描述这些数据的元数据组成，大部分数组操作仅仅修改元数据部分，而不改变底层的实际数据。

用arange函数创建数组



In [2]:

    
a = arange(5)
a.dtype









    Out[2]:





dtype('int32')



In [3]:

    
a









    Out[3]:





array([0, 1, 2, 3, 4])



In [4]:

    
a.shape









    Out[4]:





(5,)

数组的shape属性返回一个元祖(tuple)，元组中的元素即NumPy数组每一个维度的大小。

1. 创建多维数组

array函数可以依据给定的对象生成数组。给定的对象应是类数组，如python的列表、numpy的arange函数



In [5]:

    
m = array([arange(2), arange(2)])



In [6]:

    
print m
print m.shape
print type(m)
print type(m.shape)









    



[[0 1]
 [0 1]]
(2, 2)
<type 'numpy.ndarray'>
<type 'tuple'>

选取元素



In [7]:

    
a = array([[1,2],[3,4]])
print a[0,0]
print a[0,1]

1
2

NumPy数据类型

Numpy除了Python支持的整型、浮点型、复数型之外，还添加了很多其他的数据类型。

Type Remarks Character code bool_ compatible: Python bool '?' bool8 8 bits
Integers:

byte compatible: C char 'b' short compatible: C short 'h' intc compatible: C int 'i' int_ compatible: Python int 'l' longlong compatible: C long long 'q' intp large enough to fit a pointer 'p' int8 8 bits
int16 16 bits
int32 32 bits
int64 64 bits
Unsigned integers:

ubyte compatible: C unsigned char 'B' ushort compatible: C unsigned short 'H' uintc compatible: C unsigned int 'I' uint compatible: Python int 'L' ulonglong compatible: C long long 'Q' uintp large enough to fit a pointer 'P' uint8 8 bits
uint16 16 bits
uint32 32 bits
uint64 64 bits
Floating-point numbers:

half 'e' single compatible: C float 'f' double compatible: C double
float_ compatible: Python float 'd' longfloat compatible: C long float 'g' float16 16 bits
float32 32 bits
float64 64 bits
float96 96 bits, platform?
float128 128 bits, platform?
Complex floating-point numbers:

csingle 'F' complex_ compatible: Python complex 'D' clongfloat 'G' complex64 two 32-bit floats
complex128 two 64-bit floats
complex192 two 96-bit floats, platform?
complex256 two 128-bit floats, platform?
Any Python object:

object_ any Python object 'O'

每一种数据类型均有对应的类型转换函数



In [8]:

    
print float64(42)
print int8(42.0)
print bool(42)
print float(True)









    



42.0
42
True
1.0



In [9]:

    
arange(8, dtype=uint16)









    Out[9]:





array([0, 1, 2, 3, 4, 5, 6, 7], dtype=uint16)

复数不能转换成整数和浮点数

Numpy数组中每一个元素均为相同的数据类型，现在给出单个元素所占字节



In [10]:

    
a.dtype









    Out[10]:





dtype('int32')



In [11]:

    
a.dtype.itemsize









    Out[11]:





4

dtype类的属性



In [12]:

    
t = dtype('float64')
print t.char
print t.type
print t.str









    



d
<type 'numpy.float64'>
<f8

str属性可以给出数据类型的字符串表示，该字符串的首个字符表示字节序，然后是字符编码，然后是所占字节数字节序是指位长为32和64的字（word）存储的顺序，包括大端序(big-endian)和小端序(little-endian)。大端序是将最高位字节存储在最低的内存地址处，用>表示；与之相反，小端序是将最低位字节存储在最低的内存地址处，用<表示。

创建自定义数据类型

自定义数据类型是一种异构数据类型，可以当做用来记录电子表格或数据库中一行数据的结构。

下面我们创建一种自定义的异构数据类型，该数据类型包括一个用字符串记录的名字、一个用整数记录的数字以及一个用浮点数记录的价格。



In [13]:

    
t = dtype([('name', str_, 40), ('numitems', int32), ('price', float32)])



In [14]:

    
t









    Out[14]:





dtype([('name', 'S40'), ('numitems', '<i4'), ('price', '<f4')])



In [15]:

    
t['name']









    Out[15]:





dtype('S40')



In [16]:

    
itemz = array([('Meaning of life DVD', 32, 3.14), ('Butter', 13, 2.72)], dtype=t)



In [17]:

    
itemz[1]









    Out[17]:





('Butter', 13, 2.7200000286102295)

2. 数组的索引和切片



In [18]:

    
a = arange(9)
#下标0-7， 以2为步长
print a[:7:2]

#以负数下标翻转数组
print a[::-1]
print a[::-2]









    



[0 2 4 6]
[8 7 6 5 4 3 2 1 0]
[8 6 4 2 0]

多维数组的切片和索引



In [19]:

    
b = arange(24).reshape(2,3,4)
print b.shape
print b









    



(2, 3, 4)
[[[ 0  1  2  3]
  [ 4  5  6  7]
  [ 8  9 10 11]]

 [[12 13 14 15]
  [16 17 18 19]
  [20 21 22 23]]]

用三维坐标选定任意一个房间，即楼层、行号、列号



In [20]:

    
#选取第一层楼所有房间
print b[0]
print
print b[0, :, :]









    



[[ 0  1  2  3]
 [ 4  5  6  7]
 [ 8  9 10 11]]

[[ 0  1  2  3]
 [ 4  5  6  7]
 [ 8  9 10 11]]



In [21]:

    
#多个冒号用一个省略号代替
b[0, ...]









    Out[21]:





array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11]])



In [22]:

    
#间隔选元素
b[0,1,::2]









    Out[22]:





array([4, 6])



In [23]:

    
#多维数组执行翻转一维数组的命令，将在最前面的维度上翻转元素的顺序
b[::-1]









    Out[23]:





array([[[12, 13, 14, 15],
        [16, 17, 18, 19],
        [20, 21, 22, 23]],

       [[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]]])



In [24]:

    
b[::-1,::-1,::-1]









    Out[24]:





array([[[23, 22, 21, 20],
        [19, 18, 17, 16],
        [15, 14, 13, 12]],

       [[11, 10,  9,  8],
        [ 7,  6,  5,  4],
        [ 3,  2,  1,  0]]])

3. 改变数组的维度

ravel 完成展平操作



In [25]:

    
b.ravel()









    Out[25]:





array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16,
       17, 18, 19, 20, 21, 22, 23])

flatten 也是展平

flatten函数会请求分配内存来保存结果，而ravel函数只是返回数组的一个视图（view）



In [26]:

    
b.flatten()









    Out[26]:





array([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14, 15, 16,
       17, 18, 19, 20, 21, 22, 23])

用元组设置维度



In [27]:

    
b.shape = (6, 4)



In [28]:

    
b









    Out[28]:





array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15],
       [16, 17, 18, 19],
       [20, 21, 22, 23]])

transpose转置矩阵



In [29]:

    
b.transpose()









    Out[29]:





array([[ 0,  4,  8, 12, 16, 20],
       [ 1,  5,  9, 13, 17, 21],
       [ 2,  6, 10, 14, 18, 22],
       [ 3,  7, 11, 15, 19, 23]])

resize和reshape函数功能一样 但resize会直接改变所操作的数组



In [30]:

    
b.reshape(2,3,4)









    Out[30]:





array([[[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]],

       [[12, 13, 14, 15],
        [16, 17, 18, 19],
        [20, 21, 22, 23]]])



In [31]:

    
b









    Out[31]:





array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15],
       [16, 17, 18, 19],
       [20, 21, 22, 23]])



In [32]:

    
b.resize(2,12)



In [33]:

    
b









    Out[33]:





array([[ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11],
       [12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23]])