Corpus callosum's shape signature for segmentation error detection in large datasets

Abstract

Corpus Callosum (CC) is a subcortical, white matter structure with great importance in clinical and research studies because its shape and volume are correlated with subject's characteristics and neurodegenerative diseases. CC segmentation is a important step for any medical, clinical or research posterior study. Currently, magnetic resonance imaging (MRI) is the main tool for evaluating brain because it offers the better soft tissue contrast. Particullary, segmentation in MRI difussion modality has great importante given information associated to brain microstruture and fiber composition.

In this work a method for detection of erroneous segmentations in large datasets is proposed based-on shape signature. Shape signature is obtained from segmentation, calculating curvature along contour using a spline formulation. A mean correct signature is used as reference for compare new segmentations through root mean square error. This method was applied to 152 subject dataset for three different segmentation methods in diffusion: Watershed, ROQS and pixel-based presenting high accuracy in error detection. This method do not require per-segmentation reference and it can be applied to any MRI modality and other image aplications.



In [1]:

    
## Functions

import sys,os
import copy
path = os.path.abspath('../dev/')
if path not in sys.path:
    sys.path.append(path)

import bib_mri as FW
import numpy as np
import scipy as scipy
import scipy.misc as misc 
import matplotlib as mpl
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D
from matplotlib import pyplot
from numpy import genfromtxt
import platform

import torch
from torch.autograd import Variable
import torch.nn as nn
import torch.nn.functional as F
%matplotlib inline

def sign_extract(seg, resols): #Function for shape signature extraction
    splines = FW.get_spline(seg,smoothness)

    sign_vect = np.array([]).reshape(0,points) #Initializing temporal signature vector
    for resol in resols:
        sign_vect = np.vstack((sign_vect, FW.get_profile(splines, n_samples=points, radius=resol)))
    
    return sign_vect

def sign_fit(sig_ref, sig_fit): #Function for signature fitting
    dif_curv = []
    for shift in range(points):
        dif_curv.append(np.abs(np.sum((sig_ref - np.roll(sig_fit[0],shift))**2)))
    return np.apply_along_axis(np.roll, 1, sig_fit, np.argmin(dif_curv))

print "Python version: ", platform.python_version()
print "Numpy version: ", np.version.version
print "Scipy version: ", scipy.__version__
print "Matplotlib version: ", mpl.__version__









    



Python version:  2.7.13
Numpy version:  1.12.1
Scipy version:  0.19.0
Matplotlib version:  2.0.2

Introduction

The Corpus Callosum (CC) is the largest white matter structure in the central nervous system that connects both brain hemispheres and allows the communication between them. The CC has great importance in research studies due to the correlation between shape and volume with some subject's characteristics, such as: gender, age, numeric and mathematical skills and handedness. In addition, some neurodegenerative diseases like Alzheimer, autism, schizophrenia and dyslexia could cause CC shape deformation.

CC segmentation is a necessary step for morphological and physiological features extraction in order to analyze the structure in image-based clinical and research applications. Magnetic Resonance Imaging (MRI) is the most suitable image technique for CC segmentation due to its ability to provide contrast between brain tissues however CC segmentation is challenging because of the shape and intensity variability between subjects, volume partial effect in diffusion MRI, fornex proximity and narrow areas in CC. Among the known MRI modalities, Diffusion-MRI arouses special interest to study the CC, despite its low resolution and high complexity, since it provides useful information related to the organization of brain tissues and the magnetic field does not interfere with the diffusion process itself.

Some CC segmentation approaches using Diffusion-MRI were found in the literature. Niogi et al. proposed a method based on thresholding, Freitas et al. e Rittner et al. proposed region methods based on Watershed transform, Nazem-Zadeh et al. implemented based on level surfaces, Kong et al. presented an clustering algorithm for segmentation, Herrera et al. segmented CC directly in diffusion weighted imaging (DWI) using a model based on pixel classification and Garcia et al. proposed a hybrid segmentation method based on active geodesic regions and level surfaces.

With the growing of data and the proliferation of automatic algorithms, segmentation over large databases is affordable. Therefore, error automatic detection is important in order to facilitate and speed up filter on CC segmentation databases. presented proposals for content-based image retrieval (CBIR) using shape signature of the planar object representation.

In this work, a method for automatic detection of segmentation error in large datasets is proposed based on CC shape signature. Signature offers shape characterization of the CC and therefore it is expected that a "typical correct signature" represents well any correct segmentation. Signature is extracted measuring curvature along segmentation contour. The method was implemented in three main stages: mean correct signature generation, signature configuration and method testing. The first one takes 20 corrects segmentations and generates one correct signature of reference (typical correct signature), per-resolution, using mean values in each point. The second stage stage takes 10 correct segmentations and 10 erroneous segmentations and adjusts the optimal resolution and threshold, based on mean correct signature, that lets detection of erroneous segmentations. The third stage labels a new segmentation as correct and erroneous comparing with the mean signature using optimal resolution and threshold.

The comparison between signatures is done using root mean square error (RMSE). True label for each segmentation was done visually. Correct segmentation corresponds to segmentations with at least 50% of agreement with the structure. It is expected that RMSE for correct segmentations is lower than RMSE associated to erroneous segmentation when compared with a typical correct segmentation.



In [2]:

    
#Loading labeled segmentations
seg_label = genfromtxt('../../dataset/Seg_Watershed/watershed_label.csv', delimiter=',').astype('uint8')

list_masks_wt = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 0] #Extracting segmentations
list_labels_wt = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 1] #Extracting labels

Shape signature for comparison

Signature is a shape descriptor that measures the rate of variation along the segmentation contour. As shown in figure, the curvature $k$ in the pivot point $p$, with coordinates ($x_p$,$y_p$), is calculated using the next equation. This curvature depict the angle between the segments $\overline{(x_{p-ls},y_{p-ls})(x_p,y_p)}$ and $\overline{(x_p,y_p)(x_{p+ls},y_{p+ls})}$. These segments are located to a distance $ls>0$, starting in a pivot point and finishing in anterior and posterior points, respectively. The signature is obtained calculating the curvature along all segmentation contour.

\begin{equation} \label{eq:per1} k(x_p,y_p) = \arctan\left(\frac{y_{p+ls}-y_p}{x_{p+ls}-x_p}\right)-\arctan\left(\frac{y_p-y_{p-ls}}{x_p-x_{p-ls}}\right) \end{equation}

Signature construction is performed from segmentation contour of the CC. From contour, spline is obtained. Spline purpose is twofold: to get a smooth representation of the contour and to facilitate calculation of the curvature using its parametric representation. The signature is obtained measuring curvature along spline. $ls$ is the parametric distance between pivot point and both posterior and anterior points and it determines signature resolution. By simplicity, $ls$ is measured in percentage of reconstructed spline points.

In order to achieve quantitative comparison between two signatures root mean square error (RMSE) is introduced. RMSE measures distance, point to point, between signatures $a$ and $b$ along all points $p$ of signatures.

\begin{equation} \label{eq:per4} RMSE = \sqrt{\frac{1}{P}\sum_{p=1}^{P}(k_{ap}-k_{bp})^2} \end{equation}

Frequently, signatures of different segmentations are not fitted along the 'x' axis because of the initial point on the spline calculation starts in different relative positions. This makes impossible to compare directly two signatures and therefore, a prior fitting process must be accomplished. The fitting process is done shifting one of the signature while the other is kept fixed. For each shift, RMSE between the two signatures is measured. The point giving the minor error is the fitting point. Fitting was done at resolution $ls = 0.35$. This resolution represents globally the CC's shape and eases their fitting.

After fitting, RMSE between signatures can be measured in order to achieve final quantitative comparison.

Signature for segmentation error detection

For segmentation error detection, a typical correct signature is obtained calculating mean over a group of signatures from correct segmentations. Because of this signature could be used in any resolution, $ls$ must be chosen for achieve segmentation error detection. The optimal resolution must be able to return the greatest RMSE difference between correct and erroneous segmentation when compared with a typical correct signature.

In the optimal resolution, a threshold must be chosen for separate erroneous and correct segmentations. This threshold stays between RMSE associated to correct ($RMSE_E$) and erroneous ($RMSE_C$) signatures and it is given by the next equation where N (in percentage) represents proximity to correct or erroneous RMSE. If RMSE calculated over a group of signatures, mean value is applied.

\begin{equation} \label{eq:eq3} th = N*(\overline{RMSE_E}-\overline{RMSE_C})+\overline{RMSE_C} \end{equation}

Experiments and results

In this work, comparison of signatures through RMSE is used for segmentation error detection in large datasets. For this, it will be calculated a mean correct signature based on 20 correct segmentation signatures. This mean correct signature represents a tipycal correct segmentation. For a new segmentation, signature is extracted and compared with mean signature.

For experiments, DWI from 152 subjects at the University of Campinas, were acquired on a Philips scanner Achieva 3T in the axial plane with a $1$x$1mm$ spatial resolution and $2mm$ slice thickness, along $32$ directions ($b-value=1000s/mm^2$, $TR=8.5s$, and $TE=61ms$). All data used in this experiment was acquired through a project approved by the research ethics committee from the School of Medicine at UNICAMP. From each acquired DWI volume, only the midsaggital slice was used.

Three segmentation methods were implemented to obtained binary masks over a 152 subject dataset: Watershed, ROQS and pixel-based. 40 Watershed segmentations were chosen as follows: 20 correct segmentations for mean correct signature generation and 10 correct and 10 erroneous segmentations for signature configuration stage. Watershed was chosen to generate and adjust the mean signature because of its higher error rate and its variability in the erroneous segmentation shape. These characteristics allow improve generalization. The method was tested on the remaining Watershed segmentations (108 masks) and two additional segmentations methods: ROQS (152 masks) and pixel-based (152 masks).

Mean correct signature generation

In this work, segmentations based on Watershed method were used for implementation of the first and second stages. From the Watershed dataset, 20 correct segmentations were chosen. Spline for each one was obtained from segmentation contour. The contour was obtained using mathematical morphology, applying xor logical operation, pixel-wise, between original segmentation and the eroded version of itself by an structuring element b:

\begin{equation} \label{eq:per2} G_E = XOR(S,S \ominus b) \end{equation}

From contour, it is calculated spline. The implementation, is a B-spline (Boor's basic spline). This formulation has two parameters: degree, representing polynomial degrees of the spline, and smoothness, being the trade off between proximity and smoothness in the fitness of the spline. Degree was fixed in 5 allowing adequate representation of the contour. Smoothness was fixed in 700. This value is based on the mean quantity of pixels of the contour that are passed for spline calculation. The curvature was measured over 500 points over the spline to generate the signature along 20 segmentations. Signatures were fitted to make possible comparison (Fig. signatures). Fitting resolution was fixed in 0.35.

In order to get a representative correct signature, mean signature per-resolution is generated using 20 correct signatures. The mean is calculated in each point.

Signature configuration

Because of the mean signature was extracted for all the resolutions, it is necessary to find resolution in that diference between RMSE for correct signature and RMSE for erroneous signature is maximum. So, 20 news segmentations were used to find this optimal resolution, being divided as 10 correct segmentations and 10 erroneous segmentations. For each segmentation, it was extracted signature for all resolutions.



In [3]:

    
smoothness = 700 #Smoothness
degree = 5 #Spline degree
fit_res = 0.35
resols = np.arange(0.01,0.5,0.01) #Signature resolutions
resols = np.insert(resols,0,fit_res) #Insert resolution for signature fitting
points = 500 #Points of Spline reconstruction

prof_vec_wt = np.empty((len(list_masks_wt),resols.shape[0],points)) #Initializing correct signature vector
for ind, mask in enumerate(list_masks_wt):
    #Loading correct mask
    mask_pn = np.load('../../dataset/Seg_Watershed/mask_wate_{}.npy'.format(mask))
    refer_temp = sign_extract(mask_pn, resols) #Function for shape signature extraction
    prof_vec_wt[ind] = refer_temp
    if mask > 0: #Fitting curves using the first one as basis
        prof_ref = prof_vec_wt[0]
        prof_vec_wt[ind] = sign_fit(prof_ref[0], refer_temp) #Function for signature fitting

ind_rel_cor = np.where(np.logical_not(list_labels_wt))[0]
ind_rel_err = np.where(list_labels_wt)[0]

print "Correct segmentations' vector: ", prof_vec_wt[ind_rel_cor].shape
print "Erroneous segmentations' vector: ", prof_vec_wt[ind_rel_err].shape









    



Correct segmentations' vector:  (123, 50, 500)
Erroneous segmentations' vector:  (24, 50, 500)



In [4]:

    
res_ex = 15



In [5]:

    
for res_ver in range(prof_vec_wt.shape[1]):
    plt.figure()
    plt.plot(prof_vec_wt[ind_rel_cor,res_ver,:].T)
    plt.title("Correct signatures for res: %f"%(resols[res_ver]))
    plt.show()

    plt.figure()
    plt.plot(prof_vec_wt[ind_rel_err,res_ver,:].T)
    plt.title("Erroneous signatures for res: %f"%(resols[res_ver]))
    plt.show()



In [6]:

    
#Loading labeled segmentations
seg_label = genfromtxt('../../dataset/Seg_ROQS/roqs_label.csv', delimiter=',').astype('uint8')

list_masks_rq = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 0] #Extracting segmentations
list_labels_rq = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 1] #Extracting labels



In [7]:

    
prof_vec_rq = np.empty((len(list_masks_rq),resols.shape[0],points)) #Initializing correct signature vector
for ind, mask in enumerate(list_masks_rq):
    mask_pn = np.load('../../dataset/Seg_ROQS/mask_roqs_{}.npy'.format(mask)) #Loading mask
    refer_temp = sign_extract(mask_pn, resols) #Function for shape signature extraction
    prof_vec_rq[ind] = sign_fit(prof_vec_wt[0][0], refer_temp) #Function for signature fitting using Watershed as basis

ind_rel_cor = np.where(np.logical_not(list_labels_rq))[0]
ind_rel_err = np.where(list_labels_rq)[0]
        
print "Correct segmentations' vector: ", prof_vec_rq[ind_rel_cor].shape
print "Erroneous segmentations' vector: ", prof_vec_rq[ind_rel_err].shape









    



Correct segmentations' vector:  (136, 50, 500)
Erroneous segmentations' vector:  (16, 50, 500)



In [8]:

    
plt.figure()
plt.plot(prof_vec_rq[ind_rel_cor,res_ex,:].T)
plt.title("Correct signatures for res: %f"%(resols[res_ex]))
plt.show()

plt.figure()
plt.plot(prof_vec_rq[ind_rel_err,res_ex,:].T)
plt.title("Erroneous signatures for res: %f"%(resols[res_ex]))
plt.show()



In [9]:

    
#Loading labeled segmentations
seg_label = genfromtxt('../../dataset/Seg_pixel/pixel_label.csv', delimiter=',').astype('uint8')

list_masks_px = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 0] #Extracting segmentations
list_labels_px = seg_label[np.logical_or(seg_label[:,1] == 0, seg_label[:,1] == 1), 1] #Extracting labels



In [10]:

    
prof_vec_px = np.empty((len(list_masks_px),resols.shape[0],points)) #Initializing correct signature vector
for ind, mask in enumerate(list_masks_px):
    mask_pn = np.load('../../dataset/Seg_pixel/mask_pixe_{}.npy'.format(mask)) #Loading mask
    refer_temp = sign_extract(mask_pn, resols) #Function for shape signature extraction
    prof_vec_px[ind] = sign_fit(prof_vec_wt[0][0], refer_temp) #Function for signature fitting using Watershed as basis

ind_rel_cor = np.where(np.logical_not(list_labels_px))[0]
ind_rel_err = np.where(list_labels_px)[0]
        
print "Correct segmentations' vector: ", prof_vec_px[ind_rel_cor].shape
print "Erroneous segmentations' vector: ", prof_vec_px[ind_rel_err].shape









    



Correct segmentations' vector:  (131, 50, 500)
Erroneous segmentations' vector:  (21, 50, 500)



In [11]:

    
plt.figure()
plt.plot(prof_vec_px[ind_rel_cor,res_ex,:].T)
plt.title("Correct signatures for res: %f"%(resols[res_ex]))
plt.show()

plt.figure()
plt.plot(prof_vec_px[ind_rel_err,res_ex,:].T)
plt.title("Erroneous signatures for res: %f"%(resols[res_ex]))
plt.show()

Datasets



In [12]:

    
X_total = np.concatenate((prof_vec_wt, prof_vec_rq, prof_vec_px), axis=0)
list_labels_rq += 2
list_labels_px += 4
Y_total = np.concatenate((list_labels_wt, list_labels_rq, list_labels_px), axis=0)
print 'Profiles: ', X_total.shape
print 'Labels: ', Y_total.shape









    



Profiles:  (451, 50, 500)
Labels:  (451,)



In [13]:

    
split_train = 0.7
#split_valid = split_train + 0.2
set_split_train = int(Y_total.shape[0]*split_train)
#set_split_valid = int(Y_total.shape[0]*split_valid)
index_random = np.arange(Y_total.shape[0])
np.random.shuffle(index_random)

X_train = X_total[index_random[:set_split_train]]
X_valid = X_total[index_random[set_split_train:]]
#X_test = X_total[index_random[set_split_valid:]]
Y_train = Y_total[index_random[:set_split_train]]
Y_valid = Y_total[index_random[set_split_train:]]
#Y_test = Y_total[index_random[set_split_valid:]]
print 'X_train: ', X_train.shape, 'Y_train: ', Y_train.shape
print 'X_valid: ', X_valid.shape, 'Y_valid', Y_valid.shape
#print X_test.shape, Y_test.shape









    



X_train:  (315, 50, 500) Y_train:  (315,)
X_valid:  (136, 50, 500) Y_valid (136,)



In [14]:

    
res_chs = 30
trainloader = X_train[:,res_chs,:]
validloader = X_valid[:,res_chs,:]
val_norm = np.amax(trainloader).astype(float)
print 'Normalization value: ', val_norm

trainloader = trainloader / val_norm
trainloader = torch.FloatTensor(trainloader)
validloader = validloader / val_norm
validloader = torch.FloatTensor(validloader)

print 'Trainloader: ', trainloader.size()
print 'Validloader: ', validloader.size()









    



Normalization value:  224.279297493
Trainloader:  torch.Size([315, 500])
Validloader:  torch.Size([136, 500])

Autoencoder



In [17]:

    
def train(model,train_loader,valid_loader,loss_fn,optimizer,epochs=100,num_break=5,patience=5,criteria_stop="loss"):
    hist_train_loss = hist_val_loss = hist_train_acc = hist_val_acc = np.array([])
    best_epoch = patience_count = 0

    print("Training starts along %i epoch"%epochs)
    for e in range(epochs):
        correct_train = correct_val = total_train = total_val = 0
        cont_i = loss_t_e = loss_v_e = 0
        for data_train, data_valid in zip(train_loader, valid_loader):
            #if cont_i == num_break:
            #    break

            inputs = data_train
            var_inputs = Variable(inputs)

            predict, encode = model(var_inputs)
            loss = loss_fn(predict, var_inputs.view(-1, 500))
            loss_t_e += loss.data[0]
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            ##Validating model
            inputs = data_valid
            var_inputs = Variable(inputs)
            predict, encode = model(var_inputs)
            loss_val = loss_fn(predict, var_inputs.view(-1, 500))
            loss_v_e += loss_val.data[0]

            cont_i += 1

        #Stacking historical
        hist_train_loss = np.hstack((hist_train_loss, loss_t_e/(cont_i*1.0)))
        hist_val_loss = np.hstack((hist_val_loss, loss_v_e/(cont_i*1.0)))
        if e%10 == 0:
            print('Epoch: ', e, 'train loss: ', hist_train_loss[-1], 'valid loss: ', hist_val_loss[-1])

        #Early stopping by best accuracy or best loss
        if(criteria_stop == "acc"):
            if(hist_val_acc[best_epoch] < hist_val_acc[-1]):
                patience_count = 0
                best_epoch = e
                best_model = copy.deepcopy(model)
        else:
            if(hist_val_loss[best_epoch] > hist_val_loss[-1]):
                patience_count = 0
                best_epoch = e
                best_model = copy.deepcopy(model)

        if(patience_count > patience):
            print("Early stopped")
            break;

        if(e == epochs-1):
            print("Training stopped")
        patience_count += 1

    return(best_model, hist_train_loss, hist_val_loss)

class autoencoder2(nn.Module):
    def __init__(self):
        super(autoencoder2, self).__init__()
        self.fc1 = nn.Linear(500, 200)
        self.fc21 = nn.Linear(200, 2)
        self.fc3 = nn.Linear(2, 200)
        self.fc4 = nn.Linear(200, 500)
        self.relu = nn.ReLU()
        self.sigmoid = nn.Sigmoid()

    def encode(self, x):
        h1 = self.relu(self.fc1(x))
        return self.fc21(h1)

    def decode(self, z):
        h3 = self.relu(self.fc3(z))
        return self.sigmoid(self.fc4(h3))

    def forward(self, x):
        z = self.encode(x.view(-1, 500))
        return self.decode(z), z

class autoencoder3(nn.Module):
    def __init__(self):
        super(autoencoder3, self).__init__()
        self.fc1 = nn.Linear(500, 200)
        self.fc21 = nn.Linear(200, 3)
        self.fc3 = nn.Linear(3, 200)
        self.fc4 = nn.Linear(200, 500)
        self.relu = nn.ReLU()
        self.sigmoid = nn.Sigmoid()

    def encode(self, x):
        h1 = self.relu(self.fc1(x))
        return self.fc21(h1)

    def decode(self, z):
        h3 = self.relu(self.fc3(z))
        return self.sigmoid(self.fc4(h3))

    def forward(self, x):
        z = self.encode(x.view(-1, 500))
        return self.decode(z), z



In [18]:

    
net = autoencoder2()
print(net)









    



autoencoder2 (
  (fc1): Linear (500 -> 200)
  (fc21): Linear (200 -> 2)
  (fc3): Linear (2 -> 200)
  (fc4): Linear (200 -> 500)
  (relu): ReLU ()
  (sigmoid): Sigmoid ()
)



In [19]:

    
loss_fn = torch.nn.MSELoss()
optimizer = torch.optim.Adam(net.parameters())
epochs = 50
patience = 5
max_batch = 1000
criteria = "loss"

best_model, loss, loss_test = train(net, trainloader, validloader, loss_fn, optimizer, epochs = epochs, 
                                     num_break = max_batch, patience = patience, criteria_stop = criteria)









    



Training starts along 50 epoch
('Epoch: ', 0, 'train loss: ', 0.01468638300887987, 'valid loss: ', 0.012615403779765236)
('Epoch: ', 10, 'train loss: ', 0.0012183356182718212, 'valid loss: ', 0.0013649369490902994)
('Epoch: ', 20, 'train loss: ', 0.00088804597191269481, 'valid loss: ', 0.0011216130587854423)
('Epoch: ', 30, 'train loss: ', 0.00078659657061935459, 'valid loss: ', 0.0010682195767431574)
Early stopped



In [20]:

    
plt.title('Loss')
plt.xlabel('epochs')
plt.ylabel('loss')
plt.plot(loss, label='Train')
plt.plot(loss_test, label='Valid')
plt.legend()
plt.show()



In [21]:

    
__, encode = best_model(Variable(trainloader))
out_enc_tr = encode.data.numpy()
__, encode = best_model(Variable(validloader))
out_enc_vl = encode.data.numpy()
print 'Encoded Train', out_enc_tr.shape, Y_train.shape
print 'Encoded Valid', out_enc_vl.shape, Y_valid.shape









    



Encoded Train (315, 2) (315,)
Encoded Valid (136, 2) (136,)



In [22]:

    
colors = ['darkblue','darkblue','green','green','orange','orange']
labels = ['W_c','W_e','R_c','R_e','P_c','P_e']
markers = ['o','x','o','x','o','x']
alphas = [0.4,1,0.4,1,0.4,1]

def plot_scatter(spoints, y, ind, label):
    px = spoints[:,0]
    py = spoints[:,1]
    unique = list(set(y))
    ax = plt.subplot(1, 2, ind)
    plt.tight_layout()
    ax.set_title(label)
    for i, u in enumerate(unique):
        xi = [px[j] for j  in range(len(px)) if y[j] == u]
        yi = [py[j] for j  in range(len(px)) if y[j] == u]
        plt.scatter(xi, yi, c=colors[i], marker=markers[u], label=labels[u], alpha=alphas[u])
    plt.legend()

fig = plt.figure(figsize=(12,5))
fig.subplots_adjust(left=0, right=0.1, bottom=0, top=0.1, hspace=0, wspace=0.)

plot_scatter(out_enc_tr, Y_train, 1, 'Training Dataset')
plot_scatter(out_enc_vl, Y_valid, 2, 'Validation Dataset')

plt.show()



In [23]:

    
net = autoencoder3()
print(net)









    



autoencoder3 (
  (fc1): Linear (500 -> 200)
  (fc21): Linear (200 -> 3)
  (fc3): Linear (3 -> 200)
  (fc4): Linear (200 -> 500)
  (relu): ReLU ()
  (sigmoid): Sigmoid ()
)



In [24]:

    
loss_fn = torch.nn.MSELoss()
optimizer = torch.optim.Adam(net.parameters())
epochs = 50
patience = 5
max_batch = 1000
criteria = "loss"

best_model, loss, loss_test = train(net, trainloader, validloader, loss_fn, optimizer, epochs = epochs, 
                                     num_break = max_batch, patience = patience, criteria_stop = criteria)









    



Training starts along 50 epoch
('Epoch: ', 0, 'train loss: ', 0.012451977284346119, 'valid loss: ', 0.010514065018522908)
('Epoch: ', 10, 'train loss: ', 0.0011055479303382419, 'valid loss: ', 0.0012517396082651377)
('Epoch: ', 20, 'train loss: ', 0.00080817055197044015, 'valid loss: ', 0.0011050771765226005)
Early stopped



In [25]:

    
plt.title('Loss')
plt.xlabel('epochs')
plt.ylabel('loss')
plt.plot(loss, label='Train')
plt.plot(loss_test, label='Valid')
plt.legend()
plt.show()



In [26]:

    
__, encode = best_model(Variable(trainloader))
out_enc_tr = encode.data.numpy()
__, encode = best_model(Variable(validloader))
out_enc_vl = encode.data.numpy()
print 'Encoded Train', out_enc_tr.shape, Y_train.shape
print 'Encoded Valid', out_enc_vl.shape, Y_valid.shape









    



Encoded Train (315, 3) (315,)
Encoded Valid (136, 3) (136,)



In [33]:

    
colors = ['darkblue','darkblue','green','green','orange','orange']
labels = ['W_c','W_e','R_c','R_e','P_c','P_e']
markers = ['o','x','o','x','o','x']
alphas = [0.4,1,0.4,1,0.4,1]

def plot_scatter(spoints, y, ind, label):
    fig = pyplot.figure()
    ax = Axes3D(fig)
    ax.set_title(label)
    px = spoints[:,0]
    py = spoints[:,1]
    pz = spoints[:,2]
    unique = list(set(y))
    for i, u in enumerate(unique):
        xi = [px[j] for j  in range(len(px)) if y[j] == u]
        yi = [py[j] for j  in range(len(px)) if y[j] == u]
        zi = [pz[j] for j  in range(len(px)) if y[j] == u]
        ax.scatter(xi, yi, zi, c=colors[i], marker=markers[u], label=labels[u], alpha=alphas[u])
    ax.view_init(elev=50., azim=20.)
    plt.legend()
    pyplot.show()

plot_scatter(out_enc_tr, Y_train, 1, 'Training Dataset')
plot_scatter(out_enc_vl, Y_valid, 2, 'Validation Dataset')

The RMSE over the 10 correct segmentations was compared with RMSE over the 10 erroneous segmentations. As expected, RMSE for correct segmentations was greater than RMSE for erroneous segmentations along all the resolutions. In general, this is true, but optimal resolution guarantee the maximum difference between both of RMSE results: correct and erroneous.

So, to find optimal resolution, difference between correct and erroneous RMSE was calculated over all resolutions.

The greatest difference resulted at resolution 0.1. In this resolution, threshold for separate erroneous and correct segmentations is established as 30% of the distance between the mean RMSE of the correct masks and the mean RMSE of the erroneous masks.

Method testing

Finally, method test was performed in the 152 subject dataset: Watershed dataset with 112 segmentations, ROQS dataset with 152 segmentations and pixel-based dataset with 152 segmentations.

Discussion and conclusion

In this work, a method for segmentation error detection in large datasets was proposed based-on shape signature. RMSE was used for comparison between signatures. Signature can be extracted en various resolutions but optimal resolution (ls=0.1) was chosen in order to get maximum separation between correct RMSE and erroneous RMSE. In this optimal resolution, threshold was fixed at 27.95 allowing separation of the two segmentation classes.The method achieved 95% of accuracy on the test Watershed segmentations, and 95% and 94% on new datasets: ROQS and pixel-based, respectively.

40 Watershed segmentations on dataset were used to generation and configuration mean correct signature because of the greater number of erroneous segmentations and major variability on the error shape. Because the signature holds the CC shape, the method can be extended to new datasets segmented with any method. Accuracy and generalization can be improve varying the segmentations used to generate and adjust the mean correct signature.