DataGrabber and SelectFiles are great if you are dealing with generic datasets with arbitrary organization. However, if you have decided to use the Brain Imaging Data Structure (BIDS) to organize your data (or got your hands on a BIDS dataset), you can take advantage of the formal structure BIDS imposes. In this short tutorial you will learn how to do this.
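To get a feel for what "formal structure" means here, the sketch below splits a BIDS-style file name into its key-value pairs, suffix, and extension using only the standard library. The helper parse_bids_name is hypothetical and is not part of pybids; it only illustrates the naming convention that makes querying possible.

```python
# Hypothetical helper, not part of pybids: split a BIDS-style file name
# into its key-value entities, its suffix, and its extension.
def parse_bids_name(filename):
    # Everything after the first dot is treated as the extension.
    stem, _, extension = filename.partition(".")
    parts = stem.split("_")
    # The last underscore-separated chunk without a dash is the suffix.
    suffix = parts.pop() if "-" not in parts[-1] else None
    # The remaining chunks are key-value pairs such as sub-01.
    entities = dict(part.split("-", 1) for part in parts)
    return entities, suffix, extension


entities, suffix, extension = parse_bids_name(
    "sub-01_ses-test_task-fingerfootlips_bold.nii.gz"
)
print(entities, suffix, extension)
```

Because every file name carries this information, a tool like pybids can answer questions such as "which subjects are there?" without any per-dataset configuration.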
In [3]:
from bids.grabbids import BIDSLayout
layout = BIDSLayout("/data/ds000114/")
In [4]:
!tree -L 4 /data/ds000114/
Let's figure out what the subject labels in this dataset are.
In [ ]:
layout.get_subjects()
Out[ ]:
What modalities are included in this dataset?
In [ ]:
layout.get_modalities()
Out[ ]:
What different data types are included in this dataset?
In [ ]:
layout.get_types(modality='func')
Out[ ]:
What are the different tasks included in this dataset?
In [ ]:
layout.get_tasks()
Out[ ]:
We can also ask for all of the data for a particular subject and one modality.
In [ ]:
layout.get(subject='01', modality="anat", session="test")
Out[ ]:
We can also ask for a specific subset of data. Note that we are using the extensions filter to get just the imaging data (BIDS allows both .nii and .nii.gz, so we need to include both).
In [ ]:
layout.get(subject='01', type='bold', extensions=['nii', 'nii.gz'])
Out[ ]:
You probably noticed that this method returns not just file paths but objects that also carry the relevant query fields. We can easily extract just the file paths.
In [ ]:
[f.filename for f in layout.get(subject='01', type='bold', extensions=['nii', 'nii.gz'])]
Out[ ]:
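If you do not have a BIDS dataset at hand, the filtering above can be imitated with the standard library alone. The sketch below builds a tiny throwaway directory tree (all file names are made up) and selects files by subject, suffix, and extension. The function find_files is a hypothetical stand-in and is not how pybids is implemented.

```python
import tempfile
from pathlib import Path

# Build a tiny, empty BIDS-like tree in a temporary directory.
# All file names here are made up for illustration.
root = Path(tempfile.mkdtemp())
names = [
    "sub-01/ses-test/func/sub-01_ses-test_task-fingerfootlips_bold.nii.gz",
    "sub-01/ses-test/func/sub-01_ses-test_task-fingerfootlips_bold.json",
    "sub-01/ses-test/anat/sub-01_ses-test_T1w.nii.gz",
    "sub-02/ses-test/func/sub-02_ses-test_task-fingerfootlips_bold.nii",
]
for name in names:
    path = root / name
    path.parent.mkdir(parents=True, exist_ok=True)
    path.touch()


def find_files(root, subject, suffix, extensions):
    """Return sorted paths for one subject, one suffix, and a list of extensions."""
    matches = []
    for path in (root / ("sub-" + subject)).rglob("*"):
        if not path.is_file():
            continue
        stem, _, extension = path.name.partition(".")
        if stem.endswith("_" + suffix) and extension in extensions:
            matches.append(str(path))
    return sorted(matches)


bolds = find_files(root, "01", "bold", ["nii", "nii.gz"])
print(bolds)
```

Note how the JSON sidecar is skipped only because of the extension filter, which is the same reason the extensions argument is passed in the query above.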
pybids in your nipype workflow
This is great, but what we really want is to include this in our nipype workflows. How do we do this? We can create our own custom BIDSDataGrabber using a Function Interface. First we need a plain Python function that, for a given subject label and dataset location, returns a list of BOLD files.
In [ ]:
def get_niftis(subject_id, data_dir):
    # Remember that all the necessary imports need to be INSIDE the function for the Function Interface to work!
    from bids.grabbids import BIDSLayout
    layout = BIDSLayout(data_dir)
    bolds = [f.filename for f in layout.get(subject=subject_id, type="bold", extensions=['nii', 'nii.gz'])]
    return bolds
In [ ]:
get_niftis('01', '/data/ds000114')
Out[ ]:
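Why do the imports have to live inside the function? A rough way to picture it, assuming the wrapper works from the function's source code and rebuilds it in a fresh namespace, is the sketch below. The helper rebuild and the basename functions are hypothetical and only mimic that behaviour with the standard library.

```python
# The function source as a string, roughly what a wrapper that re-creates
# the function elsewhere would have to work with.
source_without_import = '''
def basename(path):
    return os.path.basename(path)
'''

source_with_import = '''
def basename(path):
    import os
    return os.path.basename(path)
'''


def rebuild(source, name):
    """Recreate a function from its source in a fresh, empty namespace."""
    namespace = {}
    exec(source, namespace)
    return namespace[name]


# Without the import inside the body, the name os is unknown at call time.
try:
    rebuild(source_without_import, "basename")("/data/ds000114/sub-01")
    outcome = "worked"
except NameError:
    outcome = "NameError"

# With the import inside the body, the function is self-sufficient.
result = rebuild(source_with_import, "basename")("/data/ds000114/sub-01")
print(outcome, result)
```

The same reasoning applies to get_niftis: the BIDSLayout import travels with the function body, so it is available wherever the function ends up being executed.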
OK, we have our function. Now we need to wrap it inside a Node object.
In [ ]:
from nipype.pipeline import Node, MapNode, Workflow
from nipype.interfaces.utility import IdentityInterface, Function
In [ ]:
BIDSDataGrabber = Node(Function(function=get_niftis,
                                input_names=["subject_id", "data_dir"],
                                output_names=["bolds"]),
                       name="BIDSDataGrabber")
BIDSDataGrabber.inputs.data_dir = "/data/ds000114"
In [ ]:
BIDSDataGrabber.inputs.subject_id='01'
res = BIDSDataGrabber.run()
res.outputs
Out[ ]:
Works like a charm! (hopefully :) Let's put it in a workflow. We are not going to analyze any data, but for demonstration purposes we will add a couple of nodes that pretend to analyze their inputs.
In [ ]:
def printMe(paths):
    print("\n\nanalyzing " + str(paths) + "\n\n")

analyzeBOLD = Node(Function(function=printMe, input_names=["paths"],
                            output_names=[]), name="analyzeBOLD")
In [ ]:
wf = Workflow(name="bids_demo")
wf.connect(BIDSDataGrabber, "bolds", analyzeBOLD, "paths")
wf.run()
Out[ ]:
In the previous example we demonstrated how to use pybids to "analyze" one subject. How can we scale it to all subjects? Easy, using iterables (more in Iteration/Iterables).
In [ ]:
BIDSDataGrabber.iterables = ('subject_id', layout.get_subjects()[:2])
wf.run()
Out[ ]:
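Conceptually, setting iterables on a node makes the workflow run that node, and everything connected downstream of it, once per value. The plain Python sketch below imitates this with two hypothetical stand-in functions (grab and analyze) and a made-up file name pattern; it does not use nipype and says nothing about how nipype schedules the runs.

```python
# Hypothetical stand-ins for the two nodes in the workflow above.
def grab(subject_id):
    # Made-up file name pattern, for illustration only.
    return ["sub-%s_task-fingerfootlips_bold.nii.gz" % subject_id]


def analyze(paths):
    return "analyzing " + str(paths)


# Conceptually, iterables = ('subject_id', ['01', '02']) expands into
# one independent run of the whole chain per value.
runs = {subject_id: analyze(grab(subject_id)) for subject_id in ["01", "02"]}
print(runs)
```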
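Besides file paths, the layout can also return the metadata associated with a file, for example for one of the BOLD runs:
In [ ]: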
layout.get_metadata('/data/ds000114/sub-01/ses-test/func/sub-01_ses-test_task-fingerfootlips_bold.nii.gz')
Out[ ]:
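In BIDS, metadata lives in JSON sidecar files, and a sidecar placed higher up in the directory tree applies to the matching files below it, with more specific sidecars taking precedence. The sketch below imitates this lookup with the standard library only. It is a simplified illustration under that assumption, not the pybids implementation; the helper read_metadata is hypothetical and the metadata values are made up.

```python
import json
import tempfile
from pathlib import Path

root = Path(tempfile.mkdtemp())
func_dir = root / "sub-01" / "ses-test" / "func"
func_dir.mkdir(parents=True)

# Dataset-level sidecar: applies to every run of this task (values made up).
(root / "task-fingerfootlips_bold.json").write_text(
    json.dumps({"RepetitionTime": 2.5, "TaskName": "finger_foot_lips"})
)
# File-level sidecar: more specific, so its values take precedence.
(func_dir / "sub-01_ses-test_task-fingerfootlips_bold.json").write_text(
    json.dumps({"EchoTime": 0.05})
)
image = func_dir / "sub-01_ses-test_task-fingerfootlips_bold.nii.gz"
image.touch()


def read_metadata(image, root):
    """Merge JSON sidecars from the dataset root down to the image's folder."""
    stem = image.name.partition(".")[0]
    entities = set(stem.split("_"))
    # Folders from the root down to the one holding the image.
    folders = [root]
    for part in image.parent.relative_to(root).parts:
        folders.append(folders[-1] / part)
    metadata = {}
    for folder in folders:
        for sidecar in sorted(folder.glob("*.json")):
            # A sidecar applies if all of its name parts appear in the image name.
            if set(sidecar.stem.split("_")) <= entities:
                metadata.update(json.loads(sidecar.read_text()))
    return metadata


metadata = read_metadata(image, root)
print(metadata["RepetitionTime"])
```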
Can we incorporate this into our pipeline? Yes we can! (More about MapNode in MapNode)
In [ ]:
def printMetadata(path, data_dir):
    from bids.grabbids import BIDSLayout
    layout = BIDSLayout(data_dir)
    print("\n\nanalyzing " + path + "\nTR: " + str(layout.get_metadata(path)["RepetitionTime"]) + "\n\n")

analyzeBOLD2 = MapNode(Function(function=printMetadata, input_names=["path", "data_dir"],
                                output_names=[]), name="analyzeBOLD2", iterfield="path")
analyzeBOLD2.inputs.data_dir = "/data/ds000114/"
In [ ]:
wf = Workflow(name="bids_demo")
wf.connect(BIDSDataGrabber, "bolds", analyzeBOLD2, "path")
wf.run()
Out[ ]:
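The role of iterfield in the MapNode above can be pictured as follows: the wrapped function is called once per element of the list passed to the iterfield input, while all other inputs stay fixed. The sketch below imitates that in plain Python. The names describe and map_over are hypothetical, the file names are made up, and nipype itself is not used.

```python
# Hypothetical stand-in for printMetadata: one path in, one string out.
def describe(path, data_dir):
    return path + " from " + data_dir


def map_over(function, iterfield, inputs):
    """Call function once per element of inputs[iterfield], other inputs fixed."""
    fixed = {key: value for key, value in inputs.items() if key != iterfield}
    return [function(**{iterfield: item}, **fixed) for item in inputs[iterfield]]


results = map_over(
    describe,
    iterfield="path",
    inputs={
        "path": ["run-1_bold.nii.gz", "run-2_bold.nii.gz"],  # made-up names
        "data_dir": "/data/ds000114/",
    },
)
print(results)
```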