Quality control

AUTHOR

Written by Lina Mograby.

Previous version by Elena Pozzi, Yara Toenders, Ilya Veer, Lea Waller, Courtney Haswell, Rajendra Morey, and Lianne Schmaal

Please address questions and comments to [email protected]. You can also join the HALFpipe Mattermost (similar to Slack) using this link.

Before starting the QC

After running the pipeline, an output folder will be created. If the pipeline ran successfully, your output folder should look like this:

📂 /Output
├──reports → this contains all the QC information
├──derivatives → this contains the extracted features for each subject (from the CorrMatrix, DualReg, fALFF, ReHo, SeedCorr analyses), along with the confound file (confounds.tsv), the preprocessed image(s) (preproc.nii.gz) and the mask_file.nii.gz
├──nypipe  → this contains the intermediate pre-processing files. When using the standard pipeline, not all intermediate images are saved to save space. However you can choose to save everything by changing the flag: --keep (more information can be found on GitHub)
├──rawdata → this folder contains the raw T1 images and epi in BIDS
├──log.txt
├──err.txt
├──halfpipe.log.txt 
├──spec.json
├── an execgraph and workflow file (you can ignore this)
└──possibly files starting with ‘crash’

The files important for QC are in the reports folder. This contains:

The web file index.html, which is the main web page that you will use to perform quality control (see below)
reportvals.txt → this is a group file that contains the mean of the quality metrics (aroma_noise_frac, the fd_gt_0_5, mean_fd, mean_gm_tsnr) for each subject
reportpreproc.txt → this is a group file that reports the status of the pre-processing (done/missing) of the resting state image for each subject
In the reports folder there are also individual subject folders (that contain the images used to create the index.html and other files .js and .json.lock. You don’t have to worry about these files.)

Tip

Note: you can copy the QC folder (reports) anywhere (e.g., on a local machine) as long as you copy the entire reports folder. The reports folder will take ~10 MB of space per subject plus 10 MB for each functional image. So if each subject has just the resting state, it would take ~20 MB per subject.

Important files for troubleshooting:

The log.txt file contains the record of all the steps performed by the pipeline when it was running. Checking the log will help you troubleshoot if you run into any problems.
The err.txt file contains the record of the errors
The crash files might contain information about processes that crashed. If you submit an issue on GitHub, please include this file.

Step 1: Open the summary report in the reports folder

The summary report (reportpreproc.txt in the reports folder) should look something like this:

subject	task	status
1	rest	done
2	rest	missing
3	rest	missing
4	rest	done

Done means that all the output files of interest (i.e., preprocessed image, confound files, connectivity matrices, ICA maps, seed connectivity maps, alff, reho) for that subject were created. These derivative files can be found in the folder <path/to/output/derivatives/halfpipe/subjectfolder>.
Conversely, subjects with the status missing did not run completely. This means that not all the output files were created.

Step 1.1: Check for missing files in the summary report

If you don’t have any subject listed under missing
If you have any subject listed under missing

If you are on a HPC, you can download the entire /reports folder on a local machine to open and visualize the index.html file on your browser.

Continue to step 2.

Please have a look at the log.txt file (located in the main output folder). To open the log file, you can use any text editor. Use the search function to search for ‘error’, to understand if something and what went wrong. Common errors are listed in the troubleshooting section.

Step 1.1.1: Re-run missing or failed subjects

If there are any missing subjects, make a text file with all the missing subject IDs, one on each line. These could also be new participants that have not yet been processed.
Re-run HALFpipe as before, but add the option --subject-list <full path to text file> to the end of the command.

Only if running on HPC - Edit your submission script to add the option, and then submit the job to the HPC queue.

If you are unable to troubleshoot, please open an issue on GitHub.

Step 2: Open the index.html page

Open the index.html file in the reports folder in your favourite browser. You can navigate through different windows by clicking on the drop-down menu (the three lines in the top left corner next to Nypipe status, see image below): charts, explore reports and report detail view.

Tip

As you make them, the ratings are saved in the browser’s local storage, and they will stay there unless you delete the cookies/site data. To review the ratings you made, open the index.html file from the same local machine using the same browser.

Charts

This is divided in 3 sections:

Nipype status (pending, success, error)
Summary and ratings for each feature (summary, T1w skull-stripping, T1w spatial normalization, EPI tSNR, EPI Confounds, EPI ICA, EPI spatial normalization)
Group plots for Mean FD, % of frames with FD>0.5, % IC classified as noise, mean GM tSNR → each dot is a subject, if you click on a dot it will take you to their QC page

Explore reports

The reports contain images for each participant for the following pre-processing steps: 1. T1w skull stripping and segmentation 2. T1w spatial normalisation 3. EPI tSNR
4. EPI confounds (carpet plot) 5. EPI ICA-based artefact removal 6. EPI spatial normalisationƒ

Each pre-processing step should be rated for each participant (good, uncertain, bad) using the emoticons. You can zoom in by clicking on the report (or clicking on ‘report detail view’ from the dropdown menu). All images/subjects should be rated (i.e., either good, uncertain, or bad), in order to have a comprehensive QA from each site.

Instead of using the mouse clicks to rate the images, you can also use the following keyboard acuts: [w] rates image as good, [s] as uncertain and [x] as bad. The keys [d] and [a] jump to the next, respectively the previous image.

After rating the images (see below for the explanation), the rating should be exported by going to the menu (the icon in the upper left corner, see example) of the reports page, and then clicking “Export”. They will be saved as ‘exclude.json’. The ratings are also displayed in the summary and ratings for each feature chart. If ratings do not get exported, there’s a risk of losing your progress once you delete your cookies.

Example: shows the menu and export button on the report

You can switch between sort ‘per image type’ or ‘per subject’. Sorting the reports per image type (e.g., by skull stripping reports) will help you get a sense of how each image looks like across all subjects and will give you an idea of wwhat a ‘normal’ image looks like and what stands out.

Sorting per subject will allow you to check how the same subject performed across the different pre-processing steps. We recommend sorting per image type, because that gives you an idea of what a good or bad image looks like. However, if you are unsure about the quality of a certain subject in one step, switch to sorting per subject to see how the same subject performed across the different steps (this could be useful to determine if they need to be excluded).

Below we report examples of each pre-processing step, how they should look like, what to look for to identify errors and when to exclude a participant.

Important

If you rate a subject as ‘bad’* in one of the images, it will later automatically be excluded from the group analysis.

Obvious issues (e.g., large portion of the brain missing after skull stripping, poor T1 or EPI normalisation, very clear artefacts in the EPI showing in the tSNR) are usually worth the exclusion of the subject (or you can try to re-run the subject if the problem is for example skull stripping).

Usually, bad examples of skull stripping, registration etc are the results of a problematic subject (e.g., lot of motion, not enough WM/GM contrast, signal loss, etc), so it is helpful to check all the reports for that subject, especially if you are unsure, before deciding to exclude them.

The carpet plot and the ICA AROMA report can be hard to interpret on their own and (especially if you are unsure about the result) we recommend examining them in conjunction with the other images.

T1w skull stripping

Skull stripping is the process separating the brain (cortex and cerebellum) from the skull. The red line follows the outline of the brain and it separates it from the skull.

Example of a good subject

There are no skull stripping errors, such as portions of the brain missing, or too much of the skull retained
The red line follows the outline of the brain

T1w skull stripping: examples of good subjects

Example of a bad subject

There are skull stripping errors, such as portions of the brain missing, or too much of the skull retained
NOTE: check all the images (slices) in the report. If only one image (slice) looks problematic, it is possible that the subject is okay and it is just a visual issue in that particular screenshot

T1w skull stripping: example of a bad subject: images in the upper and lower row look very distorted, possibly due to an issue with the scanner. Large portions of the brain have been left out even in the middle row.

T1w skull stripping: example of a bad subject: large portions of the brain have been left out.

T1w skull stripping: example of a bad subject: the red line includes areas (in the upper row) which don’t contain brain tissue.

T1w skull stripping: example of a bad subject: portions of the brain have been left out, parts of the cranium and the eyes are included in the red line.

T1w skull stripping: example of a bad subject: parts of the cranium and the eyes are included in the red line.

T1w skull stripping: example of a bad subject: portions of the brain have been left out (especially the cerebellum), parts of the cranium and the eyes are included in the red line.

T1w skull stripping: example of a bad subject: portions of the brain have been left out, large parts of the cranium and the eyes are included in the red line. The shape of the brain does not correspond to the shape drawn by the red line.

Summary

Table summary for skull stripping
Good	Bad
The brain is fully inside the red line	Structures like the cranium or the eyes are inside the red line
No important brain structures are outside of the red line red line follows the natural outline of the brain	Important brain structures are missing inside of the red line

Note

-> if only one slice is problematic, it could be an issue related to the visual depiction of the data instead of an issue related to the test subject