You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Figures added, pretrained weights link added, minor fixes (#456)
* Figures added, pretrained weights link added, minor fixes
Signed-off-by: vnath <vnath@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: vnath <vnath@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
The dataset contains a total of 771 3D CT Volumes. The volumes were split into training and validation sets
22
-
of 600 and 171 3D volumes correspondingly. The data is available for download at this [link](https://wiki.cancerimagingarchive.net/display/Public/CT+Images+in+COVID-19).
23
-
If this dataset is being used in your work
24
-
please use [1] as reference. A json file is provided which contains the suggested training and validation split
25
-
in the json_files directory of the self-supervised training tutorial.
26
-
27
-
Fine-tuning Dataset: The dataset from Beyond the Cranial Vault Challenge [(BTCV)](https://www.synapse.org/#!Synapse:syn3193805/wiki/217789)
28
-
2015 hosted at MICCAI was used as a fully supervised fine-tuning task on the pre-trained weights. The dataset
29
-
consists of 30 3D Volumes with annotated labels of upto 13 different organs [2]. There are 3 json files provided in the
30
-
json_files directory for the dataset. They correspond to having different number of training volumes ranging 6, 12 and 24.
31
-
All 3 json files have the same validation split.
28
+
of 600 and 171 3D volumes correspondingly. The data is available for download at this
The pretraining pipeline has two aspects to it. The first it uses augmentation to mutate the data and the second is
63
-
it utilizes to a regularized [constrastive loss](https://docs.monai.io/en/latest/losses.html#contrastiveloss)[3] to
64
-
learn feature representations of the unlabeled data. The multiple augmentations are applied on a randomly selected 3D
65
-
foreground patch from a 3D volume. Two augmented views of the same 3D patch are generated for the constrastive loss as
66
-
it functions by drawing the two augmented views closer to each other.
71
+
The pre-training pipeline has two aspects to it (Refer figure shown below). First, it uses augmentation (top row) to
72
+
mutate the data and the second is it utilizes to a regularized
73
+
[constrastive loss](https://docs.monai.io/en/latest/losses.html#contrastiveloss)[3] to learn feature representations
74
+
of the unlabeled data. The multiple augmentations are applied on a randomly selected 3D foreground patch from a 3D
75
+
volume. Two augmented views of the same 3D patch are generated for the contrastive loss as it functions by drawing
76
+
the two augmented views closer to each other if the views are generated from the same patch, if not then it tries to
77
+
maximize the disagreement. The CL offers this functionality on a mini-batch.
78
+
79
+

67
80
68
-
The augmentations mutate the 3D patch in different ways and the primary task of the network is to reconstruct
81
+
The augmentations mutate the 3D patch in various ways, the primary task of the network is to reconstruct
69
82
the original image. The different augmentations used are classical techniques such as in-painting [1], out-painting [1]
70
83
and noise augmentation to the image by local pixel shuffling [2]. The secondary task of the network is to simultaneously
71
-
reconstruct the two augmented views as similar to each other as possible via the regularized contrastive loss [3] as it's
72
-
objective is to maximize the agreement. The term regularized has been used here because the contrastive loss is adjusted
84
+
reconstruct the two augmented views as similar to each other as possible via regularized contrastive loss [3] as it's
85
+
objective is to maximize the agreement. The term regularized has been used here because contrastive loss is adjusted
73
86
by the reconstruction loss as a dynamic weight itself.
74
87
75
-
The below example image depicts the usage of the augmentations pipeline where two augmented views are drawn of the same
88
+
The below example image depicts the usage of the augmentation pipeline where two augmented views are drawn of the same
76
89
3D patch:
77
90
78
-

91
+

79
92
80
-
The three columns are the three views of axial, coronal, sagittal of a randomly selected patch of size 96x96x96.
81
-
The top row is the ground truth image which is not augmented. The middle row is the same image when mutated by augmentations.
82
-
The bottom row is a 2nd view of the same patch but augmented with different probabilities
83
-
The objective of the SSL network is to reconstruct the original top row image from the first view. The contrastive loss
84
-
is driven by maximizing agreement of the reconstruction based on input of the two augmented views .
93
+
Multiple axial slice of a 96x96x96 patch are shown before the augmentation (Ref Original Patch in the above figure).
94
+
Augmented View 1 & 2 are different augmentations generated via the transforms on the same cubic patch. The objective
95
+
of the SSL network is to reconstruct the original top row image from the first view. The contrastive loss
96
+
is driven by maximizing agreement of the reconstruction based on input of the two augmented views.
97
+
`matshow3d` from `monai.visualize` was used for creating this figure, a tutorial for using can be found [here](https://github.com/Project-MONAI/tutorials/blob/master/modules/transform_visualization.ipynb)
85
98
86
99
References:
87
100
@@ -104,23 +117,25 @@ Batch size: 4 3D Volumes (Total of 8 as 2 samples were drawn per 3D Volume) \
104
117
Loss Function: L1
105
118
Contrastive Loss Temperature: 0.005
106
119
107
-
Training Hyper-parameters for Fine-tuning BTCV task (All settings have been kept consistent with prior [UNETR 3D
120
+
Training Hyper-parameters for Fine-tuning BTCV task (All settings have been kept consistent with prior
0 commit comments