Skip to content

Commit f30eba0

Browse files
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
1 parent c93248c commit f30eba0

File tree

2 files changed

+8
-10
lines changed

2 files changed

+8
-10
lines changed
Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,24 +1,23 @@
11
# Preprocessing Open-I Dataset
22

3-
The Open-I dataset provides a collection of 3,996 radiology reports
4-
with 8,121 associated images in PA, AP and lateral views. In this tutorial, we utilize the images from fronal view with their corresponding reports for training and
5-
evaluation of the TransChex model. The chest x-ray images and reports are originally from the Indiana University hospital (see the licencing information below).
6-
The 14 finding categories in this work include Atelectasis, Cardiomegaly, Consolidation, Edema, Enlarged-Cardiomediastinum, Fracture, Lung-Lesion, Lung-Opacity, No-Finding, Pleural-Effusion, Pleural-Other, Pneumonia, Pneumothorax and Support-Devices. More information can be found in the following link:
3+
The Open-I dataset provides a collection of 3,996 radiology reports
4+
with 8,121 associated images in PA, AP and lateral views. In this tutorial, we utilize the images from fronal view with their corresponding reports for training and
5+
evaluation of the TransChex model. The chest x-ray images and reports are originally from the Indiana University hospital (see the licencing information below).
6+
The 14 finding categories in this work include Atelectasis, Cardiomegaly, Consolidation, Edema, Enlarged-Cardiomediastinum, Fracture, Lung-Lesion, Lung-Opacity, No-Finding, Pleural-Effusion, Pleural-Other, Pneumonia, Pneumothorax and Support-Devices. More information can be found in the following link:
77
https://openi.nlm.nih.gov/faq
88

99
License: Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
1010

1111
In this section, we provide the steps that are needed for preprocessing the Open-I dataset for
1212
the multi-label disease classification tutorial using TransCheX model. As a result, once the following steps are
13-
completed, the dataset can be readily used for the tutorial.
13+
completed, the dataset can be readily used for the tutorial.
1414

1515
### Preprocessing Steps
16-
1) Create a new folder named 'monai_data' for downloading the raw data and preprocessing.
16+
1) Create a new folder named 'monai_data' for downloading the raw data and preprocessing.
1717
2) Download the chest X-ray images in PNG format from this [link](https://openi.nlm.nih.gov/imgs/collections/NLMCXR_png.tgz). Copy the downloaded file (NLMCXR_png.tgz)
18-
to 'monai_data' directory and extract it.
18+
to 'monai_data' directory and extract it.
1919
3) Download the reports in XML format from this [link](https://openi.nlm.nih.gov/imgs/collections/NLMCXR_reports.tgz). Copy the downloaded file (NLMCXR_reports.tgz)
2020
to 'monai_data' directory and extract it.
2121
4) Download the splits of train, validation and test datasets from this [link](https://drive.google.com/u/1/uc?id=1_CThgwbDQPeTrr2Gvi6zflqr32_5t87j&export=download). Copy the downloaded file (TransChex_openi.zip)
2222
to 'monai_data' directory and extract it.
23-
5) Run 'preprocess_openi.py' to process the images and reports.
24-
23+
5) Run 'preprocess_openi.py' to process the images and reports.

multimodal/openi_multilabel_classification_transchex/preprocess_openi.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -117,4 +117,3 @@ def create_report(img_names_list_, report_list_, gt_list_, save_add):
117117
create_report(dataset["img_name"], dataset["report"], dataset["gt"], dataset["save_add"])
118118

119119
print('Processed Dataset Files Are Saved !')
120-

0 commit comments

Comments
 (0)