79 commits
9fca4c3
Set up basic project skeleton
KnowerSmyf Sep 22, 2023
fe25335
created .gitignore to store data and model files
KnowerSmyf Sep 22, 2023
82802d5
Setup the UNet modules according to the paper
KnowerSmyf Oct 20, 2023
9becad1
Attempted to build the UNet according to paper...
KnowerSmyf Oct 20, 2023
597515b
Implemented Encoder, changed UNet accordingly
KnowerSmyf Oct 22, 2023
da0ebe4
Changed UNet Encoder input args
KnowerSmyf Oct 22, 2023
241cbb7
Implemented upsampling module
KnowerSmyf Oct 22, 2023
c70f54a
Implemented Localisation module
KnowerSmyf Oct 22, 2023
5edc8b1
Fixed afew bugs
KnowerSmyf Oct 22, 2023
6f6ff64
Worked on UNet + created Segmentation class
KnowerSmyf Oct 22, 2023
82d0e90
Finished Unet + Segmentation
KnowerSmyf Oct 22, 2023
3cb04a7
Cleaned up afew things
KnowerSmyf Oct 22, 2023
d9f60b0
Cleaned up the modules
KnowerSmyf Oct 22, 2023
171396a
Made UNet 2D, setup Dataloader, added utils
KnowerSmyf Oct 23, 2023
3b364fe
Did some debugging on modules
KnowerSmyf Oct 24, 2023
a0ff43a
Started implementing the training loop etc
KnowerSmyf Oct 24, 2023
1b3e31d
Realised this was the wrong spot...
KnowerSmyf Oct 24, 2023
e465710
Switched from softmax to sigmoid UNet output
KnowerSmyf Oct 24, 2023
66f6652
Implemented loader methods
KnowerSmyf Oct 24, 2023
ea20f0c
Fixed afew things
KnowerSmyf Oct 24, 2023
b431fce
Improved training loop
KnowerSmyf Oct 24, 2023
78e3bde
Implemented a dice score method
KnowerSmyf Oct 24, 2023
0e99346
set up my model testing
KnowerSmyf Oct 24, 2023
c791103
started my predict.py file
KnowerSmyf Oct 24, 2023
d444b7a
committing a slurm script
KnowerSmyf Oct 24, 2023
0d0f302
fixed bug in dataset
KnowerSmyf Oct 24, 2023
efd6aab
modified ISICDataset __init__ method
KnowerSmyf Oct 24, 2023
6f8b2f9
added a check for inconsistencies in the DB
KnowerSmyf Oct 24, 2023
6308bde
Put some inconsistency fixes and check in
KnowerSmyf Oct 24, 2023
662f1b3
fixed check_consistency bug
KnowerSmyf Oct 24, 2023
8e3db88
fixed consistency logic further
KnowerSmyf Oct 24, 2023
82b143b
put some debugging print statements
KnowerSmyf Oct 24, 2023
c9453bf
Hopefully fixed the incompatibility for dice_loss
KnowerSmyf Oct 24, 2023
01b2f19
the previous 'fix' failed, trying something else
KnowerSmyf Oct 24, 2023
c761df7
Trying out expanding masks channels
KnowerSmyf Oct 24, 2023
f1f20dc
added print statements for debugging
KnowerSmyf Oct 25, 2023
2fa3f4c
testing the save method
KnowerSmyf Oct 25, 2023
940a21e
Reduced the model a lil for debugging efficiency
KnowerSmyf Oct 25, 2023
0ca5655
Training and testing passed debugging!
KnowerSmyf Oct 25, 2023
5c3701d
Added training feedback to loop
KnowerSmyf Oct 25, 2023
204afb6
removed redundent methods
KnowerSmyf Oct 25, 2023
5470f3b
Implemented validation set stuff
KnowerSmyf Oct 25, 2023
1eae806
Included mask channel expansion to validation
KnowerSmyf Oct 25, 2023
0450191
Changed num_classes to 1 and implemented thresholding on output. dice…
KnowerSmyf Oct 26, 2023
075b9ba
fixed bug in random_split
KnowerSmyf Oct 26, 2023
1d6848c
Downsized the debugging dataset, trying to lower gpu memory
KnowerSmyf Oct 26, 2023
48bc775
Using batched operations in dice_loss and dice_coefficient methods to…
KnowerSmyf Oct 26, 2023
bec7d66
Removed non-differentiable binary conversion from training loop, also…
KnowerSmyf Oct 26, 2023
64ca95d
including testing feedback for debugging
KnowerSmyf Oct 26, 2023
97dad35
Gonna try 3 epochs in debugging instead of 1
KnowerSmyf Oct 26, 2023
091e4d9
using 15 epochs for debugging
KnowerSmyf Oct 26, 2023
90627dd
Debugging with epochs = 3, samples = 200, batches = 50
KnowerSmyf Oct 26, 2023
365f642
Trying a ChatGPT generated DiceLoss module as criterion for debugging…
KnowerSmyf Oct 26, 2023
7cfc99a
removed some dice loss debugging methods
KnowerSmyf Oct 26, 2023
6561a0d
experimenting with different batch sizes
KnowerSmyf Oct 26, 2023
035a0d5
trying training batch size of 50, been problematic previously
KnowerSmyf Oct 26, 2023
f40097f
trying batch size of 40
KnowerSmyf Oct 26, 2023
ca6b114
Changed the threshold from 127.5 to 0.5 for binarizing the mask
KnowerSmyf Oct 26, 2023
c0f75e7
The model is finally working, going to try a real training cycle
KnowerSmyf Oct 26, 2023
7a171a8
forgot to declare 'no_improvement', fixed
KnowerSmyf Oct 26, 2023
296a12b
fixed the error in the debugging random_split
KnowerSmyf Oct 26, 2023
8cc9427
Using test partition in slurm script
KnowerSmyf Oct 26, 2023
498f92c
Testing out validation and test batch size = 100
KnowerSmyf Oct 26, 2023
e82203c
The model works, time to train properly!
KnowerSmyf Oct 26, 2023
051f27d
Switched back to vgpu partition
KnowerSmyf Oct 26, 2023
a0d80de
Gave the problem directory a meaningful name
KnowerSmyf Oct 26, 2023
1dda16a
Made the stoppage criteria less strict in training
KnowerSmyf Oct 26, 2023
d951222
deleted the old directory
KnowerSmyf Oct 26, 2023
856d8b8
Worked on my README file
KnowerSmyf Oct 26, 2023
e2f9ba6
Training again, lost a lot of data ):
KnowerSmyf Oct 26, 2023
72b48cb
created individual image and mask pre-processing
KnowerSmyf Oct 26, 2023
dc5ac4d
Added images
KnowerSmyf Oct 26, 2023
4b321fa
Added best_model.pth
KnowerSmyf Oct 26, 2023
1ae7398
Wrote the README file, finished the predict.py file
KnowerSmyf Oct 26, 2023
056988d
Removed debugging stuff
KnowerSmyf Oct 26, 2023
7f30cbf
Improved the README file
KnowerSmyf Oct 26, 2023
76d6260
If the best_model.pth DNE execute train.py
KnowerSmyf Oct 26, 2023
fa0eea2
performance images
KnowerSmyf Oct 26, 2023
dd27659
Fixed afew README image naming bugs
KnowerSmyf Oct 26, 2023
83 changes: 73 additions & 10 deletions recognition/README.md
@@ -1,10 +1,73 @@
# Recognition Tasks
Various recognition tasks solved in deep learning frameworks.

Tasks may include:
* Image Segmentation
* Object detection
* Graph node classification
* Image super resolution
* Disease classification
* Generative modelling with StyleGAN and Stable Diffusion
# ISIC Lesion Segmentation Algorithm

## Description
The ISIC Lesion Segmentation Algorithm was designed to automatically segment skin lesion boundaries from dermatoscopic images. Early detection of malignant skin lesions is crucial in improving the prognosis of skin cancers such as melanoma. The algorithm operates by analyzing input images with a convolutional neural network (CNN) to identify and segment potential skin lesions, distinguishing them from healthy skin. The Dice similarity coefficient is used to compare the algorithm's output to the ground-truth reference mask: it measures the overlap between the two masks as twice the area of their intersection divided by the sum of their areas.
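
For concreteness, a per-batch Dice computation in PyTorch might look like the sketch below. This is a minimal illustration of the metric itself, not necessarily the exact implementation used in this repository:

```python
import torch

def dice_coefficient(pred: torch.Tensor, target: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """Dice = 2|A ∩ B| / (|A| + |B|), averaged over the batch."""
    pred = pred.flatten(1)      # [batch_size, H*W]
    target = target.flatten(1)  # [batch_size, H*W]
    intersection = (pred * target).sum(dim=1)
    dice = (2 * intersection + eps) / (pred.sum(dim=1) + target.sum(dim=1) + eps)
    return dice.mean()
```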

The model is a modified UNet composed of several CNN layers with skip connections, and it uses deep supervision, facilitated by segmentation layers that connect different levels of the network to the final output. The architecture was inspired by the [improved UNet](https://arxiv.org/abs/1802.10508v1) (Figure 1), which proved to be an effective 3D brain tumor segmentation model during the BraTS 2017 challenge. The network is trained on the 2018 ISIC (International Skin Imaging Collaboration) dataset, which contains annotated images of various skin lesions.

![Image of the improved UNet architecture](./UNet_Segmentation_s4745275/images/Figure_1.png)
Figure 1: Improved UNet architecture. Designed by F. Isensee et al.

## Dependencies

To run the ISIC Lesion Segmentation Algorithm, you'll need the following libraries:

- Python (only verified for 3.7+)
- numpy: For numerical computations and some tensor operations
- PyTorch: For building and training the neural network
- matplotlib: For plotting and visualisation
- PIL (Pillow): For loading the dataset and visualisation

To install the dependencies you can use pip, for example: `pip install numpy torch torchvision matplotlib pillow` (torchvision is needed for the transformation pipelines in dataset.py)

## Reproducibility

To run the algorithm and reproduce the results I've obtained, please be aware of the following considerations:

1. Directory Paths for ISICDataset: The paths specified when initializing ISICDataset may need to be modified to match the directory structure on your machine. Ensure that you point it to the correct location where your dataset resides.

2. Model State Dictionary Directory: The directory where the model state dictionary is saved/loaded may differ based on your setup. Adjust the path accordingly to ensure the algorithm can access the model or save it correctly.

Always ensure that you have the necessary permissions to read/write in the specified directories and that the paths are correctly formatted.
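
As a sketch, overriding the default directories looks like the snippet below. All three paths are placeholders; point them at your local copies of the data (the keyword arguments match the `ISICDataset` constructor in dataset.py):

```python
from dataset import ISICDataset, process_and_augment

# All three paths are placeholders; substitute your own locations
dataset = ISICDataset(
    transform=process_and_augment,
    image_dir="/path/to/ISIC2018_Task1-2_Training_Input_x2",
    mask_dir="/path/to/ISIC2018_Task1_Training_GroundTruth_x2",
    inconsistent_path="/path/to/inconsistent_ids.txt",
)
```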

## Usage
#### See predict.py for a full usage demonstration of the model.
### Input
torch.Tensor with shape [batch_size, 6, 256, 256]
- batch_size denotes the number of input images; this is the only dimension that varies
- 6 channels (3 for RGB and 3 for HSV)
- Each image has spatial dimensions 256x256

### Output

torch.Tensor with shape [batch_size, 1, 256, 256]
- batch_size denotes the number of input images; this is the only dimension that varies
- 1 channel containing the per-pixel probability that the pixel belongs to a lesion
- Each mask has spatial dimensions 256x256
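
The snippet below sketches a forward pass. The class name `UNet`, its import path, and its constructor arguments are assumptions on my part; predict.py is the authoritative demonstration:

```python
import torch
from modules import UNet  # module/class names assumed; see predict.py

model = UNet()  # constructor arguments omitted here
model.load_state_dict(torch.load("best_model.pth", map_location="cpu"))
model.eval()

batch = torch.randn(4, 6, 256, 256)  # 4 images, RGB + HSV channels
with torch.no_grad():
    probs = model(batch)           # [4, 1, 256, 256] sigmoid probabilities
masks = (probs > 0.5).float()      # binarise at the 0.5 threshold
```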

## Results
Ultimately, after training for 50 epochs, the model attained an average Dice similarity coefficient of 0.7364 on the test set, which leaves room for improvement. Given more time, I would explore hyperparameter tuning and experiment with alternative optimizers.

![Beautiful demonstration of the model efficacy](./UNet_Segmentation_s4745275/images/Figure_2.png)
Figure 2: An example output from a random sample. Black indicates non-lesion, white indicates lesion. (25 epochs)

That said, the model does exhibit proficiency in segmenting the image. This is evident in Figure 2, where the output mask closely mirrors the true mask, especially around the edges.

## Pre-processing
Various transformation pipelines were implemented for both pre-processing and data augmentation. You can find these in the dataset.py file. They serve to convert the provided images or masks into tensors compatible with the model (refer to the Input and Output section), as well as to normalize the inputs. During training, the process_and_augment pipeline was employed, performing random scalings, flips, rotations, and more to enhance the model's generalizability during learning.
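
A minimal sketch of loading the augmented training data is shown below; the batch size is illustrative, and the default dataset paths assume the cluster layout, so override the directories as described under Reproducibility:

```python
from torch.utils.data import DataLoader
from dataset import ISICDataset, process_and_augment

train_data = ISICDataset(transform=process_and_augment)
train_loader = DataLoader(train_data, batch_size=16, shuffle=True)

images, masks = next(iter(train_loader))
print(images.shape)  # e.g. torch.Size([16, 6, 256, 256])
print(masks.shape)   # one binary channel per image
```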


## Data Splits

The data was partitioned as follows:

- Training: 70%
- Validation: 20%
- Testing: 10%

With this configuration, a significant majority (70%) of the data is allocated for training. Deep learning models, like the UNet I implemented, require a robust volume of data for effective training. By dedicating a larger segment of the dataset to training, the model can encounter a more diverse array of samples, which is essential for discerning and internalizing underlying patterns. Given the dataset's substantial size (over 2500 samples), allocating 70% to training felt appropriate.

The validation set serves a dual purpose: it allows for ongoing evaluation during training and aids in determining when to cease training — a tactic known as early stopping — to mitigate overfitting. A generous validation set is imperative to ensure that the decision to halt training is anchored in a trustworthy performance metric rather than the inconsistencies of a smaller subset.

Finally, the test set offers an objective assessment of the model's performance post-training. While 10% might seem modest, given the dataset's magnitude, it still yields a significant number of samples. Consequently, the test set furnishes a dependable measure of how the model is likely to perform in real-world scenarios.
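
A sketch of producing such a 70/20/10 split with `torch.utils.data.random_split` follows; the fixed seed is my own addition for reproducibility, and train.py holds the split logic actually used:

```python
import torch
from torch.utils.data import random_split
from dataset import ISICDataset, process_and_augment

dataset = ISICDataset(transform=process_and_augment)
n = len(dataset)
n_train, n_val = int(0.7 * n), int(0.2 * n)
n_test = n - n_train - n_val  # remainder, so the three sizes always sum to n

train_set, val_set, test_set = random_split(
    dataset,
    [n_train, n_val, n_test],
    generator=torch.Generator().manual_seed(42),  # assumed seed
)
```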
158 changes: 158 additions & 0 deletions recognition/UNet_Segmentation_s4745275/dataset.py
@@ -0,0 +1,158 @@
"""
File containing the data loaders used for loading and preprocessing the data.
"""

import os
import torch
from utils import RandomCenterCrop, RandomRotate90, DictTransform
from torch.utils.data import Dataset
from torchvision import transforms
import torchvision.transforms.functional as TF
from PIL import Image
import numpy as np

# Default paths for my environment; they may not apply to you, so modify as required
image_path = "/home/groups/comp3710/ISIC2018/ISIC2018_Task1-2_Training_Input_x2"
mask_path = "/home/groups/comp3710/ISIC2018/ISIC2018_Task1_Training_GroundTruth_x2"
inconsistent_path = "/home/Student/s4745275/PatternAnalysis-2023/recognition/UNet_Segmentation_s4745275/inconsistent_ids.txt"


def check_consistency(
image_dir=image_path, mask_dir=mask_path, inconsistent_path=inconsistent_path
):
image_ids = {
img.split(".")[0] for img in os.listdir(image_dir) if img.endswith(".jpg")
}
mask_ids = {
mask.split("_segmentation.")[0]
for mask in os.listdir(mask_dir)
if mask.endswith("_segmentation.png")
}

    # Set differences reveal IDs that exist on one side only
    images_without_masks = image_ids - mask_ids
    masks_without_images = mask_ids - image_ids

    if images_without_masks or masks_without_images:
        inconsistent_ids = images_without_masks.union(masks_without_images)
        # Record the offending IDs so the dataset can exclude them on load
        with open(inconsistent_path, "w") as file:
            for ID in inconsistent_ids:
                file.write(f"{ID}\n")

        print(
            f"Detected {len(inconsistent_ids)} inconsistencies; "
            f"IDs saved to {inconsistent_path}"
        )

    # Return both sets so callers (e.g. handle_inconsistency) can use them
    return images_without_masks, masks_without_images


class ISICDataset(Dataset):
def __init__(
self,
transform,
image_dir=image_path,
mask_dir=mask_path,
inconsistent_path=inconsistent_path,
):
# Load the inconsistent IDs
with open(inconsistent_path, "r") as file:
excluded_ids = set(line.strip() for line in file)

self.image_dir = image_dir
self.mask_dir = mask_dir
self.image_ids = [
img.split(".")[0]
for img in os.listdir(image_dir)
if img.endswith(".jpg") and img.split(".")[0] not in excluded_ids
]
self.transform = transform

def __len__(self):
return len(self.image_ids)

    def handle_inconsistency(self):
        # check_consistency writes the offending IDs to inconsistent_path itself,
        # so here we only need to remove them from the in-memory ID list; this
        # also prevents __getitem__ from retrying the same missing file forever
        images_without_masks, masks_without_images = check_consistency(
            self.image_dir, self.mask_dir
        )
        inconsistent_ids = images_without_masks.union(masks_without_images)
        self.image_ids = [i for i in self.image_ids if i not in inconsistent_ids]

def __getitem__(self, idx):
img_name = os.path.join(self.image_dir, self.image_ids[idx] + ".jpg")
mask_name = os.path.join(
self.mask_dir, self.image_ids[idx] + "_segmentation.png"
)

try:
with Image.open(img_name) as image, Image.open(mask_name) as mask:
image = image.convert("RGB")
mask = mask.convert("L")
sample = {"image": image, "mask": mask}

if self.transform:
sample = self.transform(sample)

# Convert mask to binary 0/1 tensor
sample["mask"] = (torch.tensor(np.array(sample["mask"])) > 0.5).float()

return sample["image"], sample["mask"]

except FileNotFoundError:
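            # A mask or image file is missing: log and prune the inconsistent IDs, then retry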
self.handle_inconsistency()
return self.__getitem__(idx)


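# Inference-time pre-processing: resize to 256x256, convert to a tensor,
# append the HSV representation as three extra channels (6 channels total),
# then normalise every channel to roughly [-1, 1]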
pre_process_image = transforms.Compose(
[
transforms.Resize((256, 256)),
transforms.ToTensor(),
transforms.Lambda(
lambda img_tensor: torch.cat(
[
img_tensor,
TF.to_tensor(TF.to_pil_image(img_tensor).convert("HSV")),
],
dim=0,
)
),
transforms.Normalize(
[0.5, 0.5, 0.5, 0.5, 0.5, 0.5], [0.5, 0.5, 0.5, 0.5, 0.5, 0.5]
),
]
)

pre_process_mask = transforms.Compose(
[transforms.Resize((256, 256)), transforms.ToTensor()]
)


# Transformation pipeline to pre-process and augment the dataset
process_and_augment = transforms.Compose(
[
RandomRotate90(),
RandomCenterCrop(),
DictTransform(transforms.RandomHorizontalFlip()),
DictTransform(transforms.RandomVerticalFlip()),
DictTransform(transforms.Resize((256, 256))),
DictTransform(transforms.ToTensor()),
DictTransform(
transforms.Lambda(
lambda img_tensor: torch.cat(
[
img_tensor,
TF.to_tensor(TF.to_pil_image(img_tensor).convert("HSV")),
],
dim=0,
)
),
False,
),
DictTransform(
transforms.Normalize(
[0.5, 0.5, 0.5, 0.5, 0.5, 0.5], [0.5, 0.5, 0.5, 0.5, 0.5, 0.5]
),
False,
),
]
)