Medical Image Segmentation with U-Net

Representation Learning 2022 Computer Vision Prototype

Overview

This project explored deep-learning methods for medical image segmentation, with a focus on CT image masks and convolutional encoder-decoder architectures.

The workflow built a segmentation pipeline from image-mask matching and preprocessing to U-Net-style model construction, training diagnostics and pixel-level evaluation with Intersection over Union.

The project should be read as an applied computer-vision prototype, not as a clinical diagnostic system.

Problem

Medical image segmentation requires predicting a class label for each pixel or voxel region of an image. In CT imaging, this can be used to identify lesion regions or anatomical structures from grayscale images and corresponding masks.

The goal was to construct a deep-learning workflow that could learn from paired CT images and segmentation masks, then evaluate predicted masks against held-out ground truth.

Data Preparation

The pipeline matched input CT images with their corresponding segmentation masks, filtered image paths using shared file-name structure and converted images into NumPy arrays suitable for neural-network training.

Model Architecture

The core model was a U-Net-style convolutional encoder-decoder architecture. The contraction path used convolution, dropout and max-pooling layers to extract hierarchical image features. The expansion path used transposed convolutions and skip connections to recover spatial resolution.

CT image
  -> convolution / pooling encoder
  -> latent feature representation
  -> transposed-convolution decoder
  -> per-pixel softmax segmentation mask

The output layer produced pixel-level class probabilities through a softmax activation.

Training and Evaluation

The model was trained on an image-mask split and evaluated on held-out data. Training diagnostics included loss and accuracy curves, while segmentation quality was assessed using mean Intersection over Union and class-level IoU calculations.

Implemented Elements

Evaluation Limits

The project was an educational and applied prototype. It was not validated for clinical use.

Modern Extension

A modern version of this project would strengthen the medical-imaging evaluation protocol and compare more robust segmentation architectures.

Technologies and Methods Used

Resources

Code and raw medical-image data are not public.

An anonymized technical note can be prepared upon request.