Implementation of U-Net and SegNet for building segmentation

Last update: Dec 07, 2022

Overview

Specialized project

Created by Katrine Nguyen and Martin Wangen-Eriksen as a part of our specialized project at Norwegian University of Science and Technology (NTNU).

Models

Most of our code and the U-net model is significantly inspired by this project Unet-for-Person-Segmentation. The SegNet model we created on our own based on other implementations of SegNet in Tensorflow.

Data

The model is trained and tested on Massachusetts Buildings Dataset from Kaggle. The original images where 1500X1500 pixels each over an area of 1500x1500 meters (1mx1m resolution). The original 137 images were cropped into 64x64 pixels and images without building were filtered out.

To make the masks compatible with our model the masks was changed from white (255,255,255) labels to greyscale with value 1. This is done in image_fix.py found in the repo.

Folder structure

Images and masks are saved in local directories and used in data.py and test.py. This is of course possible to change, however if you want to use the exact same code you can follow this folder structure.


.
├── ...
├── building-segmentation                # Directory for all images
│   ├── Images                           # Directory for raw images
│   │   ├── cropped_images_train_64      # Directory for cropped images where number specifies resolution, containg .jpg
│   │   ├── cropped_images_train_128     # Directory for cropped images where number specifies resolution, containg .jpg 
│   │   └── ...                          # More directories with other resolutions
│   ├── Masks                            # Directory for all maskes
│   │   ├── cropped_masks_train_64       # Directory for cropped masks where number specifies resolution, containg .jpg
│   │   ├── cropped_masks_train_128      # Directory for cropped masks where number specifies resolution, containg .jpg 
│   │   └── ...                          # More directories with other resolutions
│   └── Test                             # Miscellaneous information
│       ├── test_64                      # Directory for images where number specifies resolution, containing .jpg
│       └── ...                          # More directories with other resolutions
└── ...

# data.py
    images = glob(os.path.join(dataset_path, "images/cropped_images_train_64/*"))
    masks = glob(os.path.join(dataset_path, "masks/cropped_masks_train_64/*"))
    
    # In main:
        dataset_path = "building-segmentation"
    
# test.py
    test_images = glob("building-segmentation/test/test_64/*")

Implementation of U-Net and SegNet for building segmentation

Related tags

Overview

Specialized project

Models

Data

Folder structure

Running the project

Requirements

Training

Testing

Owner

Martin.w-e

A Player for Kanye West's Stem Player. Sort of an emulator.

Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation

CowHerd is a partially-observed reinforcement learning environment

PyTorch implementation of Decoupling Value and Policy for Generalization in Reinforcement Learning

GRaNDPapA: Generator of Rad Names from Decent Paper Acronyms

An end-to-end project on customer segmentation

Improving Query Representations for DenseRetrieval with Pseudo Relevance Feedback:A Reproducibility Study.

Simple PyTorch implementations of Badnets on MNIST and CIFAR10.

Neural Network to colorize grayscale images

PyTorch DepthNet Training on Still Box dataset

Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

Implementation of Fast Transformer in Pytorch

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

Progressive Domain Adaptation for Object Detection

Optimus: the first large-scale pre-trained VAE language model

Production First and Production Ready End-to-End Speech Recognition Toolkit

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Deep motion generator collections

NeuroGen: activation optimized image synthesis for discovery neuroscience