Object segmentation and classifiation on OpenCL-compatible GPUs

Object segmentation and classifiation on OpenCL-compatible GPUs#

APOC is a Python library for pixel and object classification using machine learning based on pyclesperanto and sklearn. It accelerates Random Forest Classifiers to predict faster on graphics processing units supporting the OpenCL standard. For object segmentation, it uses a pixel classifier and connected components labeling.

If there are errors in the first two cells, containing messages such as “module not found: apoc”, “module not found: pyclesperanto_prototype” or “clGetPlatformIDs failed: PLATFORM_NOT_FOUND_KHR”

Check these installation instructions
Install graphics card drivers if neccessary

from skimage.io import imread, imshow, imsave
import pyclesperanto_prototype as cle
import numpy as np
import apoc

Let’s start with loading an example image and some ground truth:

image = imread('data/blobs.tif')
cle.imshow(image)

../../_images/40f3a9b2cbf4ae2b8018c5e487701ae147e8533eaaf5fe09af90019ba5659332.png

if False: # you can use this to make manual annotations
    import napari

    # start napari
    viewer = napari.Viewer()
    napari.run()

    # add image
    viewer.add_image(image)

    # add an empty labels layer and keep it in a variable
    labels = np.zeros(image.shape).astype(int)
    viewer.add_labels(labels)
else:
    labels = imread('data/blobs_annotations.tif')

#imsave('annotations.tif', labels)
manual_annotations = labels

cle.imshow(manual_annotations, labels=True)

../../_images/e4c672aa55c6103f4e2265b960ffd860ba32f31bdbe692e3fa636519b06580a4.png

To see the image and the annotation on top, we overlay both:

cle.imshow(image, continue_drawing=True)
cle.imshow(manual_annotations, labels=True, alpha=0.7)

../../_images/6f32039ce67a6ed0a9e213037bacab61940b8f360a6285bf24b6674f6a62fe0b.png

Training#

We now train a ObjectSegmenter, which is under the hood a scikit-learn RandomForestClassifier. After training, the classifier will be converted to clij-compatible OpenCL code and save to disk under a given filename.

# define features
features = "gaussian_blur=1 gaussian_blur=5 sobel_of_gaussian_blur=1"

# this is where the model will be saved
cl_filename = 'my_object_segmenter.cl'

# delete classifier in case the file exists already
apoc.erase_classifier(cl_filename)

# train classifier
clf = apoc.ObjectSegmenter(opencl_filename=cl_filename, positive_class_identifier=2)
clf.train(features, manual_annotations, image)

Prediction / segmentation#

The classifier can then be used to classify all pixels in the given image. Starting point is again, the feature stack. Thus, the user must make sure that the same features are used for training and for prediction. Prediction can be done on the CPU using the original scikit-learn code and on the GPU using the generated OpenCL-code. OCLRFC works well if both result images look identical.

segmentation_result = clf.predict(features=features, image=image)
cle.imshow(segmentation_result, labels=True)

../../_images/20fc2f2ed7e927800f500bfdc9618e3df0f3a288df09dbc53a4126e634ae1161.png

Segmentation from a loaded segmenter#

After training, the segmenter has been stored as a file. This file can be loaded again and applied to other data.

# load segmenter from disk
clf = apoc.ObjectSegmenter(opencl_filename=cl_filename)

# apply it
segmentation_result = clf.predict(image=image)
cle.imshow(segmentation_result, labels=True)

Object classification#

When classifying segmented objects, we need to provide three images: The raw intensity image, a label image representing the objects and an annotation where different objects are marked with different labels. The first two are available above. We load the label annotation from disk. In this annotation, lines were drawn through three different kinds of objects:

Small, round
Large, round
Large, elongated

annotation = imread('data/blobs_label_annotation.tif')

cle.imshow(segmentation_result, continue_drawing=True)
cle.imshow(annotation, labels=True, alpha=0.7)

../../_images/0b3efbb815d30816785a9e671ea5b67e440359de901f00e48848ba39160bd2d6.png

# for the classification we define size and shape as criteria
features = 'area mean_max_distance_to_centroid_ratio'

# This is where the model will be saved
cl_filename_object_classifier = "my_object_classifier.cl"

# delete classifier in case the file exists already
apoc.erase_classifier(cl_filename_object_classifier)

# train the classifier
classifier = apoc.ObjectClassifier(cl_filename_object_classifier)
classifier.train(features, segmentation_result, annotation, image)

# determine object classification
classification_result = classifier.predict(segmentation_result, image)
cle.imshow(classification_result, labels=True)

../../_images/04e06a9edbf667323c50129a0aec28d6a6c1e71d8987e222eb12ed5f9459e4b5.png

Exercise#

Use the graphical user interface of napari and change blobs_label_annotation.tif so that the classifier trained here differentiates two classes: Round and elongated objects.

Optional: Use the graphical user interface of APOC.