VLMs guessing algorithms to process images


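In this notebook we ask a vision-language model (VLM) to guess which algorithm, here a deep-learning-based Python library, could be used to process an image.

Because the model's answers vary from call to call, we repeat the request 25 times and draw word clouds from the responses.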

from bia_bob import bob, ask_llm
from skimage.data import human_mitosis
import wordcloud
import matplotlib.pyplot as plt

bob.initialize(model="gpt-4o-2024-08-06", vision_model="gpt-4o-2024-08-06")

This notebook may contain text, code and images generated by artificial intelligence. Used model: gpt-4o-2024-08-06, vision model: gpt-4o-2024-08-06, endpoint: None, bia-bob version: 0.27.0.. Do not enter sensitive or private information and verify generated contents according to good scientific practice. Read more: https://github.com/haesleinhuepf/bia-bob#disclaimer

image = human_mitosis()

def ask():
    return ask_llm("""
    You are an excellent bio-image analyst and Python developer. 
    Given an image, describe what you see in one sentence. 
    Afterwards, propose a deep-learning-based Python library.
    Only write the name of the library.
    """, image)

ask()

'The image shows a microscopic view of numerous cell nuclei stained to highlight their structure. \n\nLibrary: Cellpose'
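The response above mixes a description with the library name, so the name has to be extracted before it can be counted. The following is a minimal sketch of one way to do that. It assumes the model prefixes its answer with `Library:` as in the example response, which is not guaranteed for every call. The helper name `extract_library` is hypothetical and not part of bia-bob.

```python
import re

def extract_library(response: str) -> str:
    """Return the proposed library name without trailing punctuation."""
    # Assumption: the answer follows a "Library:" prefix, as in the example response.
    match = re.search(r"Library:\s*(.+)", response)
    if match:
        # "." does not match newlines, so this captures the rest of that line only.
        name = match.group(1)
    else:
        # Fallback: take the last whitespace-separated token of the response.
        tokens = response.split()
        name = tokens[-1] if tokens else ""
    return name.strip().rstrip(".")

example = (
    "The image shows a microscopic view of numerous cell nuclei "
    "stained to highlight their structure. \n\nLibrary: Cellpose"
)
print(extract_library(example))
```

Capturing the whole line after the prefix keeps multi-word names together, whereas taking only the last word would truncate them.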

responses = []
for _ in range(25):
    responses.append(ask())

text = "\n".join(responses)

# Generate word cloud
w = wordcloud.WordCloud(width=800, height=400, colormap='viridis').generate(text)

# Display the word cloud
plt.figure(figsize=(10, 5))
plt.imshow(w, interpolation='bilinear')
plt.axis('off')
plt.show()

../_images/28410808a851e88223b9b9957209b917627d07df08d180cb234ea4755b4eb8b7.png

# Keep only the last space-separated token of each response, which should be the library name
[r.replace("\n", " ").split(" ")[-1] for r in responses]

['CellProfiler.',
 'Cellpose.',
 'Cellpose',
 'StarDist',
 'Cellpose',
 'Cellpose',
 'DeepCell.',
 'Cellpose',
 'Kiosk.',
 'Cellpose',
 'Cellpose',
 'Cellpose',
 'CellPose',
 'Cellpose',
 'Kiosk.',
 'Cellpose',
 'DeepImageJ',
 'DeepCell.',
 'StarDist.',
 'CellPose',
 'CellPose.',
 'Cellpose',
 'Cellpose',
 'Cellpose',
 'CellProfiler.']
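The list above contains the same library in several spellings, for example `Cellpose`, `Cellpose.` and `CellPose`. A word cloud shows frequencies only approximately, so counting the normalized names gives a more exact picture. The sketch below copies the 25 last words from the output above. Stripping trailing periods and ignoring case is an assumption about which variants should count as the same library.

```python
from collections import Counter

# Last words copied verbatim from the output above.
last_words = [
    'CellProfiler.', 'Cellpose.', 'Cellpose', 'StarDist', 'Cellpose',
    'Cellpose', 'DeepCell.', 'Cellpose', 'Kiosk.', 'Cellpose',
    'Cellpose', 'Cellpose', 'CellPose', 'Cellpose', 'Kiosk.',
    'Cellpose', 'DeepImageJ', 'DeepCell.', 'StarDist.', 'CellPose',
    'CellPose.', 'Cellpose', 'Cellpose', 'Cellpose', 'CellProfiler.',
]

# Normalize: drop trailing periods and ignore case (an assumption, see above).
normalized = [word.rstrip(".").lower() for word in last_words]

# Count how often each normalized name was proposed.
counts = Counter(normalized)
for name, n in counts.most_common():
    print(f"{name}: {n}")
```

With this normalization, Cellpose accounts for 16 of the 25 answers. The entry `Kiosk.` appears to be the tail of a multi-word name, which splitting on the last word cannot recover.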

text = "\n".join([r.replace("\n", " ").split(" ")[-1] for r in responses])

# Generate word cloud
w2 = wordcloud.WordCloud(width=800, height=400, colormap='viridis', background_color='white').generate(text)

# Display the word cloud
plt.figure(figsize=(10, 5))
plt.imshow(w2, interpolation='bilinear')
plt.axis('off')
plt.show()

../_images/4cb5860ea03a0f99630cb74498d190a4a690e468155106ad04b94b9de30176c3.png