Skip to main content

Ctrl+K

Site Navigation

Setting up your computer
Prompting basics
Accessing LLMs
Chatbots
Function / Tool calling

Image generation

Image manipulation

Generating Videos, Books, Slides and online content

Synthesizing data

Code generation

Vision language models

Auto-generating PowerPoint files with chatGPT and Dall-E

Retrieval Augmented Generation

Solving github issues

Model Fine-Tuning in the cloud

Model Fine-Tuning locally

Benchmarking Vision Language Models

Site Navigation

Setting up your computer
Prompting basics
Accessing LLMs
Chatbots
Function / Tool calling

Image generation

Image manipulation

Generating Videos, Books, Slides and online content

Synthesizing data

Code generation

Vision language models

Auto-generating PowerPoint files with chatGPT and Dall-E

Retrieval Augmented Generation

Solving github issues

Model Fine-Tuning in the cloud

Model Fine-Tuning locally

Benchmarking Vision Language Models

Ctrl+K

Generative Artificial Intelligence Notebooks

Setup

Setting up your computer
- Installation instructions for Scientific Computing Uni Leipzig (paula)

LLM basics

Prompting basics
Accessing LLMs
Chatbots
- Programming an LLM-based chatbot
- A Chatbot GUI
Function / Tool calling
- Function calling using ollama
- Function calling using ScaDS.AI’s LLM service

Multi-Modal LLMs

Image generation
Image manipulation
Generating Videos, Books, Slides and online content
- Video generation
Synthesizing data
- Generating synthetic customer data
- Combining LLMs with Random number generators for data generation
Code generation
Vision language models

Advanced Prompt Engineering

Auto-generating PowerPoint files with chatGPT and Dall-E
Retrieval Augmented Generation
Chat with Docs
Solving github issues
Agents
Model Fine-Tuning in the cloud
Model Fine-Tuning locally
Benchmarking
Benchmarking Vision Language Models

Links

Imprint

repository
open issue

.ipynb

Moondream for bounding-box segmentation

Contents

Human mitosis

Moondream for bounding-box segmentation#

In this notebook we will use the vision language model moondream to determine bounding-boxes around objects.

Installation (Windows):

Download vips-dev-w64-all-8.16.1.zip from here, unzip it, and add its subfolder bin to the PATH environment variable.
pip install einops pyvips

from transformers import AutoModelForCausalLM, AutoTokenizer
from PIL import Image
from image_utilities import numpy_to_bytestream, extract_json, generate_spots
from tqdm import tqdm
import stackview

model = AutoModelForCausalLM.from_pretrained(
    "vikhyatk/moondream2",
    revision="2025-04-14",
    trust_remote_code=True,
    # Comment to run on CPU. To use the GPU, you need about 5 GB of GPU Memory.
    device_map={"": "cuda"}
)

Human mitosis#

import stackview
from skimage import data
import numpy as np

# Load the human mitosis dataset
image = data.human_mitosis()[:100, :100]

# Display the image
stackview.insight(image)

shape	(100, 100)
dtype	uint8
size	9.8 kB
min	7
max	88

pil_image = Image.fromarray(image)

encoded_image = model.encode_image(pil_image)

bb = model.detect(encoded_image, "Mark all the bright blobs individually")["objects"]
print(f"Found {len(bb)} bright spot(s)")

Found 11 bright spot(s)

for b in bb:
    b["x"] = b["x_min"]
    b["y"] = b["y_min"]
    b["width"] = b["x_max"]-b["x_min"]
    b["height"] = b["y_max"]-b["y_min"]
bb[:2]

[{'x_min': 0.2584344260394573,
  'y_min': 0.0017580389976501465,
  'x_max': 0.3743780739605427,
  'y_max': 0.09199196100234985,
  'x': 0.2584344260394573,
  'y': 0.0017580389976501465,
  'width': 0.11594364792108536,
  'height': 0.09023392200469971},
 {'x_min': 0.42661894857883453,
  'y_min': 0.0029355064034461975,
  'x_max': 0.5265060514211655,
  'y_max': 0.1220644935965538,
  'x': 0.42661894857883453,
  'y': 0.0029355064034461975,
  'width': 0.09988710284233093,
  'height': 0.1191289871931076}]

stackview.add_bounding_boxes(image, bb)

shape	(100, 100, 3)
dtype	uint8
size	29.3 kB
min	0
max	255

previous

Benchmarking spot counting using Vision Language Models

next

Claude VLM for bounding-box segmentation

On this page

Human mitosis

By Robert Haase

Last updated on 2025-07-18.

Copyright: Licensed CC-BY 4.0 unless mentioned otherwise. Contributions and feedback are welcome.