Kiara LLM endpoint#

In this notebook we will use our still experimental LLM infrastructure. To use it, you must set two environment variables, KIARA_API_KEY and KIARA_LLM_SERVER. The endpoint is compatible with the OpenAI API; we just change the base_url.

import os
import openai
openai.__version__
'1.90.0'
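
Before calling the endpoint, it can be useful to confirm that both environment variables are actually set. This is just a small sanity-check sketch using the variable names introduced above:

for variable in ("KIARA_API_KEY", "KIARA_LLM_SERVER"):
    if variable not in os.environ:
        print(f"Please set the {variable} environment variable.")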
def prompt_kiara(message: str, model="ollama-llama3-3-70b"):
    """A prompt helper function that sends a message to the Kiara LLM server
    and returns only the text response.
    """
    # convert a plain string into the OpenAI messages format if necessary
    if isinstance(message, str):
        message = [{"role": "user", "content": message}]
    
    # set up the connection to the LLM; KIARA_LLM_SERVER is expected to end with a slash
    client = openai.OpenAI(base_url=os.environ.get('KIARA_LLM_SERVER') + "api/",
                           api_key=os.environ.get('KIARA_API_KEY'))
    
    response = client.chat.completions.create(
        model=model,
        messages=message
    )
    
    # extract the answer text
    return response.choices[0].message.content
prompt_kiara("Hi!")
"It's nice to meet you. Is there something I can help you with, or would you like to chat?"

Exercise#

List the models available in the endpoint and try them out by specifying them when calling prompt_kiara().

client = openai.OpenAI(base_url=os.environ.get('KIARA_LLM_SERVER') + "api/",
                       api_key=os.environ.get('KIARA_API_KEY'))

print("\n".join([model.id for model in client.models.list().data]))
ollama-llama3-3-70b
vllm-baai-bge-m3
vllm-deepseek-coder-33b-instruct
vllm-deepseek-r1-distill-llama-70b
vllm-llama-3-3-nemotron-super-49b-v1
vllm-llama-4-scout-17b-16e-instruct
vllm-meta-llama-llama-3-3-70b-instruct
vllm-mistral-small-24b-instruct-2501
vllm-multilingual-e5-large-instruct
vllm-nvidia-llama-3-3-70b-instruct-fp8
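
To try out one of these models, pass its id via the model parameter. For example (model id taken from the list above; the response will vary):

prompt_kiara("Hi!", model="vllm-mistral-small-24b-instruct-2501")

Note that entries such as vllm-baai-bge-m3 and vllm-multilingual-e5-large-instruct are presumably embedding models and will not respond to chat-completion requests.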