Setting up your computer#
This chapter provides instructions for preparing your computer for the training school. We recommend installing a VPN client as described below a few days before the training. If you run into issues, please reach out to the training organizers.
VPN and access to the HPC cluster#
During the training school, you will have access to the High Performance Computing (HPC) infrastructure of TU Dresden to run the tutorial notebooks. To connect to the HPC cluster, you need to set up a connection to the TU Dresden VPN.
It is recommended to install the eduVPN client, which is available for all major operating systems. To install eduVPN, download the client for your operating system and follow the instructions provided.
To set up the VPN connection via eduVPN, you need a TUD guest account, which will be provided on the first day of the training. Once you have it, follow the instructions provided.
The HPC cluster / Jupyter Hub is then accessible via https://jupyterhub.hpc.tu-dresden.de (provide the same TUD guest account credentials as for the VPN).
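If the login page does not load, a quick check can tell you whether the Jupyter Hub can be reached from your machine at all. The following is a minimal sketch, assuming the hub only answers while the VPN connection is active; the function name `is_reachable` and the timeout value are illustrative choices and not part of the official instructions.

```python
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 5.0) -> bool:
    """Return True if the server behind `url` answers an HTTP request at all."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just not with a success status (e.g. 403).
        return True
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, no route, or timeout.
        return False


if __name__ == "__main__":
    hub = "https://jupyterhub.hpc.tu-dresden.de"
    print("Jupyter Hub reachable:", is_reachable(hub))
```

A result of `False` usually points to the VPN connection rather than to your account, so checking the eduVPN client first is a reasonable next step.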
Setting up your workspace#
This will be done together on the first day of the training school.
1. Connect to the TUD VPN and go to https://jupyterhub.hpc.tu-dresden.de to log in with your training account.
2. Use the provided spawn links to allocate HPC resources and spawn a Jupyter Lab server. After being assigned a spot on the HPC cluster, you will be greeted by a Jupyter Lab environment.
3. Open a terminal and copy the Jupyter notebook for the HPC setup to your home directory:
cp /data/horse/ws/mats024c-iom-ai/HPC-setup.ipynb .
4. Close the terminal. In the file browser on the left side, click the refresh icon and open HPC-setup.ipynb.
5. When asked to “Select Kernel”, leave the default Python 3 kernel and click “Select”.
6. Follow the steps explained in the notebook.
This setup is only required once.
For subsequent sessions, you just follow steps 1 & 2 to start Jupyter Lab.
Registration at Helmholtz Blablador#
In the training school, we also want to take a short look at how Large Language Models (LLM) and AI chatbots based on them can support us in coding and data analysis. Helmholtz-Jülich provides a free LLM service for researchers called “Blablador”. In order to use this service, prior registration and creation of an API key are necessary.
Please follow the steps on this website as described:
Register at Helmholtz Gitlab (Codebase)
Open the Helmholtz Codebase
Log in via “Sign in with Helmholtz ID” and select your university to authenticate with your university account
Create an API Key (Token)
ATTENTION: this API key should be treated like a personal password and should not be made available to anyone else!
Open your profile page
Select “Access Token” on the left side
Under “Personal access tokens”, create a new token via “Add new token”
Give it a descriptive name, e.g. “Blablador API key”
Select “api” as scope
Copy the displayed key and save it securely (e.g. password manager) - it will not be displayed again after confirmation
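Because the key must stay private, it should not be pasted directly into notebooks or scripts. A minimal sketch of one common pattern follows, assuming you have stored the key in an environment variable before starting Jupyter. The variable name `BLABLADOR_API_KEY` and the helper names `load_api_key` and `mask` are hypothetical and not prescribed by Blablador or by the course.

```python
import os


def load_api_key(env_var: str = "BLABLADOR_API_KEY") -> str:
    """Read the API key from an environment variable instead of hard-coding it.

    The default variable name is an assumption made for this example.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Environment variable {env_var} is not set; "
            "define it in your shell before starting Jupyter."
        )
    return key


def mask(key: str, visible: int = 4) -> str:
    """Return a version of the key that is safe to print or log."""
    if len(key) <= visible:
        return "*" * len(key)
    return "*" * (len(key) - visible) + key[-visible:]
```

With this pattern, a notebook can be shared or committed without exposing the key, and `mask(load_api_key())` lets you confirm that a key was found without displaying it in full.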
Optional for local execution#
Setting up Python and virtual environments#
NOTE: To prepare for the course, you do not need to install Python or Conda on your computer, because we will work in a remote HPC environment to which you will be given access. However, if you want to experiment with the notebooks and tools on your own computer, follow the instructions below.
Background: When working with Python, we will make use of software libraries (packages) that provide additional functionality, such as machine learning, and these packages need to be organized. One way of doing this is by managing virtual Python environments, and one tool for this is Conda. A virtual environment can be thought of as a virtual desktop or virtual computer; once activated, it is accessible via the terminal and other software. Packages installed into one virtual environment are not accessible from other environments. If a virtual environment breaks, e.g. because incompatible packages were installed, you can simply remove it and start over.
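To see which environment a Python session is actually running in, you can inspect the interpreter itself. The following is a minimal sketch; the function name `describe_environment` is an illustrative choice. It relies on the `CONDA_DEFAULT_ENV` variable, which Conda sets when an environment is activated.

```python
import os
import sys


def describe_environment() -> dict:
    """Collect basic facts about the Python interpreter that is currently running."""
    return {
        # Path of the interpreter executable in use.
        "executable": sys.executable,
        # Root directory of the Python installation or environment.
        "prefix": sys.prefix,
        # Name set by `conda activate`; None outside an activated Conda environment.
        "conda_env": os.environ.get("CONDA_DEFAULT_ENV"),
        "python_version": sys.version.split()[0],
    }


if __name__ == "__main__":
    for name, value in describe_environment().items():
        print(f"{name}: {value}")
```

Running this in two different environments shows different `prefix` paths, which is why a package installed in one environment is not found in the other.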
Step 1: Install Conda-forge / Miniforge#
We recommend the small-footprint Conda distribution Miniforge. For ease of use, we recommend installing it for your user only and adding Conda to the PATH variable during installation.
Please refer to the official installation instructions!
Check whether conda is available in your terminal via conda --version
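The same check can be scripted, for example at the top of a setup notebook. This is a minimal sketch, assuming `conda` was added to the PATH during installation; the function name `conda_version` is illustrative.

```python
import shutil
import subprocess
from typing import Optional


def conda_version() -> Optional[str]:
    """Return the output of `conda --version`, or None if conda is not usable."""
    conda = shutil.which("conda")
    if conda is None:
        # conda is not on the PATH of this process.
        return None
    result = subprocess.run(
        [conda, "--version"], capture_output=True, text=True, check=False
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip()


if __name__ == "__main__":
    version = conda_version()
    print(version if version else "conda was not found on PATH")
```

If this reports that conda was not found although Miniforge is installed, opening a new terminal or revisiting the PATH option of the installer is a sensible first step.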
Step 2: Create a virtual environment with all packages#
Download File
Download this Conda configuration file, which defines the environment: conda-iom-env.yml
Create Virtual Environment
Open a terminal or command prompt
Navigate to the file’s location
Execute this command:
conda env update -f conda-iom-env.yml --prune
With this command:
Conda creates a new environment with the name “iom-env” or updates an already existing one with the same name
Conda downloads and installs all defined software packages in the environment
Troubleshooting
If problems occur, e.g. with an already existing environment “iom-env”, you can remove it completely and start from scratch
Execute this command:
conda remove --name iom-env --all
Then, perform the above steps for a fresh installation of the environment
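Before removing or recreating anything, it can be useful to check whether an environment named “iom-env” already exists. The sketch below assumes that `conda env list --json` reports environment directories under the key `envs`; the helper names `env_names` and `list_conda_env_paths` are illustrative.

```python
import json
import shutil
import subprocess
from pathlib import Path
from typing import List


def env_names(env_paths: List[str]) -> List[str]:
    """Reduce environment directories to their folder names.

    Note: the base installation appears under its folder name, not as "base".
    """
    return [Path(p).name for p in env_paths]


def list_conda_env_paths() -> List[str]:
    """Ask conda for all known environment directories (empty if unavailable)."""
    conda = shutil.which("conda")
    if conda is None:
        return []
    result = subprocess.run(
        [conda, "env", "list", "--json"],
        capture_output=True, text=True, check=False,
    )
    if result.returncode != 0:
        return []
    return json.loads(result.stdout).get("envs", [])


if __name__ == "__main__":
    exists = "iom-env" in env_names(list_conda_env_paths())
    print("iom-env exists:", exists)
```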
Test the Virtual Environment
Open a terminal or command prompt
Execute the following command:
conda activate iom-env
The environment is now activated.
Start Jupyter Lab for testing:
jupyter lab
Your browser should open with Jupyter Lab. You can now execute the training notebooks locally.
Afterwards, Jupyter Lab can be shut down (e.g., with Ctrl-C in the terminal).
To deactivate the environment, run:
conda deactivate
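While the environment is still active, you can also confirm from within Python that the packages you expect are importable. This is a minimal sketch; the function name `missing_packages` is illustrative, and the names you pass in are up to you, since the actual package list is defined in conda-iom-env.yml and is not reproduced here. The helper expects top-level import names such as `jupyterlab`.

```python
import importlib.util
from typing import Iterable, List


def missing_packages(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # "jupyterlab" is checked because Jupyter Lab is started in the test above.
    missing = missing_packages(["jupyterlab"])
    print("Missing packages:", missing if missing else "none")
```

An empty result means every listed module can be located; it does not guarantee that the installed versions match the environment file.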