Networking with EuroCC Cyprus: an ‘intermediate’ course to increase the efficiency of LLMs with PyTorch

On April 19, 2024, we will participate in the training event “Accelerating Generative AI with PyTorch”, organized by the Cyprus Competence Centre, featuring Ivan Gentile, a data scientist from IFAB (International Foundation Big Data and Artificial Intelligence for Human Development).

The event will take place at the Cyprus Institute from 10:00 AM to 1:00 PM. The speakers, topics, and prerequisites are listed below.

Speakers:

  • Mr Christodoulos Stylianou, Research Engineer (CaSToRC, The Cyprus Institute)
  • Mr Ivan Gentile, Data Scientist (IFAB – NCC Italy)
  • Dr Charalambos Chrysostomou, Associate Research Scientist (CaSToRC, The Cyprus Institute)

The tutorial presents optimization techniques for Llama, a foundational Large Language Model (LLM) based on the Transformer architecture, analogous to the GPT series. These models are noted for their human-like text generation, but their size and computational demands make them difficult to run efficiently and at scale. The session aims to improve their inference efficiency with PyTorch-native optimization strategies: model compilation, GPU quantization, speculative decoding, and tensor parallelism. Participants will be able to try the proposed optimizations live on a supercomputer.
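To give a flavour of one of the techniques listed above, here is a minimal sketch of the idea behind weight quantization, written in plain Python rather than PyTorch so that it runs anywhere. It is not taken from the course material: the function names `quantize_int8` and `dequantize_int8` and the sample weights are illustrative assumptions, and the scheme shown (symmetric, one scale for the whole tensor) is only one of several possible variants.

```python
# Illustrative sketch only: symmetric per-tensor int8 quantization.
# Function names and sample values are hypothetical, not from the course.

def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using a single shared scale."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float values from the stored integers."""
    return [v * scale for v in quantized]


weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# The rounding error per weight is at most half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Storing 8-bit integers plus one scale instead of 16- or 32-bit floats reduces memory traffic, which is typically the bottleneck in LLM text generation; the price is the small rounding error measured by `max_error`.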

Pre-requisites:

  • Basic knowledge of deep learning and Python programming
  • Some experience with HPC systems (Linux shell, Slurm) is helpful but not mandatory

Participants are expected to bring a laptop from which they can access the HPC system.
