The course will begin with a review of the basics of deep learning, NLP, and PyTorch. The Transformer architecture and its self-attention mechanism will then be introduced and coded, and a small but complete autoregressive generative language model, similar in spirit to GPT-2, will be built. This will allow us to understand several relevant aspects of more sophisticated pre-trained LLMs such as GPT-4, Mistral, or Llama. Afterwards, we will play with open-source pre-trained LLMs and, if possible, fine-tune one of them. In the last part of the course, we will explore some emergent abilities of LLMs that are interesting also from a physical point of view, and touch upon multi-agent systems and their collective behaviour.
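As a preview of the kind of code the course will develop, here is a minimal sketch of causal scaled dot-product self-attention in PyTorch. All function names, variable names, and dimensions below are illustrative assumptions, not taken from the course material.

```python
import torch
import torch.nn.functional as F

def self_attention(x, w_q, w_k, w_v):
    """Causal scaled dot-product self-attention over a sequence x of shape (T, d)."""
    q = x @ w_q  # queries, (T, d)
    k = x @ w_k  # keys,    (T, d)
    v = x @ w_v  # values,  (T, d)
    scores = q @ k.T / k.shape[-1] ** 0.5  # (T, T) similarity matrix
    # Causal mask: each position attends only to itself and earlier positions,
    # as required by an autoregressive model in the style of GPT-2.
    mask = torch.tril(torch.ones_like(scores, dtype=torch.bool))
    scores = scores.masked_fill(~mask, float("-inf"))
    weights = F.softmax(scores, dim=-1)  # attention weights, each row sums to 1
    return weights @ v                   # weighted mix of values, (T, d)

# Tiny usage example with random projections (shapes are illustrative).
T, d = 5, 8
x = torch.randn(T, d)
w_q, w_k, w_v = (torch.randn(d, d) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)
print(out.shape)  # torch.Size([5, 8])
```

A full Transformer block would add multiple heads, learned projection layers, residual connections, and layer normalization around this core operation.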