Section overview

  • Block 1: Fundamentals

    April 23 L1: Languages of the world & Linguistic Universals; Course organization
    April 30 L2: Language modeling, word embedding models, tokenization & vocab building
    May 7 L3: Deep Learning for (Modern) NLP — Perceptron/MLP, Backprop, Batching, Gradient Descent, Dropout…
      Ex1: Intro to Language Modeling & Tokenization
    May 14 (online) L4: Transformer Almighty & Pretraining Language Models (autoregressive, masked language modeling)
      Ex2: Backprop & Training Models in PyTorch

  • Block 2: Multilinguality

    May 28 L5: Multilingual Word Embedding Spaces (and CL Transfer with them)
      Ex3: Transformer
    June 4 L6: Multilingual LMs and Cross-Lingual Transfer; Tasks, Benchmarks & Evaluation
      Ex4: Project Topics & Setup
    June 11 L7: Curse of Multilinguality, Modularization, and Language Adaptation
    June 18 L8: Transfer for Token-Level Tasks: Word Alignment & Label Projection (+ possibly translate-train on a sequence labeling task)
      Ex5: Modularization

  • Block 3: Advanced Topics

    June 25 L9: Neural Machine Translation (incl. decoder-only MT)
      Ex6: Transfer for Token-Level Tasks
    July 2 L10: Multilingual Sentence Representations
      Ex7: Neural Machine Translation
    July 9 L11: Large Language Models, Instruction-Tuning and Generative NLP
      Ex8: Multilingual Sentence Representations
    July 23 Student Project Presentations