| Topic | Name | Description |
| --- | --- | --- |
| Part 1: Introduction to Data Science for Humanities | [File] Notebook: Python-Basics (HTML-Version) | |
| | [File] Kick-off slides | |
| Part 2: (Re-)introduction to Python | [File] Notebook: Basics 2 (HTML-Version) | |
| Part 3: Data modeling for data science | [File] Notebook: Numpy (HTML-Version) | |
| | [File] Solution: Bag-of-words | |
| | [File] Solution: Python basics | |
| Part 4: Data Acquisition and Preparation | [File] Notebook: Acquisition and Preprocessing (HTML-Version) | |
| | [File] Dataset: Olympics | |
| Part 5: Explorative Analysis 1 – Descriptive Analysis and Visualization | [File] Notebook: Descriptive Statistics (HTML-Version) | |
| | [File] Notebook: Deep-dive into Seaborn (HTML-Version) | |
| Part 6: Explorative Analysis 2 – Clustering and distance functions | [File] Notebook: Clustering and Distance Functions (HTML) | |
| | [File] Notebook: Clustering and distance function (.ipynb) | |
| Part 7: Predictive Analysis (A Gentle Introduction to Machine Learning) | [File] Notebook: Classification (.ipynb) | |
| | [File] Data: reviews_train.csv | |
| | [File] Data: reviews_text.csv | |
| | [File] Text Classification and Clustering: Slides | |
| Part 8: Text and Language I (Computational Linguistics) | [File] Notebook: Text Processing | |
| | [File] Slides: Lexical Semantics | |
| | [File] Slides: Text Representations | |
| | [File] Slides: Information Extraction | |
| | [File] Data: Unlabeled Reviews | |