Argomento Nome Descrizione
Part 1: Introduction to Data Science for Humanities File Notebook: Python-Basics (HTML-Version)
File Kick-off slides
Part 2: (Re-)introduction to Python File Notebook: Basics 2 (HTML-Version)
Part 3: Data modeling for data science File Notebook: Numpy (HTML-Version)
File Solution: Bag-of-words
File Solution: Python basics
Part 4: Data Acquisition and Preparation File Notebook: Acquistion and Preprocessing (HTML-Version)
File Dataset: Olympics
Part 5: Explorative Analysis 1 – Descriptive Analysis and visualization File Notebook: Descriptive Statistics (HTML-Version)
File Notebook: Deep-dive into Seaborn (HTML-Version)
Part 6: Explorative Analysis 2 – Clustering and distance functions File Notebook: Clustering and Distance Functions (HTML)
File Notebook: Clustering and distance function (.ipynb)
Part 7: Predictive Analysis (A Gentle Introduction to Machine Learning) File Notebook: Classification (.ipynb)
File Data: reviews_train.csv
File Data: reviews_text.csv
File Text Classification and Clustering: Slides
Part 8: Text and Language I (Computational Linguistics) File Notebook: Text Processing
File Slides: Lexical Semantics
File Slides: Text Representations
File Slides: Information Extraction
File Data: Unlabeled Reviews