You can watch recordings of the lecture videos online. Recordings are saved on Panopto and require a UNI login.

The lecture schedule will be updated as the term progresses.

Week 0 - Pre-course

Make sure to fill out the pre-course survey that is available on Slack.

Make sure you are registered for the course's Gradescope, JupyterHub, and Slack.

Week 1 (05/03) - Working with Text in Python and Bash

Weekly Readings - Due Fri, May 7, 2021 - Overleaf Template

Weekly Homework - Due Tue, May 11, 2021

Date Topic Reading Tutorial
Mon, May 3, 2021 Lecture 1
Course Overview
Command Line [slides]
[empty notebook]
[completed notebook]
Cultural Analytics Textbook, Chapter 1: The Command Line
Tutorial 1.1 - Python Bootcamp (Due Tue, May 4, 2021)
Tue, May 4, 2021 Lecture 2
Python Overview [slides]
[empty notebook]
[completed notebook]
Cultural Analytics Textbook, Chapter 2: Python Basics
Tutorial 1.2 - 2016 Miami Presidential Debate (Due Wed, May 5, 2021)
Wed, May 5, 2021 Lecture 3
Regex
Numpy
Pandas [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 3 Regular Expressions
Dive into Data Science, Chapter 2 - Numpy and BabyPandas
Thu, May 6, 2021 Lecture 4
Pandas
Matplotlib [slides]
[empty notebook]
[completed notebook]
Dive into Data Science, Chapter 3 - Visualization
Tutorial 1.3 - Google Store App Reviews with Pandas (Due Fri, May 7, 2021)
Fri, May 7, 2021 Deadline to Add Summer A Courses

Week 2 (05/10) - Words, Words, Words: the base unit of analysis

Weekly Readings - Due Sun, May 16, 2021 - Overleaf Template

Date Topic Reading Tutorial
Mon, May 10, 2021 Lecture 5
What is a word? [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 2: What’s in a Word?
Cultural Analytics Textbook, Part of Speech Tagging
Tutorial 2.1 - Text Processing (Due Tue, May 11, 2021)
Tue, May 11, 2021 Lecture 6
Document Representation [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 5.1: Enter the Matrix; Chapter 5.2: Discrete Representations
Cultural Analytics Textbook, TF-IDF
Wed, May 12, 2021 Lecture 7
Dictionary-based Methods
TF-IDF [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 4: Pointwise Mutual Information
Hovy (Textbook), Chapter 5.2.4: Dictionary-based methods
Thu, May 13, 2021 Lecture 8
Language Modeling, n-grams, & Word Representations
Dictionary-based Methods [slides]
[empty notebook]
Hovy (Textbook), Chapter 8: Language Models
Tutorial 2.2 - Dictionary-based methods (Due Thu, May 20, 2021)

Week 3 (05/17) - Topic Modeling

Weekly Readings - Due Mon, May 24, 2021 - Overleaf Template

Weekly Homework - Due Wed, May 26, 2021

Date Topic Reading Tutorial
Mon, May 17, 2021 Deadline to Drop Summer A Course
Mon, May 17, 2021 Open Hours (Gauri)
Tue, May 18, 2021 Lecture 9
Regular Expressions (Gauri) [slides]
Hovy (Textbook), Chapter 3 Regular Expressions
Wed, May 19, 2021 Lecture 10
Latent Dirichlet Allocation [slides]
Topic Modeling demo (David Mimno)
Hovy (Textbook), Chapter 9: Topic Models
Thu, May 20, 2021 Lecture 11
Evaluating Topic Models [slides]
[empty notebook]
[completed notebook]
Boyd-Graber, Hu, Mimno, Chapter 3: Evaluation & Interpretation in Applications of Topic Models

Week 4 (05/24) - Data Collection

Weekly Readings - Due Sun, May 30, 2021 - Overleaf Template

Date Topic Reading Tutorial
Mon, May 24, 2021 Lecture 12
Web Scraping [slides]
[empty notebook]
[completed notebook]
Online Textbook, Web Scraping 101
Casey Fiesler, Nathan Beard, Brian C. Keegan, No Robots, Spiders, or Scrapers: Legal and Ethical Regulation of Data Collection Methods in Social Media Terms of Service
Tue, May 25, 2021 Lecture 13
APIs: Twitter [slides]
[empty notebook]
[completed notebook]
Online Textbook, Collecting Twitter Data
Dorothy Kim and Eunsong Kim, The #TwitterEthics Manifesto
Wed, May 26, 2021 Lecture 14
APIs: Reddit [slides]
[empty notebook]
Online Textbook, Collecting Reddit Data
Baumgartner, Jason, Savvas Zannettou, Brian Keegan, Megan Squire, and Jeremy Blackburn. 2020, The Pushshift Reddit Dataset
Thu, May 27, 2021 Lecture 15
Hypothesis Testing
Text Classification
Naive Bayes [slides]
Ani Adhikari and John DeNero, Inferential Thinking, Testing Hypotheses
Lena Voita, NLP Course, Text Classification
Jurafsky and Martin, Speech and Language Processing (3rd ed. draft), Chapter 4: Naive Bayes and Sentiment Classification

Week 5 (05/31) - Machine Learning

Weekly Readings - Due Sun, Jun 6, 2021 - Overleaf Template

Weekly Homework - Due Thu, Jun 3, 2021

Date Topic Reading Tutorial
Mon, May 31, 2021 Memorial Day
No Lecture
Tue, Jun 1, 2021 Lecture 16
Guest Lecture: Maria Antoniak
Wed, Jun 2, 2021 Lecture 17
Matrix Factorization
Logistic Regression [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 6 Matrix Factorization
Lena Voita, NLP Course, Text Classification
Jurafsky and Martin, Speech and Language Processing (3rd ed. draft), Chapter 5: Logistic Regression
Thu, Jun 3, 2021 Lecture 18
Clustering [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 7: Clustering
Project Proposal (Due Thu, Jun 3, 2021)

Week 6 (06/07) - Advanced Topics & Projects

Weekly Readings - Due

Weekly Homework - Due Thu, Jun 10, 2021

Date Topic Reading Tutorial
Mon, Jun 7, 2021 Deadline to P/D/F or Withdraw from Summer A Course
Mon, Jun 7, 2021 Lecture 19
Matrix Factorization [slides]
[empty notebook]
[completed notebook]
Hovy (Textbook), Chapter 6 Matrix Factorization
Tue, Jun 8, 2021 Lecture 20
Word Embeddings [slides]
[empty notebook]
Hovy (Textbook), Chapter 5.3 Distributed Representations
Jay Alammar, The Illustrated Word2Vec (blog)
Wed, Jun 9, 2021 Lecture 21
Guest Lecture: Lucy Li, Berkeley
Project Discussions
Thu, Jun 10, 2021 Lecture 22
N-grams & Phrases
Project Discussions [slides]
[empty notebook]
Hovy (Textbook), Chapter 2.1.4 n-Grams
Hovy (Textbook), Chapter 4 Pointwise Mutual Information
Hovy (Textbook), Chapter 8 Language Models

Week 7 (06/14) - Projects

Weekly Readings - Due

Date Topic Reading Tutorial
Mon, Jun 14, 2021 Lecture 24
Project Update Presentations
Fri, Jun 18, 2021 Final Project due