The lecture schedule will be updated as the term progresses.
Make sure to fill out the pre-course survey that is available on Slack.
Make sure you are registered for the course's Gradescope, JupyterHub, and Slack.

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, May 3, 2021 | Lecture 1 Course Overview Command Line [slides] [empty notebook] [completed notebook] | Cultural Analytics Textbook, Chapter 1: The Command Line | Tutorial 1.1 - Python Bootcamp (Due Tue, May 4, 2021) |
Tue, May 4, 2021 | Lecture 2 Python Overview [slides] [empty notebook] [completed notebook] | Cultural Analytics Textbook, Chapter 2: Python Basics | Tutorial 1.2 - 2016 Miami Presidential Debate (Due Wed, May 5, 2021) |
Wed, May 5, 2021 | Lecture 3 Pandas [slides] [empty notebook] [completed notebook] | Dive into Data Science, Chapter 2 - Numpy and BabyPandas | |
Thu, May 6, 2021 | Lecture 4 Pandas Matplotlib [slides] [empty notebook] [completed notebook] | Dive into Data Science, Chapter 3 - Visualization | Tutorial 1.3 - Google Store App Reviews with Pandas (Due Fri, May 7, 2021) |
Fri, May 7, 2021 | Deadline to Add Summer A Courses | | |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, May 10, 2021 | Lecture 5 What is a word? [slides] [empty notebook] [completed notebook] | Hovy (Textbook), Chapter 2: What’s in a Word?; Cultural Analytics Textbook, Part of Speech Tagging | Tutorial 2.1 - Text Processing (Due Tue, May 11, 2021) |
Tue, May 11, 2021 | Lecture 6 Document Representation [slides] [empty notebook] [completed notebook] | Hovy (Textbook), Chapter 5.1: Enter the Matrix; Chapter 5.2: Discrete Representations; Cultural Analytics Textbook, TF-IDF | |
Wed, May 12, 2021 | Lecture 7 TF-IDF [slides] [empty notebook] [completed notebook] | Hovy (Textbook), Chapter 4: Pointwise Mutual Information; Hovy (Textbook), Chapter 5.2.4: Dictionary-based methods | |
Thu, May 13, 2021 | Lecture 8 Dictionary-based Methods [slides] [empty notebook] | Hovy (Textbook), Chapter 8: Language Models | Tutorial 2.2 - Dictionary-based methods (Due Thu, May 20, 2021) |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, May 17, 2021 | Deadline to Drop Summer A Course | | |
Mon, May 17, 2021 | Open Hours (Gauri) | | |
Tue, May 18, 2021 | Lecture 9 Regular Expressions (Gauri) [slides] | Hovy (Textbook), Chapter 3 Regular Expressions | |
Wed, May 19, 2021 | Lecture 10 Latent Dirichlet Allocation [slides] | Topic Modeling demo (David Mimno); Hovy (Textbook), Chapter 9: Topic Models | |
Thu, May 20, 2021 | Lecture 11 Evaluating Topic Models [slides] [empty notebook] [completed notebook] | Boyd-Graber, Hu, Mimno, Chapter 3: Evaluation & Interpretation in Applications of Topic Models | |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, May 24, 2021 | Lecture 12 Web Scraping [slides] [empty notebook] [completed notebook] | Online Textbook, Web Scraping 101; Casey Fiesler, Nathan Beard, Brian C. Keegan, No Robots, Spiders, or Scrapers: Legal and Ethical Regulation of Data Collection Methods in Social Media Terms of Service | |
Tue, May 25, 2021 | Lecture 13 APIs: Twitter [slides] [empty notebook] [completed notebook] | Online Textbook, Collecting Twitter Data; Dorothy Kim and Eunsong Kim, The #TwitterEthics Manifesto | |
Wed, May 26, 2021 | Lecture 14 APIs: Reddit [slides] [empty notebook] | Online Textbook, Collecting Reddit Data; Baumgartner, Jason, Savvas Zannettou, Brian Keegan, Megan Squire, and Jeremy Blackburn. 2020, The Pushshift Reddit Dataset | |
Thu, May 27, 2021 | Lecture 15 Text Classification Naive Bayes [slides] | Lena Voita, NLP Course, Text Classification; Jurafsky and Martin, Speech and Language Processing (3rd ed. draft), Chapter 4: Naive Bayes and Sentiment Classification | |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, May 31, 2021 | Memorial Day No Lecture | | |
Tue, Jun 1, 2021 | Lecture 16 Guest Lecture: Maria Antoniak | | |
Wed, Jun 2, 2021 | Lecture 17 Logistic Regression [slides] [empty notebook] [completed notebook] | Lena Voita, NLP Course, Text Classification; Jurafsky and Martin, Speech and Language Processing (3rd ed. draft), Chapter 5: Logistic Regression | |
Thu, Jun 3, 2021 | Lecture 18 Clustering [slides] [empty notebook] [completed notebook] | Hovy (Textbook), Chapter 7: Clustering | Project Proposal (Due Thu, Jun 3, 2021) |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, Jun 7, 2021 | Deadline to P/D/F or Withdraw from Summer A Course | | |
Mon, Jun 7, 2021 | Lecture 19 Matrix Factorization [slides] [empty notebook] [completed notebook] | Hovy (Textbook), Chapter 6 Matrix Factorization | |
Tue, Jun 8, 2021 | Lecture 20 Word Embeddings [slides] [empty notebook] | Hovy (Textbook), Chapter 5.3 Distributed Representations; Jay Alammar, The Illustrated Word2Vec (blog) | |
Wed, Jun 9, 2021 | Lecture 21 Guest Lecture: Lucy Li, Berkeley Project Discussions | | |
Thu, Jun 10, 2021 | Lecture 22 N-grams & Phrases Project Discussions [slides] [empty notebook] | Hovy (Textbook), Chapter 2.1.4 n-Grams; Hovy (Textbook), Chapter 4 Pointwise Mutual Information; Hovy (Textbook), Chapter 8 Language Models | |

Date | Topic | Reading | Tutorial |
---|---|---|---|
Mon, Jun 14, 2021 | Lecture 24 Project Update Presentations | | |
Fri, Jun 18, 2021 | Final Project due | | |