COMET
  • Get Started
    • Quickstart Guide
    • Install and Use COMET
    • Get Started
  • Learn By Skill Level
    • Getting Started
    • Beginner
    • Intermediate - Econometrics
    • Intermediate - Geospatial
    • Advanced

    • Browse All
  • Learn By Class
    • Making Sense of Economic Data (ECON 226/227)
    • Econometrics I (ECON 325)
    • Econometrics II (ECON 326)
    • Statistics in Geography (GEOG 374)
  • Learn to Research
    • Learn How to Do a Project
  • Teach With COMET
    • Learn how to teach with Jupyter and COMET
    • Using COMET in the Classroom
    • See COMET presentations
  • Contribute
    • Install for Development
    • Write Self Tests
  • Launch COMET
    • Launch on JupyterOpen (with Data)
    • Launch on JupyterOpen (lite)
    • Launch on Syzygy
    • Launch on Colab
    • Launch Locally

    • Project Datasets
    • Github Repository
  • |
  • About
    • COMET Team
    • Copyright Information
Categories
All (90)
1:m (2)
2SLS (5)
AI (1)
Breusch-Pagan test (2)
CRS (2)
LLMs (1)
Microsoft Azure text-to-speech (1)
OneDrive (3)
OpenAI (1)
PyTorch (1)
R (31)
S2 (1)
Whisper (1)
White's test (1)
advanced (14)
aesthetic (1)
append (3)
asymptotics (1)
audio transcription (1)
bar chart (3)
bar plot (1)
basics (2)
before-after estimator (1)
beginner (13)
bootstrapping (1)
case when (2)
causality (6)
cbind (1)
cdf (1)
cells (1)
central limit theorem (1)
central tendency (3)
chi-square distribution (1)
cleaning (2)
clearing (1)
codebook (2)
coefficients (6)
coefplot (2)
collapse (2)
combining graphs (3)
commenting (6)
comparison measure estimator (1)
confidence bands (1)
confidence intervals (1)
confidence level (1)
continuous treatment (1)
continuous variable (1)
control variables (1)
controls (4)
correlation (1)
cosine similarity (2)
covariance (1)
critical value (1)
cross-section data (1)
csv (2)
data (4)
data cleaning (2)
data frames (1)
data structures (1)
data types (2)
data visualization (1)
data wrangling (3)
dataframes (1)
delimiters (3)
describe (4)
descriptive statistics (4)
diarization (1)
difference in differences (3)
difference-in-differences (3)
directories (2)
dispersion (1)
distance (1)
distribution (1)
distributions (2)
do-files (2)
dta (2)
dummy variable (6)
dummy variables (5)
econ 226 (3)
econ 227 (2)
econ 227. confidence intervals (1)
econ 325 (11)
econ 326 (5)
econ 425 (1)
econ 490 (53)
econ 495 (1)
econ 499 (1)
econ intro (1)
egen (2)
elbow plot (1)
endogeneity (3)
estimates (3)
etable (2)
event study (4)
exclusion (3)
exporting (5)
expressions (2)
faceting (1)
factor variables (1)
factorizing (1)
fine-tuning (1)
fixed effects (1)
fixed-effects (3)
foreach (2)
forvalues (2)
full join (1)
functions (2)
gTTS (1)
generate (2)
generating variables (6)
geospatial (2)
getting started (4)
ggplot (4)
glimpse (1)
globals (2)
graphs (1)
hedonic (1)
help (4)
heteroskedasticity (8)
histogram (5)
homoskedasticity (1)
hypothesis testing (2)
ifelse (2)
import (2)
importing data (2)
inferential statistics (2)
inner join (1)
instrumental variable (3)
instrumental variables (2)
interaction (3)
interaction terms (2)
intermediate (4)
interpretation (6)
interquartile range (1)
intro (3)
introduction (8)
joint-probability (1)
jupyter (1)
k-means clustering (1)
labels (1)
large language models (2)
law of large numbers (1)
left join (1)
line plot (1)
linear differencing (1)
linear probability model (1)
lists (1)
loading data (1)
locals (2)
log-files (2)
loops (4)
m:1 (2)
m:m (2)
map (1)
master (3)
matrices (1)
mean (2)
median (2)
merge (3)
merging data (1)
missing data (1)
missing values (1)
mode (2)
mozilla TTS (1)
multicollinearity (5)
multiple regression (7)
mutating (2)
naming (4)
natural language processing (2)
non-linear terms (1)
nonlinearity (3)
normal distribution (2)
notebooks (1)
objects (1)
ols (3)
operations (2)
organization (3)
outliers (3)
p-value (2)
panel data (7)
parallel trends (4)
pareto distribution (1)
pdf (1)
percentile (1)
point estimate (1)
polynomial terms (1)
pooling (1)
preprocessing (1)
pricing (1)
probability (2)
probability notation (1)
projects (2)
pystata (18)
python (5)
r (17)
random effects (1)
random-effects (3)
raster image (1)
rbind (1)
regression (26)
relevance (3)
reshape (2)
review (1)
robust standard errors (1)
sample mean (1)
sample proportion (1)
sampling distributions (1)
save (2)
scales (1)
scatter plot (4)
scripts (4)
sentiment-analysis (1)
serial correlation (3)
setting up (3)
shapefile (2)
significance level (1)
simple regression (1)
skewness (2)
sort (2)
standard deviation (1)
standard error (1)
stargazer (1)
stata (18)
summarize (2)
summary (1)
summary statistics (1)
synthetic control (1)
t-distribution (2)
t-test (5)
tables (1)
tabulate (4)
text-to-speech (1)
tidyverse (2)
trade (1)
treatment timing (1)
trimming (3)
triple difference estimator (1)
troubleshooting (1)
twoway (4)
type-I error (1)
type-II error (1)
uniform distribution (1)
univariate regression (3)
variables (2)
variance (1)
vector (2)
vector image (1)
vectors (3)
vif (2)
visualization (14)
vocalization (1)
weak instrument (3)
winsorizing (3)
word embeddings (2)
word2vec (2)
workflow (3)

All Modules

This section contains all materials and case studies to support a variety of classes. Filter by topic using the categories on the right. Filter by class by choosing the appropriate class category.

Modules

Kernel Symbol Off

0.1 - Introduction to JupyterNotebooks
Welcome to COMET! This is the very first notebook most of you will do, and it introduces you to some basics of Jupyter and using this project. Have fun!
12 Jan 2023

0.2 - Introduction to R
This notebook introduces you to some fundamental concepts in R. It might be a little complex for a start, but it covers basically all of the fundamental syntax you need to…
12 Jan 2023

0.3.1 - Introduction to Data in R - Part 1
This notebook introduces you to data in R, primarily using the tidyverse set of packages. It includes basic data curation and cleaning, including table-based inspection…
12 Jan 2023

0.3.2 - Introduction to Data in R - Part 2
An introduction to analyzing data using tidyverse and dplyr in R including workflows for loading, merging, cleaning and visualizing data.
9 Jul 2023

01 - Jupyter and Stata
This notebook explains how to set up Jupyter, add a Stata Kernel, and connect to COMET. It is the first step to take when embarking on a research project.
29 May 2024

01 - Setting Up Jupyter and R
This notebook explains how to set up Jupyter, add an R Kernel, and connect to COMET. It is the first step to take when embarking on a research project.
29 May 2024

01 - Setting up PyStata for your Windows computer
This notebook explains how to set up Jupyter, add a PyStata Kernel, and connect to COMET. It is the first step to take when embarking on a research project.
29 May 2024

02 - Working with Do-Files
This notebook covers how to write and work with Stata do-files. We go over how to create a do-file, commenting, and generating log-files.
29 May 2024

02 - Working with Do-Files
This notebook covers how to write and work with Stata do-files. We go over how to create a do-file, commenting, and generating log-files.
29 May 2024

02 - Working with R Scripts
This notebook covers how to write and work with R scripts. We go over how to create a script, commenting, and preparing our R session.
29 May 2024

03 - R Essentials
This notebook dives into a few essentials commands in R, including types of data, how to explore our data, and some basic functions.
29 May 2024

03 - Stata Essentials
This notebook dives into a few essentials commands in Stata, including describe, summarize, loops, and help.
29 May 2024

03 - Stata Essentials
This notebook dives into a few essentials commands in Stata, including describe, summarize, loops, and help.
29 May 2024

04 - Opening Datasets
This notebook explains how to load, view, and clean data. We go over importing and previewing our data, as well as preparing the data for analysis.
29 May 2024

04 - Working with Locals and Globals
This notebook explains how to create and use locals and globals.
29 May 2024

04 - Working with Locals and Globals
This notebook explains how to create and use locals and globals.
29 May 2024

05 - Generating Variables
In this notebook, we go over how to create variables. We look into how creating dummy variables works, as well as how to create variables using mathematical expressions.
29 May 2024

05 - Opening Data Sets
This notebook explains how to load, view, and clean data. We go over importing and previewing our data, as well as preparing the data for analysis.
29 May 2024

05 - Opening Datasets
This notebook explains how to load, view, and clean data. We go over importing and previewing our data, as well as preparing the data for analysis.
29 May 2024

06 - Conducting Within Group Analysis
In this notebook, we go over how to create variables. We look into how creating dummy variables works, as well as how to create variables using mathematical expressions.
29 May 2024

06 - Generating Variables
In this notebook, we go over how to create variables. We look into how creating dummy variables works, as well as how to create variables using mathematical expressions.
29 May 2024

06 - Generating Variables
In this notebook, we go over how to create variables. We look into how creating dummy variables works, as well as how to create variables using mathematical expressions.
29 May 2024

07 - Combining Data Sets
This notebook explains how to append and merge data sets.
29 May 2024

07 - Conducting Within Group Analysis
In this notebook, we look at within-group analysis. We see how to summarize data for subgroups, how to generate new variables among subgroups, and how to reshape out data.
29 May 2024

07 - Conducting Within Group Analysis
In this notebook, we look at within-group analysis. We see how to summarize data for subgroups, how to generate new variables among subgroups, and how to reshape out data.
29 May 2024

08 - Combining Datasets
This notebook explains how to append and merge data sets.
29 May 2024

08 - Combining Datasets
This notebook explains how to append and merge data sets.
29 May 2024

08 - Creating Meaningful Visuals
This notebook goes over how to make all sorts of visuals. We look at different types of graphs, like scatter plots and histograms, exporting figures, and how to edit the…
29 May 2024

09 - Combining Graphs
This notebook explains how to combine graphs.
29 May 2024

09 - Creating Meaningful Visuals
This notebook goes over how to make all sorts of visuals. We look at different types of graphs, like scatter plots and histograms, exporting figures, and how to edit the…
29 May 2024

09 - Creating Meaningful Visuals
This notebook goes over how to make all sorts of visuals. We look at different types of graphs, like scatter plots and histograms, exporting figures, and how to edit the…
29 May 2024

game die

1.0.0 - Beginner - Introduction to Statistics
This notebook introduces you to some fundamental statistics and basic probability concepts. It is designed to cover introductory statistics material taught in Foundations of…
10 Aug 2023

1.0.1 - Beginner - Introduction to Statistics using R
This notebook is an introduction to basic statistics using Jupyter and R, and some fundamental data analysis. It is a high-level review of the most important applied tools…
8 May 2023

1.1 - Beginner - Introduction to Central Tendency
This notebook is an hands-on introduction to the concepts of Central Tendency at the beginner level using R. It is meant for undergraduates with no or very little prior…
13 Oct 2023

1.1.1 - Beginner - Central Tendency
This notebook is an introduction to basic statistics using Jupyter and R, and some fundamental data analysis.
4 Jun 2023

1.2 - Beginner - Dispersion and Dependence
In this notebook we explore how data is spread out, and what that means for its interpretation. This includes both how individual values may vary, and how values may…
12 Jan 2023

1.3.2 - Beginner - Confidence Intervals
What does it mean to be confident in a statistical estimate? This notebook is an introduction to confidence, and confidence intervals - especially in the context of the…
12 Jan 2023

1.4.2 - Beginner - Hypothesis Testing
What is a hypothesis? How do we test it? This notebook introduces hypothesis testing in two different ways, outlining the connection between them and how we can use this…
12 Jan 2023

1.5.1 - Beginner - Introduction to Data Visualization I
How do we make visualizations of data? This notebook uses the ggplot packages to help create impactful data visualizations using R. We will also discuss the rationale…
12 Jan 2023

1.5.2 - Beginner - Introduction to Data Visualization II
How do we make visualizations of data? This notebook uses the ggplot packages to help create impactful data visualizations using R. This builds on the first notebook…
12 Jan 2023

1.6 - Beginner - Distributions
This notebook introduces the idea of a statistical distribution, including its properties and some commonly used functions associated with it. We also discuss the shape of…
12 Jan 2023

1.8 - Beginner - Sampling Distributions
What is a sampling distribution? This notebook tackles, using a simulation-based approach, the most complex idea in this course. It is code-heavy, but most of it is in the…
12 Jan 2023

10 - Combining Graphs
This notebook explains how to combine graphs.
29 May 2024

10 - Combining Graphs
This notebook explains how to combine graphs.
29 May 2024

10 - Conducting Regression Analysis
This notebook goes over the theory surrounding linear regressions, as well as how to implement univariate and multivariate regressions in R. We dig into coefficient…
29 May 2024

11 - Conducting Regression Analysis
This notebook goes over the theory surrounding linear regressions, as well as how to implement univariate and multivariate regressions in Stata. We dig into coefficient…
29 May 2024

11 - Conducting Regression Analysis
This notebook goes over the theory surrounding linear regressions, as well as how to implement univariate and multivariate regressions in Stata. We dig into coefficient…
29 May 2024

11 - Exporting Regression Output
Here, we work on how to export our regression results. We introduce some packages to make our regression results look professional and to present our coefficients in a…
29 May 2024

12 - Dummy Variables and Interactions
In this notebook, we dive into dummy variables and interaction terms. We look at how to include them in our regressions and how to interpret their coefficients.
29 May 2024

12 - Exporting Regression Output
Here, we work on how to export our regression results. We introduce some commands to make our regression results look professional and to present our coefficients in a…
29 May 2024

12 - Exporting Regression Output
Here, we work on how to export our regression results. We introduce some commands to make our regression results look professional and to present our coefficients in a…
29 May 2024

13 - Good Regression Practice
This notebook covers some good practices that should be implemented when we perform regression analysis. We look at how to handle outliers, multicollinearity…
29 May 2024

13 - Using Dummy Variables and Interactions
In this notebook, we dive into dummy variables and interaction terms. We look at how to include them in our regressions and how to interpret their coefficients.
29 May 2024

13 - Using Dummy Variables and Interactions
In this notebook, we dive into dummy variables and interaction terms. We look at how to include them in our regressions and how to interpret their coefficients.
29 May 2024

14 - Good Regression Practices
This notebook covers some good practices that should be implemented when we perform regression analysis. We look at how to handle outliers, multicollinearity…
29 May 2024

14 - Good Regression Practices
This notebook covers some good practices that should be implemented when we perform regression analysis. We look at how to handle outliers, multicollinearity…
29 May 2024

14 - Panel Data Regressions
In this notebook, we go over panel data. We look into what it is, how to run regressions with panel data, as well as fixed and random-effects models. We finish by looking at…
29 May 2024

15 - Difference-in-Differences Analysis
This notebook introduces difference-in-difference analysis. We look the assumptions required to perform this type of analysis, how to run the regressions, how to run event…
29 May 2024

15 - Panel Data Regressions
In this notebook, we go over panel data. We look into what it is, how to run regressions with panel data, as well as fixed and random-effects models. We finish by looking at…
29 May 2024

15 - Panel Data Regressions
In this notebook, we go over panel data. We look into what it is, how to run regressions with panel data, as well as fixed and random-effects models. We finish by looking at…
29 May 2024

16 - Differences-in-Differences Analysis
This notebook introduces difference-in-difference analysis. We look the assumptions required to perform this type of analysis, how to run the regressions, how to run event…
29 May 2024

16 - Differences-in-Differences Analysis
This notebook introduces difference-in-difference analysis. We look the assumptions required to perform this type of analysis, how to run the regressions, how to run event…
29 May 2024

16 - Instrumental Variable Analysis
This notebook introduces instrumental variable analysis. We look the conditions that must be satisfied to perform an IV analysis, how the two-stage-least-squares approach…
29 May 2024

17 - Instrumental Variable Analysis
This notebook introduces instrumental variable analysis. We look the conditions that must be satisfied to perform an IV analysis, how the two-stage-least-squares approach…
29 May 2024

17 - R Workflow Guide
This notebook is here to help us organize our files when conducting large-scale research. We talk about workflow management, coding style, and cloud storage.
29 May 2024

17: Instrumental Variable Analysis
This notebook introduces instrumental variable analysis. We look the conditions that must be satisfied to perform an IV analysis, how the two-stage-least-squares approach…
29 May 2024

18 - Stata Workflow Guide
This notebook is here to help us organize our files when conducting large-scale research. We talk about workflow management, coding style, and cloud storage.
29 May 2024

18 - Stata Workflow Guide
This notebook is here to help us organize our files when conducting large-scale research. We talk about workflow management, coding style, and cloud storage.
29 May 2024

2.1 - Intermediate - Introduction to Regression
An introduction to simple regression using Jupyter and R, with an emphasis on understanding what regression models are actually doing. Computation is using OLS.
25 Jul 2024

2.2 - Intermediate - Multiple Regression
An introduction to multiple regression using Jupyter and R, connecting simple to multiple regression. We also discuss some important concepts, including control variables.
8 Dec 2022

2.4 - Intermediate - Issues in Regression
What are the key issues with a regression model? This notebook discusses collinearity, heteroskedasticity, and model specification.
8 Dec 2022

2.5 - Intermediate - Interactions and Non-linear Terms
How do we specific non-linear models? Why would we want to do so? This notebook covers non-linear and interaction models, including marginal effects and related issues.
8 Dec 2022

3.1.1 - Advanced - Linear Differencing Models I
This note introducing difference-in-difference style models, particularly for causal models and inference.

3.1.2 - Advanced - Linear Differencing Models II
This notebook introduces students to linear differencing, focusing on techniques of difference-in-differences with variation in treatment timing, and event-studies with a…
4 Jun 2024

3.2.1 - Advanced - Instrumental Variables
An introduction to estimating causal effects with instrumental variables on Jupyter and R.
10 Jun 2024

3.2.2 - Advanced - Instrumental Variables 2
An introduction to estimating causal effects with instrumental variables on Jupyter and R.
3 Jun 2024

3.3 - Advanced - Panel Data
This module goes over the theory of panel data analysis as well as how to apply the theory to real-world data. We look into panel regressions, fixed effects, a few other…
26 Jul 2024

3.4 - Advanced - Synthetic Control
An introduction to estimating causality through the use of synthetic control. Synthetic control is the process by which we create a counterfactual to the unit we actually…
24 Aug 2024

3.5.1 - Advanced - Geospatial Analysis
This notebook introduces geospatial analysis with vector data in R. We go over basic geospatial objects and operations.
24 Jun 2024

3.5.2 - Advanced - Geospatial Analysis
This notebook explores geospatial analysis with vector data in R in more detail. We go over file types, choosing a CRS, as well as an application with real world data.
24 Jun 2024

4.1 - Advanced - Classification and Clustering
This notebook introduces the classification and clustering models, especially for economic and sociological datasets.
18 Oct 2022

4.2 - Advanced - Introduction to Sentiment Analysis
This notebook explains how to perform basic sentiment analysis and Reddit web scraping using R.
4 Jul 2024

whisper audio

4.3.1 - Advanced - Transcription
This notebook introduces how to use machine learning tools to transcribe and diarize audio files.

4.3.2 - Advanced - Vocalization
This notebook demonstrates how to produce human-like speech from text input in a programmatic fashion, using Python.

4.4 - Advanced - Word Embeddings (Python)
This notebook introduces the concept and implementation of word embeddings, as used in AI tools like LLMs, in Python.

4.4 - Advanced - Word Embeddings (R)
This notebook introduces the concept and implementation of word embeddings, as used in AI tools like LLMs, in R.

4.5 - Advanced - LLM APIs 2
This notebook illustrates how to call different Large Language Models (LLMs) using their API, for the purposes of data analysis or computational use.
7 Aug 2024

4.6 - Advanced - Fine-Tuning Large Language Models for Sentiment Analysis
An introduction to fine-tuning LLMs using BERT, in Python.
29 Jul 2024

Projects - Example Project for ECON 325
Let’s put it all together! This notebook is an example of what a “final project” might look like in ECON 325. It summarizes and uses all of the empirical skills and R…
12 Jan 2023

Projects - Example Project for ECON 326
Let’s put it all together! This notebook is an example of what a “final project” might look like in ECON 326. It summarizes and uses all many of the empirical skills and R…
1 Jul 2023
No matching items
    • Creative Commons License. See details.
     
    • Report an issue
    • The COMET Project and the UBC Vancouver School of Economics are located on the traditional, ancestral and unceded territory of the xʷməθkʷəy̓əm (Musqueam) and Sḵwx̱wú7mesh (Squamish) peoples.