COMET
  • Get Started
    • Quickstart Guide
    • Install and Use COMET
    • Get Started
  • Learn By Skill Level
    • Getting Started
    • Beginner
    • Intermediate - Econometrics
    • Intermediate - Geospatial
    • Advanced

    • Browse All
  • Learn By Class
    • Making Sense of Economic Data (ECON 226/227)
    • Econometrics I (ECON 325)
    • Econometrics II (ECON 326)
    • Statistics in Geography (GEOG 374)
  • Learn to Research
    • Learn How to Do a Project
  • Teach With COMET
    • Learn how to teach with Jupyter and COMET
    • Using COMET in the Classroom
    • See COMET presentations
  • Contribute
    • Install for Development
    • Write Self Tests
  • Launch COMET
    • Launch on JupyterOpen (with Data)
    • Launch on JupyterOpen (lite)
    • Launch on Syzygy
    • Launch on Colab
    • Launch Locally

    • Project Datasets
    • Github Repository
  • |
  • About
    • COMET Team
    • Copyright Information
  1. Beginner: Using R and Data in Applied Econometrics
  • Learn by Skill Level


  • Getting Started: Introduction to Data, R, and Econometrics
    • Intro to JupyterNotebooks
    • Intro to R
    • Intro to Data (Part 1)
    • Intro to Data (Part 2)

  • Beginner: Using R and Data in Applied Econometrics
    • Introduction to Statistics I
    • Introduction to Statistics II
    • Central Tendency
    • Dispersion and Dependence
    • Confidence Intervals
    • Hypothesis Testing
    • Data Visualization I
    • Data Visualization II
    • Distributions
    • Sampling Distributions
    • Simple Regression

  • Intermediate: Econometrics and Modeling Using R
    • Simple Regression
    • Multiple Regression
    • Issues in Regression
    • Interactions

    • Geographic Computation
    • Chi-Square Test
    • t-test
    • ANOVA
    • Regression
    • Wrangling and Visualizing Data

  • Advanced Modules
    • Classification and Clustering
    • Differences In Differences
    • Geospatial I
    • Geospatial II
    • Instrumental Variables I
    • Instrumental Variables II
    • Large Language Model APIs (Python)
    • Linear Differencing
    • Training LLMS
    • Sentiment Analysis Using LLMs (Python)
    • Transcription (Python)
    • Vocalization (Python)
    • Word Embeddings (Python)
    • Word Embeddings (R)
    • Panel Data
    • Synthetic Controls
Categories
All (11)
R (11)
asymptotics (1)
bar plot (1)
beginner (12)
bootstrapping (1)
cdf (1)
central limit theorem (1)
central tendency (3)
chi-square distribution (1)
confidence bands (1)
confidence intervals (1)
confidence level (1)
correlation (1)
covariance (1)
critical value (1)
data (1)
data types (1)
data visualization (1)
data wrangling (1)
descriptive statistics (4)
dispersion (1)
distribution (1)
distributions (2)
dummy variable (1)
econ 226 (3)
econ 227 (2)
econ 227. confidence intervals (1)
econ 325 (9)
exporting (1)
faceting (1)
ggplot (2)
graphs (1)
histogram (2)
hypothesis testing (1)
importing data (1)
inferential statistics (2)
interquartile range (1)
introduction (3)
joint-probability (1)
labels (1)
law of large numbers (1)
line plot (1)
mean (2)
median (2)
mode (2)
normal distribution (2)
p-value (1)
pdf (1)
percentile (1)
point estimate (1)
probability (2)
probability notation (1)
raster image (1)
review (1)
sample mean (1)
sample proportion (1)
sampling distributions (1)
scales (1)
scatter plot (1)
significance level (1)
skewness (2)
standard deviation (1)
standard error (1)
t-distribution (2)
t-test (2)
type-I error (1)
type-II error (1)
uniform distribution (1)
variance (1)
vector image (1)
visualization (2)

Beginner: Using R and Data in Applied Econometrics

The modules in this unit are Beginner level. They are intended for people who are starting to learn how to use tools like Jupyter and R for applied econometrics. Many courses share this material, and you should try working through the notebooks in order.

  • If you are looking to filter by a specific course, select the course number in the list of categories on the right.
  • If you’re looking for the basics, check out our getting started pages for an introduction.

You should make sure you’re familiar with the basics of R and using data before starting these notebooks.

game die

1.0.0 - Beginner - Introduction to Statistics
This notebook introduces you to some fundamental statistics and basic probability concepts. It is designed to cover introductory statistics material taught in Foundations of…
10 Aug 2023

1.0.1 - Beginner - Introduction to Statistics using R
This notebook is an introduction to basic statistics using Jupyter and R, and some fundamental data analysis. It is a high-level review of the most important applied tools…
8 May 2023

1.1 - Beginner - Introduction to Central Tendency
This notebook is an hands-on introduction to the concepts of Central Tendency at the beginner level using R. It is meant for undergraduates with no or very little prior…
13 Oct 2023

1.1.1 - Beginner - Central Tendency
This notebook is an introduction to basic statistics using Jupyter and R, and some fundamental data analysis.
4 Jun 2023

1.2 - Beginner - Dispersion and Dependence
In this notebook we explore how data is spread out, and what that means for its interpretation. This includes both how individual values may vary, and how values may…
12 Jan 2023

1.3.2 - Beginner - Confidence Intervals
What does it mean to be confident in a statistical estimate? This notebook is an introduction to confidence, and confidence intervals - especially in the context of the…
12 Jan 2023

1.4.2 - Beginner - Hypothesis Testing
What is a hypothesis? How do we test it? This notebook introduces hypothesis testing in two different ways, outlining the connection between them and how we can use this…
12 Jan 2023

1.5.1 - Beginner - Introduction to Data Visualization I
How do we make visualizations of data? This notebook uses the ggplot packages to help create impactful data visualizations using R. We will also discuss the rationale…
12 Jan 2023

1.5.2 - Beginner - Introduction to Data Visualization II
How do we make visualizations of data? This notebook uses the ggplot packages to help create impactful data visualizations using R. This builds on the first notebook…
12 Jan 2023

1.6 - Beginner - Distributions
This notebook introduces the idea of a statistical distribution, including its properties and some commonly used functions associated with it. We also discuss the shape of…
12 Jan 2023

1.8 - Beginner - Sampling Distributions
What is a sampling distribution? This notebook tackles, using a simulation-based approach, the most complex idea in this course. It is code-heavy, but most of it is in the…
12 Jan 2023
No matching items

Introduction to Statistics I
  • Creative Commons License. See details.
 
  • Report an issue
  • The COMET Project and the UBC Vancouver School of Economics are located on the traditional, ancestral and unceded territory of the xʷməθkʷəy̓əm (Musqueam) and Sḵwx̱wú7mesh (Squamish) peoples.