This repository contains the source data for my first, second and third blog posts on college degrees in the US.
I derive data/observations.csv from the 2016 ACS PUMS data by running code/observations.do in Stata MP/15.
The data dictionary for data/observations.csv is as follows.
| Variable | Type | Description |
|---|---|---|
age |
int | Age of respondent at survey date |
female |
int | Binary indicator for whether respondent is female |
level |
int | Educational attainment level ID as in data/attainment.csv |
field1, field2 |
int | Degree field IDs as in data/fields.csv |
weight |
dbl | Aggregated sample weight |
I create data/attainment.csv and data/fields.csv manually based on the PUMS data dictionary.
All repository content is licensed under the MIT license.