SEDA Data Downloads

Publicly available data files, technical documentation and data codebooks

Version 1.0
Data description Download Documentation
This file contains district level means in grade equivalent units. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV Codebook
This file contains district level means in grade equivalent units. There are multiple observations per district, one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level means in grade equivalent units. There is one observation per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level means in constant population standard deviation units. There is one observation per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level means in NAEP-referenced units. Estimates are comparable between states. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level means in state-referenced units. Estimates are comparable within states. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level white-black and white-Hispanic achievement gaps. There is one observation per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year and grade. Stata CSV Codebook
This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year. Stata CSV
This file contains district level covariates (socioeconomic, demographic, school level data). There is one observation per district. Stata Excel
This file contains a unique school identifier, an identifier indicating its NCES ID (the district to which it legally belongs), and the district in which it is included in our estimates. There is one observation per school. Stata CSV Documentation
Version 1.1

Technical Documentation

Assessment Outcomes: Means and Standard Errors
Data Description Disaggregated by Download Documentation
File Title Description Metric Geographic Level Year Grade Subject
MeanA_V1.1 This file contains district level means in grade equivalent units. There are multiple observations per district; one for each year, grade and subject. Grade Equivalent Units District x x x Stata CSV Codebook
MeanB_V1.1 This file contains district level means in grade equivalent units. There are multiple observations per district, one for each subject; values are averaged across years and grades. Grade Equivalent Units District x Stata CSV
MeanC_V1.1 This file contains district level means in grade equivalent units. There is one observation per district; values are averaged across years, grades and subjects. Grade Equivalent Units District Stata CSV
MeanD_V1.1 This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each year, grade and subject. Standard Deviation Units District x x x Stata CSV
MeanE_V1.1 This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each subject; values are averaged across years and grades. Standard Deviation Units District x Stata CSV
MeanF_V1.1 This file contains district level means in constant population standard deviation units. There is one observation per district; values are averaged across years, grades and subjects. Standard Deviation Units District Stata CSV
MeanG_V1.1 This file contains district level means in NAEP-referenced units. Estimates are comparable between states. There are multiple observations per district; one for each year, grade and subject. NAEP District x x x Stata CSV
MeanH_V1.1 This file contains district level means in state-referenced units. Estimates are comparable within states. There are multiple observations per district; one for each year, grade and subject. State Referenced District x x x Stata CSV
Assessment Outcomes: Achievement Gaps
Data Description Disaggregated by Download Documentation
File Title Description Metric Geographic Level Year Grade Subject
GapA_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each year, grade and subject. Standard Deviation Units District x x x Stata CSV Codebook
GapB_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each subject; values are averaged across years and grades. Standard Deviation Units District x Stata CSV
GapC_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There is one observation per district; values are averaged across years, grades and subjects. Standard Deviation Units District Stata CSV
Covariates
Data Description Disaggregated by Download Documentation
File Title Description Metric Geographic Level Year Grade Subject
CovA_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year and grade. - District x x Stata CSV Codebook
CovB_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year. - District x Stata CSV
CovC_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There is one observation per district. - District Stata CSV
Ancillary Files
Data Description Disaggregated by Download Documentation
File Title Description Metric Geographic Level Year Grade Subject
AncillaryA_V1.1 This file contains a unique school identifier, an identifier indicating its NCES ID (the district to which it legally belongs), and the district in which it is included in our estimates. There is one observation per school. - District Stata CSV
AncillaryB_V1.1 This file contains the shape file that corresponds to the district crosswalk. - National File
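The school crosswalk described above (one row per school, giving its legal district and the district it is counted under in the estimates) could be read along the following lines. This is only a sketch: the column names `school_id`, `legal_district_id` and `estimate_district_id` and the sample rows are made up for illustration, so consult the codebook for the actual variable names.

```python
import csv
import io

# Tiny made-up sample standing in for the crosswalk CSV. The column names
# are hypothetical; the real ones are listed in the crosswalk codebook.
SAMPLE = """school_id,legal_district_id,estimate_district_id
S001,D10,D10
S002,D10,D10
S003,D20,D10
"""


def load_crosswalk(handle):
    """Map each school to (legal district, district used in the estimates)."""
    mapping = {}
    for row in csv.DictReader(handle):
        school = row["school_id"]
        # The file is described as having one observation per school,
        # so a repeated identifier signals a problem with the input.
        if school in mapping:
            raise ValueError(f"duplicate school id: {school}")
        mapping[school] = (row["legal_district_id"], row["estimate_district_id"])
    return mapping


crosswalk = load_crosswalk(io.StringIO(SAMPLE))

# Schools whose estimates are reported under a different district
# than the one they legally belong to.
reassigned = sorted(s for s, (legal, est) in crosswalk.items() if legal != est)
```

To use it on a downloaded file, pass an open file handle (for example `open(path, newline="")`) instead of the in-memory sample.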
Version 2.0

This page contains four sets of files:

  1. Technical Documentation and Codebooks
  2. Test Score Estimates: Means, Standard Deviations, and Achievement Gaps
  3. Covariate Data Files
  4. Ancillary Data Files

Errata: The first release of SEDA 2.0 had an error in the pooled gap estimates. The current files incorporate the fix. See the technical documentation for details.

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_v20 PDF
SEDA_codebook_geodist_v20 Excel
SEDA_codebook_county_v20 Excel
SEDA_codebook_cov_geodist_v20 Excel
SEDA_codebook_crosswalk_v20 Excel

Test Score Estimates: Means, Standard Deviations, and Achievement Gaps
File Name Form Metric Disaggregated by Download
Geographic District County Year Grade Subject Group
All Race Race Gaps
SEDA_geodist_long_CS_v20 Long CS X X X X X X X Stata CSV
SEDA_geodist_long_GCS_v20 Long GCS X X X X X X X Stata CSV
SEDA_geodist_long_NAEP_v20 Long NAEP X X X X X X X Stata CSV
SEDA_geodist_long_State_v20 Long State X X X X X X X Stata CSV
SEDA_geodist_poolsub_CS_v20 Pooled CS X X X X X Stata CSV
SEDA_geodist_poolsub_GCS_v20 Pooled GCS X X X X X Stata CSV
SEDA_geodist_pool_CS_v20 Pooled CS X X X X Stata CSV
SEDA_geodist_pool_GCS_v20 Pooled GCS X X X X Stata CSV
SEDA_county_long_CS_v20 Long CS X X X X X X X Stata CSV
SEDA_county_long_GCS_v20 Long GCS X X X X X X X Stata CSV
SEDA_county_long_NAEP_v20 Long NAEP X X X X X X X Stata CSV
SEDA_county_long_State_v20 Long State X X X X X X X Stata CSV
SEDA_county_poolsub_CS_v20 Pooled CS X X X X X Stata CSV
SEDA_county_poolsub_GCS_v20 Pooled GCS X X X X X Stata CSV
SEDA_county_pool_CS_v20 Pooled CS X X X X Stata CSV
SEDA_county_pool_GCS_v20 Pooled GCS X X X X Stata CSV

Metric: CS = Cohort Scale; GCS = Grade Scale; NAEP = NAEP Scale; State = State-referenced Scale
Academic Years: 2008/09 – 2014/15
Grades: 3 – 8
Subjects: Math, ELA
Race: white, black, Hispanic, and Asian
Race Gaps: white-black, white-Hispanic, white-Asian
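The test score file names in the table above appear to follow the pattern `SEDA_<unit>_<form>_<metric>_<version>`. As a minimal sketch, assuming that pattern holds for every Version 2.0 test score file, a name can be split into its parts and the metric looked up in the legend. The `SedaFile` type and `parse_seda_name` function are illustrative names, not part of SEDA.

```python
import re
from typing import NamedTuple, Optional

# Metric abbreviations as given in the legend on this page.
METRICS = {
    "CS": "Cohort Scale",
    "GCS": "Grade Scale",
    "NAEP": "NAEP Scale",
    "State": "State-referenced Scale",
}


class SedaFile(NamedTuple):
    unit: str     # e.g. "geodist" or "county"
    form: str     # "long", "poolsub" or "pool"
    metric: str   # a key of METRICS
    version: str  # e.g. "v20"


# Assumed naming pattern, inferred from the file names listed above.
_PATTERN = re.compile(
    r"^SEDA_(?P<unit>[a-z]+)_(?P<form>long|poolsub|pool)_"
    r"(?P<metric>CS|GCS|NAEP|State)_(?P<version>v\d+)$"
)


def parse_seda_name(name: str) -> Optional[SedaFile]:
    """Split a test score file name into its parts, or return None."""
    match = _PATTERN.match(name)
    if match is None:
        return None  # e.g. covariate or crosswalk files
    return SedaFile(**match.groupdict())


parsed = parse_seda_name("SEDA_geodist_poolsub_GCS_v20")
```

Covariate and ancillary file names do not fit this pattern, so the function returns `None` for them rather than guessing.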

Covariate Data
File Name Form Disaggregated by Download
District Year Grade
SEDA_cov_geodist_long_v20 Long X X X Stata CSV
SEDA_cov_geodist_poolyr_v20 Pooled X X Stata CSV
SEDA_cov_geodist_pool_v20 Pooled X Stata CSV

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_v20 X X Stata CSV
SEDA_shapefiles_v20 X Zip

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata (v13) and .csv formats.

In publications, please cite the data as: Sean F. Reardon, Andrew D. Ho, Benjamin R. Shear, Erin M. Fahle, Demetra Kalogrides, & Richard DiSalvo. (2018). Stanford Education Data Archive (Version 2.1). http://purl.stanford.edu/vy177vf4659.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Version 2.1 Notes

The currently available data include district and county level average achievement (for all students and by race/ethnicity and gender), district and county level racial/ethnic and gender achievement gaps, and district level demographic/socioeconomic data. The most recent release (currently, Version 4.1) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication. Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process and describe the contents of each file.

Version 2.1

This page contains five sets of files:

  1. Technical Documentation and Codebooks
  2. Data Files Used in News Articles
  3. Test Score Estimates: Means, Standard Deviations, and Achievement Gaps
  4. Covariate Data Files
  5. Ancillary Data Files

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_v21 PDF
SEDA_codebook_geodist_v21 Excel
SEDA_codebook_county_v21 Excel
SEDA_codebook_cov_geodist_v21 Excel
SEDA_codebook_crosswalk_v21 Excel

Data Files Used in News Articles
Article and Date Paper Data
Where Boys Outperform Girls in Math: Rich, White and Suburban Districts. New York Times. 6/13/2018
Gender Achievement Gaps in U.S. School Districts. Reardon, S.F., Fahle, E.M., Kalogrides, D., Podolsky, A., & Zárate, R.C. (2018) Excel* Data
*Notes about the data are included in the Excel spreadsheet.

Test Score Estimates: Means, Standard Deviations, and Achievement Gaps
File Name Form Metric Disaggregated by Download
Unit Year Grade Subject Means & SDs Gaps
All Race Gender Race Gender
SEDA_geodist_long_CS_v21 Long CS Geographic District X X X X X X X X Stata CSV
SEDA_geodist_long_GCS_v21 Long GCS Geographic District X X X X X X X X Stata CSV
SEDA_geodist_long_NAEP_v21 Long NAEP Geographic District X X X X X X X X Stata CSV
SEDA_geodist_long_State_v21 Long State Geographic District X X X X X X X X Stata CSV
SEDA_geodist_poolsub_CS_v21 Pooled CS Geographic District X X X X X X Stata CSV
SEDA_geodist_poolsub_GCS_v21 Pooled GCS Geographic District X X X X X X Stata CSV
SEDA_geodist_pool_CS_v21 Pooled CS Geographic District X X X X X Stata CSV
SEDA_geodist_pool_GCS_v21 Pooled GCS Geographic District X X X X X Stata CSV
SEDA_county_long_CS_v21 Long CS County X X X X X X X X Stata CSV
SEDA_county_long_GCS_v21 Long GCS County X X X X X X X X Stata CSV
SEDA_county_long_NAEP_v21 Long NAEP County X X X X X X X X Stata CSV
SEDA_county_long_State_v21 Long State County X X X X X X X X Stata CSV
SEDA_county_poolsub_CS_v21 Pooled CS County X X X X X X Stata CSV
SEDA_county_poolsub_GCS_v21 Pooled GCS County X X X X X X Stata CSV
SEDA_county_pool_CS_v21 Pooled CS County X X X X X Stata CSV
SEDA_county_pool_GCS_v21 Pooled GCS County X X X X X Stata CSV

Metric: CS = Cohort Scale; GCS = Grade Scale; NAEP = NAEP Scale; State = State-referenced Scale
Academic Years: 2008/09 – 2014/15
Grades: 3 – 8
Subjects: Math, ELA
Race: white, black, Hispanic, and Asian
Race Gaps: white-black, white-Hispanic, white-Asian
Gender: male, female
Gender Gaps: male-female

Covariate Data
File Name Form Disaggregated by Download
District Year Grade
SEDA_cov_geodist_long_v21 Long X X X Stata CSV
SEDA_cov_geodist_poolyr_v21 Pooled X X Stata CSV
SEDA_cov_geodist_pool_v21 Pooled X Stata CSV

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_v21 X X Stata CSV
SEDA_shapefiles_v20 X ZIP

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as: Reardon, S. F., Ho, A. D., Shear, B. R., Fahle, E. M., Kalogrides, D., Jang, H., Chavez, B., Buontempo, J., & DiSalvo, R. (2019). Stanford Education Data Archive (Version 3.0). http://purl.stanford.edu/mk782vn8293.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Version 3.0 Notes

The currently available data include district and county level average achievement (for all students and by race/ethnicity and gender), district and county level racial/ethnic and gender achievement gaps, and district level demographic/socioeconomic data. The most recent release (currently, Version 4.1) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication. Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process and describe the contents of each file.

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_v30 PDF
seda_codebook_school_v30 Excel
seda_codebook_geodist_v30 Excel
seda_codebook_county_v30 Excel
seda_codebook_metro_v30 Excel
seda_codebook_commzone_v30 Excel
seda_codebook_cov_school_v30 Excel
seda_codebook_cov_geodist_v30 Excel
seda_codebook_cov_county_v30 Excel
seda_codebook_crosswalk_v30 Excel

*On 9/21/2023 errors were identified in these "poolsub" data files in SEDA 3.0. These files have been removed from the data archive. If you previously downloaded any of these files, please delete any locally stored copies. If you are looking to use "poolsub" data files, newer versions of these files are available in SEDA 4.1.
Test Score Estimations: Means and Achievement Gaps
File Name Form Metric Unit Disaggregated by Subgroups Download
School Geographic District County Metro CZ Year Grade Subject Means Gaps
All Race Gender ECD Race Gender
seda_school_pool_CS_v30 Pooled CS X X Stata Excel
seda_school_pool_GCS_v30 Pooled GCS X X Stata Excel
seda_geodist_long_CS_v30 Long CS X X X X X X X X X X X Stata Excel
seda_geodist_long_GCS_v30 Long GCS X X X X X X X X X X X Stata Excel
seda_geodist_poolsub_CS_v30 Pooled CS X X X X X X X X X Not available* Not available*
seda_geodist_poolsub_GCS_v30 Pooled GCS X X X X X X X X X Not available* Not available*
seda_geodist_pool_GCS_v30 Pooled GCS X X X X X X X X Stata Excel
seda_geodist_pool_CS_v30 Pooled CS X X X X X X X X Stata Excel
seda_county_long_CS_v30 Long CS X X X X X X X X X X X Stata Excel
seda_county_long_GCS_v30 Long GCS X X X X X X X X X X X Stata Excel
seda_county_poolsub_CS_v30 Pooled CS X X X X X X X X X Not available* Not available*
seda_county_poolsub_GCS_v30 Pooled GCS X X X X X X X X X Not available* Not available*
seda_county_pool_CS_v30 Pooled CS X X X X X X X X Stata Excel
seda_county_pool_GCS_v30 Pooled GCS X X X X X X X X Stata Excel
seda_metro_long_CS_v30 Long CS X X X X X X X X X X X Stata Excel
seda_metro_long_GCS_v30 Long GCS X X X X X X X X X X X Stata Excel
seda_metro_poolsub_CS_v30 Pooled CS X X X X X X X X X Not available* Not available*
seda_metro_poolsub_GCS_v30 Pooled GCS X X X X X X X X X Not available* Not available*
seda_metro_pool_CS_v30 Pooled CS X X X X X X X X Stata Excel
seda_metro_pool_GCS_v30 Pooled GCS X X X X X X X X Stata Excel
seda_commzone_long_CS_v30 Long CS X X X X X X X X X X X Stata Excel
seda_commzone_long_GCS_v30 Long GCS X X X X X X X X X X X Stata Excel
seda_commzone_poolsub_CS_v30 Pooled CS X X X X X X X X X Not available* Not available*
seda_commzone_poolsub_GCS_v30 Pooled GCS X X X X X X X X X Not available* Not available*
seda_commzone_pool_CS_v30 Pooled CS X X X X X X X X Stata Excel
seda_commzone_pool_GCS_v30 Pooled GCS X X X X X X X X Stata Excel

Covariate Data
File Name Form Disaggregated by Download
Unit Year Grade
SEDA_cov_school_pool_v30 Pooled X Stata Excel
SEDA_cov_geodist_long_v30 Long X X X Stata Excel
SEDA_cov_geodist_poolyr_v30 Pooled X X Stata Excel
SEDA_cov_geodist_pool_v30 Pooled X Stata Excel
SEDA_cov_county_long_v30 Long X X X Stata Excel
SEDA_cov_county_poolyr_v30 Pooled X X Stata Excel
SEDA_cov_county_pool_v30 Pooled X Stata Excel
SEDA_cov_metro_long_v30 Long X X X Stata Excel
SEDA_cov_metro_poolyr_v30 Pooled X X Stata Excel
SEDA_cov_metro_pool_v30 Pooled X Stata Excel

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_v30 X X Stata CSV
SEDA_shapefiles_v20 X ZIP

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Ho, A. D., Shear, B. R., Fahle, E. M., Kalogrides, D., Jang, H., & Chavez, B. (2021).
Stanford Education Data Archive (Version 4.0). Retrieved from http://purl.stanford.edu/vb140hd2862.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Version 4.0 Notes

The currently available data include district and county level average achievement (for all students and by race/ethnicity and gender), district and county level racial/ethnic and gender achievement gaps, and district level demographic/socioeconomic data. The most recent release (currently, Version 4.1) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication. Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process and describe the contents of each file.

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_4.0 PDF
seda_codebook_school_4.0 Excel
seda_codebook_geodist_4.0 Excel
seda_codebook_county_4.0 Excel
seda_codebook_commzone_4.0 Excel
seda_codebook_metro_4.0 Excel
seda_codebook_state_4.0 Excel
seda_codebook_cov_school_4.0 Excel
seda_codebook_cov_geodist_4.0 Excel
seda_codebook_cov_county_4.0 Excel
seda_codebook_cov_metro_4.0 Excel
seda_codebook_cov_state_4.0 Excel
seda_codebook_crosswalk_4.0 Excel

*On 9/21/2023 errors were identified in these "poolsub" and "long" data files in SEDA 4.0. These files have been removed from the data archive. If you previously downloaded any of these files, please delete any locally stored copies. If you are looking to use "poolsub" or "long" data files, newer versions of these files are available in SEDA 4.1.
Test Score Estimations: Means and Achievement Gaps
File Name Form Metric Unit Disaggregated by Subgroups Download
School Geographic District County Metro CZ State Year Grade Subject Means Gaps
All Race Gender ECD Race Gender ECD
seda_school_pool_CS_4.0 Pooled CS X X Stata CSV
seda_school_pool_GCS_4.0 Pooled GCS X X Stata CSV
seda_geodist_long_CS_4.0 Long CS X X X X X X X X X X X Not available* Not available*
seda_geodist_long_GCS_4.0 Long GCS X X X X X X X X X X X Not available* Not available*
seda_geodist_poolsub_CS_4.0 Pooled CS X X X X X X X X X Not available* Not available*
seda_geodist_poolsub_GCS_4.0 Pooled GCS X X X X X X X X X Not available* Not available*
seda_geodist_pool_GCS_4.0 Pooled GCS X X X X X X X X Stata CSV
seda_geodist_pool_CS_4.0 Pooled CS X X X X X X X X Stata CSV
seda_county_long_CS_4.0 Long CS X X X X X X X X X X X Not available* Not available*
seda_county_long_GCS_4.0 Long GCS X X X X X X X X X X X Not available* Not available*
seda_county_poolsub_CS_4.0 Pooled CS X X X X X X X X X Not available* Not available*
seda_county_poolsub_GCS_4.0 Pooled GCS X X X X X X X X X Not available* Not available*
seda_county_pool_CS_4.0 Pooled CS X X X X X X X X Stata CSV
seda_county_pool_GCS_4.0 Pooled GCS X X X X X X X X Stata CSV
seda_metro_long_CS_4.0 Long CS X X X X X X X X X X X Not available* Not available*
seda_metro_long_GCS_4.0 Long GCS X X X X X X X X X X X Not available* Not available*
seda_metro_poolsub_CS_4.0 Pooled CS X X X X X X X X X Not available* Not available*
seda_metro_poolsub_GCS_4.0 Pooled GCS X X X X X X X X X Not available* Not available*
seda_metro_pool_CS_4.0 Pooled CS X X X X X X X X Stata CSV
seda_metro_pool_GCS_4.0 Pooled GCS X X X X X X X X Stata CSV
seda_commzone_long_CS_4.0 Long CS X X X X X X X X X X X Not available* Not available*
seda_commzone_long_GCS_4.0 Long GCS X X X X X X X X X X X Not available* Not available*
seda_commzone_poolsub_CS_4.0 Pooled CS X X X X X X X X X Not available* Not available*
seda_commzone_poolsub_GCS_4.0 Pooled GCS X X X X X X X X X Not available* Not available*
seda_commzone_pool_CS_4.0 Pooled CS X X X X X X X X Stata CSV
seda_commzone_pool_GCS_4.0 Pooled GCS X X X X X X X X Stata CSV
seda_state_long_cs_4.0 Long CS X X X X X X X X X X X Not available* Not available*
seda_state_long_gcs_4.0 Long GCS X X X X X X X X X X X Not available* Not available*
seda_state_poolsub_cs_4.0 Pooled CS X X X X X X X X X Not available* Not available*
seda_state_poolsub_gcs_4.0 Pooled GCS X X X X X X X X X Not available* Not available*
seda_state_pool_cs_4.0 Pooled CS X X X X X X X X Stata CSV
seda_state_pool_gcs_4.0 Pooled GCS X X X X X X X X Stata CSV

Covariate Data
File Name Form Disaggregated by Download
Unit Year Grade
seda_cov_school_pool_4.0 Pooled X Stata CSV
seda_cov_school_poolyr_4.0 Pooled X X Stata CSV
seda_cov_geodist_pool_4.0 Pooled X Stata CSV
seda_cov_geodist_poolyr_4.0 Pooled X X Stata CSV
seda_cov_geodist_long_4.0 Long X X X Stata CSV
seda_cov_county_pool_4.0 Pooled X Stata CSV
seda_cov_county_poolyr_4.0 Pooled X X Stata CSV
seda_cov_county_long_4.0 Long X X X Stata CSV
seda_cov_metro_pool_4.0 Pooled X Stata CSV
seda_cov_metro_poolyr_4.0 Pooled X X Stata CSV
seda_cov_metro_long_4.0 Long X X X Stata CSV
seda_cov_state_pool_4.0 Pooled X Stata CSV
seda_cov_state_poolyr_4.0 Pooled X X Stata CSV
seda_cov_state_long_4.0 Long X X X Stata CSV

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_4.0 X X Stata CSV
seda_shapefiles_2019_4.0 X ZIP

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Ho, A. D., Shear, B. R., Fahle, E. M., Kalogrides, D., Jang, H., & Chavez, B. (2021).
Stanford Education Data Archive (Version 4.1). Retrieved from http://purl.stanford.edu/xv742vh9296.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Version 4.1 Notes

The currently available data include district and county level average achievement (for all students and by race/ethnicity and gender), district and county level racial/ethnic and gender achievement gaps, and district level demographic/socioeconomic data. The most recent release (currently, Version 4.1) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication. Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process and describe the contents of each file.

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_4.1 PDF
seda_codebook_school_4.1 Excel
seda_codebook_geodist_4.1 Excel
seda_codebook_county_4.1 Excel
seda_codebook_commzone_4.1 Excel
seda_codebook_metro_4.1 Excel
seda_codebook_state_4.1 Excel
seda_codebook_cov_school_4.1 Excel
seda_codebook_cov_geodist_4.1 Excel
seda_codebook_cov_county_4.1 Excel
seda_codebook_cov_metro_4.1 Excel
seda_codebook_cov_state_4.1 Excel
seda_codebook_crosswalk_4.1 Excel

*On 9/21/2023 errors were identified in these "poolsub" data files in SEDA 4.1. The original files have been removed from the data archive. If you previously downloaded any of these files, please delete any locally stored copies. Corrected versions of these files are available in the archive; these files can be identified by the suffix "_corrected" in the filename.
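The 9/21/2023 notices for Versions 3.0, 4.0 and 4.1 ask users to delete locally stored copies of the withdrawn files. The following is a rough sketch of how a local file name (without its extension) could be checked against those notices. It assumes that the `_corrected` suffix sits at the very end of the name and that withdrawn files keep the naming pattern shown in the tables; `should_delete` is an illustrative name, not a SEDA tool.

```python
import re

# Assumed naming pattern for test score files in Versions 3.0, 4.0 and 4.1.
_NAME = re.compile(
    r"^seda_(?P<unit>[a-z]+)_(?P<form>long|poolsub|pool)_"
    r"(?P<metric>cs|gcs)_(?P<version>v30|4\.0|4\.1)"
    r"(?P<corrected>_corrected)?$",
    re.IGNORECASE,
)


def should_delete(stem: str) -> bool:
    """Return True if the file name matches one of the withdrawal notices."""
    match = _NAME.match(stem)
    if match is None:
        return False  # not a recognised test score file name
    form = match["form"].lower()
    version = match["version"].lower()
    if version == "v30":
        # SEDA 3.0: the "poolsub" files were withdrawn.
        return form == "poolsub"
    if version == "4.0":
        # SEDA 4.0: the "poolsub" and "long" files were withdrawn.
        return form in ("poolsub", "long")
    # SEDA 4.1: only the original "poolsub" files were withdrawn;
    # the replacements carry the "_corrected" suffix.
    return form == "poolsub" and match["corrected"] is None
```

For example, `should_delete("seda_metro_poolsub_GCS_4.1")` is true, while the same name with `_corrected` appended is not flagged.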
Test Score Estimations: Means and Achievement Gaps
File Name Form Metric Unit Disaggregated by Subgroups Download
School Geographic District County Metro CZ State Year Grade Subject Means Gaps
All Race Gender ECD Race Gender ECD
seda_school_pool_CS_4.1 Pooled CS X X Stata CSV
seda_school_pool_GCS_4.1 Pooled GCS X X Stata CSV
seda_geodist_long_CS_4.1 Long CS X X X X X X X X X X X Stata CSV
seda_geodist_long_GCS_4.1 Long GCS X X X X X X X X X X X Stata CSV
seda_geodist_poolsub_CS_4.1 Pooled CS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_geodist_poolsub_GCS_4.1 Pooled GCS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_geodist_pool_GCS_4.1 Pooled GCS X X X X X X X X Stata CSV
seda_geodist_pool_CS_4.1 Pooled CS X X X X X X X X Stata CSV
seda_county_long_CS_4.1 Long CS X X X X X X X X X X X Stata CSV
seda_county_long_GCS_4.1 Long GCS X X X X X X X X X X X Stata CSV
seda_county_poolsub_CS_4.1 Pooled CS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_county_poolsub_GCS_4.1 Pooled GCS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_county_pool_CS_4.1 Pooled CS X X X X X X X X Stata CSV
seda_county_pool_GCS_4.1 Pooled GCS X X X X X X X X Stata CSV
seda_metro_long_CS_4.1 Long CS X X X X X X X X X X X Stata CSV
seda_metro_long_GCS_4.1 Long GCS X X X X X X X X X X X Stata CSV
seda_metro_poolsub_CS_4.1 Pooled CS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_metro_poolsub_GCS_4.1 Pooled GCS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_metro_pool_CS_4.1 Pooled CS X X X X X X X X Stata CSV
seda_metro_pool_GCS_4.1 Pooled GCS X X X X X X X X Stata CSV
seda_commzone_long_CS_4.1 Long CS X X X X X X X X X X X Stata CSV
seda_commzone_long_GCS_4.1 Long GCS X X X X X X X X X X X Stata CSV
seda_commzone_poolsub_CS_4.1 Pooled CS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_commzone_poolsub_GCS_4.1 Pooled GCS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_commzone_pool_CS_4.1 Pooled CS X X X X X X X X Stata CSV
seda_commzone_pool_GCS_4.1 Pooled GCS X X X X X X X X Stata CSV
seda_state_long_CS_4.1 Long CS X X X X X X X X X X X Stata CSV
seda_state_long_GCS_4.1 Long GCS X X X X X X X X X X X Stata CSV
seda_state_poolsub_CS_4.1 Pooled CS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_state_poolsub_GCS_4.1 Pooled GCS X X X X X X X X X Stata (Corrected)* CSV (Corrected)*
seda_state_pool_CS_4.1 Pooled CS X X X X X X X X Stata CSV
seda_state_pool_GCS_4.1 Pooled GCS X X X X X X X X Stata CSV

Covariate Data
File Name Form Disaggregated by Download
Unit Year Grade
seda_cov_school_pool_4.1 Pooled X Stata CSV
seda_cov_school_poolyr_4.1 Pooled X X Stata CSV
seda_cov_geodist_pool_4.1 Pooled X Stata CSV
seda_cov_geodist_poolyr_4.1 Pooled X X Stata CSV
seda_cov_geodist_long_4.1 Long X X X Stata CSV
seda_cov_county_pool_4.1 Pooled X Stata CSV
seda_cov_county_poolyr_4.1 Pooled X X Stata CSV
seda_cov_county_long_4.1 Long X X X Stata CSV
seda_cov_metro_pool_4.1 Pooled X Stata CSV
seda_cov_metro_poolyr_4.1 Pooled X X Stata CSV
seda_cov_metro_long_4.1 Long X X X Stata CSV
seda_cov_state_pool_4.1 Pooled X Stata CSV
seda_cov_state_poolyr_4.1 Pooled X X Stata CSV
seda_cov_state_long_4.1 Long X X X Stata CSV

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_4.1 X X Stata CSV

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Ho, A. D., Shear, B. R., Fahle, E. M., Kalogrides, D., & Saliba, J. (2024).
Stanford Education Data Archive (Version 5.0). Retrieved from https://purl.stanford.edu/cs829jn7849.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Version 5.0 Notes

The currently available data include average achievement (for all students and by race/ethnicity and gender), racial/ethnic and gender achievement gaps, and demographic/socioeconomic data. We report multiple aggregations of the data: school, geographic district, administrative district, county, metropolitan statistical area, commuting zone, and state.

The most recent release (currently, Version 5.0) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication. Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process and describe the contents of each file.

Technical Documentation and Codebooks
File Name Download
SEDA_documentation_5.0 PDF
seda_codebook_school_5.0 Excel
seda_codebook_geodist_5.0 Excel
seda_codebook_admindist_5.0 Excel
seda_codebook_county_5.0 Excel
seda_codebook_commzone_5.0 Excel
seda_codebook_metro_5.0 Excel
seda_codebook_state_5.0 Excel
seda_codebook_cov_school_5.0 Excel
seda_codebook_cov_geodist_5.0 Excel
seda_codebook_cov_admindist_5.0 Excel
seda_codebook_cov_county_5.0 Excel
seda_codebook_cov_metro_5.0 Excel
seda_codebook_cov_state_5.0 Excel
seda_codebook_crosswalk_5.0 Excel

Test Score Estimations: Means and Achievement Gaps
File Name Form Metric Unit Disaggregated by Subgroups Download
School Geographic District Admin District County Metro CZ State Year Grade Subject Means Gaps
All Race Gender ECD Race Gender ECD
seda_school_pool_cs_5.0 Pooled CS X X Stata CSV
seda_school_pool_gcs_5.0 Pooled GCS X X Stata CSV
seda_geodist_long_cs_5.0* Long CS X X X X X X X X X X X Stata CSV
seda_geodist_long_gcs_5.0* Long GCS X X X X X X X X X X X Stata CSV
seda_geodist_poolsub_cs_5.0* Pooled CS X X X X X X X X X Stata CSV
seda_geodist_poolsub_gcs_5.0* Pooled GCS X X X X X X X X X Stata CSV
seda_geodist_pool_gcs_5.0* Pooled GCS X X X X X X X X Stata CSV
seda_geodist_pool_cs_5.0* Pooled CS X X X X X X X X Stata CSV
seda_admindist_long_cs_5.0* Long CS X X X X X X X X X X X Stata CSV
seda_admindist_long_gcs_5.0* Long GCS X X X X X X X X X X X Stata CSV
seda_admindist_poolsub_cs_5.0* Pooled CS X X X X X X X X X Stata CSV
seda_admindist_poolsub_gcs_5.0* Pooled GCS X X X X X X X X X Stata CSV
seda_admindist_pool_cs_5.0* Pooled CS X X X X X X X X Stata CSV
seda_admindist_pool_gcs_5.0* Pooled GCS X X X X X X X X Stata CSV
seda_county_long_cs_5.0 Long CS X X X X X X X X X X X Stata CSV
seda_county_long_gcs_5.0 Long GCS X X X X X X X X X X X Stata CSV
seda_county_poolsub_cs_5.0 Pooled CS X X X X X X X X X Stata CSV
seda_county_poolsub_gcs_5.0 Pooled GCS X X X X X X X X X Stata CSV
seda_county_pool_cs_5.0 Pooled CS X X X X X X X X Stata CSV
seda_county_pool_gcs_5.0 Pooled GCS X X X X X X X X Stata CSV
seda_metro_long_cs_5.0 Long CS X X X X X X X X X X X Stata CSV
seda_metro_long_gcs_5.0 Long GCS X X X X X X X X X X X Stata CSV
seda_metro_poolsub_cs_5.0 Pooled CS X X X X X X X X X Stata CSV
seda_metro_poolsub_gcs_5.0 Pooled GCS X X X X X X X X X Stata CSV
seda_metro_pool_cs_5.0 Pooled CS X X X X X X X X Stata CSV
seda_metro_pool_gcs_5.0 Pooled GCS X X X X X X X X Stata CSV
seda_commzone_long_cs_5.0 Long CS X X X X X X X X X X X Stata CSV
seda_commzone_long_gcs_5.0 Long GCS X X X X X X X X X X X Stata CSV
seda_commzone_poolsub_cs_5.0 Pooled CS X X X X X X X X X Stata CSV
seda_commzone_poolsub_gcs_5.0 Pooled GCS X X X X X X X X X Stata CSV
seda_commzone_pool_cs_5.0 Pooled CS X X X X X X X X Stata CSV
seda_commzone_pool_gcs_5.0 Pooled GCS X X X X X X X X Stata CSV
seda_state_long_cs_5.0 Long CS X X X X X X X X X X X Stata CSV
seda_state_long_gcs_5.0 Long GCS X X X X X X X X X X X Stata CSV
seda_state_poolsub_cs_5.0 Pooled CS X X X X X X X X X Stata CSV
seda_state_poolsub_gcs_5.0 Pooled GCS X X X X X X X X X Stata CSV
seda_state_pool_cs_5.0 Pooled CS X X X X X X X X Stata CSV
seda_state_pool_gcs_5.0 Pooled GCS X X X X X X X X Stata CSV

*Files were updated on 03/20/2024 to correct district names.
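
The file names in the table above appear to follow a `seda_<unit>_<form>_<metric>_<version>` pattern. The following is a minimal Python sketch, assuming that pattern holds for every test score file in Version 5.0; the pattern is inferred from the table rather than stated in the documentation, and `parse_seda_name` and `SedaFile` are hypothetical names introduced here for illustration only.

```python
import re
from typing import NamedTuple, Optional

# Codes as they appear in the Version 5.0 file names listed above.
# The mapping to the "Form" and "Metric" columns is inferred from the table.
UNITS = {"school", "geodist", "admindist", "county", "metro", "commzone", "state"}
FORMS = {"long": "Long", "poolsub": "Pooled", "pool": "Pooled"}
METRICS = {"cs": "CS", "gcs": "GCS"}


class SedaFile(NamedTuple):
    """Hypothetical container for the parts of a test score file name."""
    unit: str
    form: str
    metric: str
    version: str


# A trailing "*" (the footnote marker used in the table) is tolerated.
_PATTERN = re.compile(r"^seda_([a-z]+)_([a-z]+)_([a-z]+)_(\d+\.\d+)\*?$")


def parse_seda_name(name: str) -> Optional[SedaFile]:
    """Split a test score file name into unit, form, metric, and version.

    Returns None for names that do not fit the assumed pattern,
    such as covariate or crosswalk files.
    """
    match = _PATTERN.match(name.strip())
    if match is None:
        return None
    unit, form, metric, version = match.groups()
    if unit not in UNITS or form not in FORMS or metric not in METRICS:
        return None
    return SedaFile(unit, FORMS[form], METRICS[metric], version)


if __name__ == "__main__":
    print(parse_seda_name("seda_geodist_long_cs_5.0*"))
    print(parse_seda_name("seda_cov_school_pool_5.0"))
```

A helper like this can be used to check that a downloaded file matches the unit and metric intended for an analysis before loading it. Covariate and ancillary files use different name shapes and are deliberately rejected.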

Covariate Data
File Name Form Disaggregated by Download
Unit Year Grade
seda_cov_school_pool_5.0 Pooled X Stata CSV
seda_cov_school_annual_5.0 Pooled X X Stata CSV
seda_cov_geodist_long_5.0* Long X X X Stata CSV
seda_cov_geodist_annual_5.0* Pooled X X Stata CSV
seda_cov_geodist_pool_5.0* Pooled X Stata CSV
seda_cov_admindist_long_5.0* Long X X X Stata CSV
seda_cov_admindist_annual_5.0* Pooled X X Stata CSV
seda_cov_admindist_pool_5.0* Pooled X Stata CSV
seda_cov_county_long_5.0** Long X X X Stata CSV
seda_cov_county_annual_5.0** Pooled X X Stata CSV
seda_cov_county_pool_5.0 Pooled X Stata CSV
seda_cov_metro_long_5.0 Long X X X Stata CSV
seda_cov_metro_annual_5.0 Pooled X X Stata CSV
seda_cov_metro_pool_5.0 Pooled X Stata CSV
seda_cov_state_long_5.0 Long X X X Stata CSV
seda_cov_state_annual_5.0 Pooled X X Stata CSV
seda_cov_state_pool_5.0 Pooled X Stata CSV

*Files were updated on 03/20/2024 to correct district names.
**County covariate files were updated on 04/19/2024 to add 2019 data.

Ancillary Data
File Name Disaggregated by Download
School District Year
SEDA_crosswalk_5.0* X X Stata CSV
seda_shapefiles_2019_5.0 X Zip

*Files were updated on 03/20/2024 to correct district names.

SEDA 2022 (the data shown on the 2019-2022 Education Recovery Explorer) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Fahle, E. M., Ho, A. D., Shear, B. R., Kalogrides, D., Saliba, J., & Kane, T.J. (2022). Stanford Education Data Archive (Version SEDA 2022). Retrieved from http://purl.stanford.edu/jm728cq7283.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

SEDA 2022 Notes

The SEDA 2022 dataset for the 2019-2022 Education Recovery Explorer differs from other versions of SEDA. The currently available data include districts' average math and reading achievement in 2019 and in 2022, and the change in math and reading achievement between 2019 and 2022, all relative to the national average in grades 3-8 in 2019. Data are available for a subset of states; more states will be added as data become available.

Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process, which is distinct from the main version of SEDA, and describe the contents of each file.

Version SEDA 2022

This page contains two sets of files:

  1. Technical Documentation and Codebooks
  2. Test Score Estimates

Technical Documentation and Codebooks
File Name Download
SEDA_documentation PDF
SEDA_codebook_admindist Excel

Test Score Estimates
File Name Form Metric Disaggregated by Subgroups Download
      Administrative District Year Grade Subject Means    
              All Race Gender ECD    
SEDA2022_admindist_poolsub_YS Pooled YS X X   X X X   X Stata CSV
SEDA2022_admindist_poolsub_GYS Pooled GYS X X   X X X   X Stata CSV
SEDA2022_admindist_poolsub_NAEP Pooled NAEP X X   X X X   X Stata CSV

Covariate Data
File Name Form Disaggregated by
    Unit Year Grade
SEDA_cov_admindist_pool Pooled X  

Ancillary Data
File Name Disaggregated by Download
  School District Year    
SEDA_admincrosswalk X   X Stata CSV
SEDA_shapefiles   X   Stata CSV

SEDA 2022 2.0 (the data shown on the 2019-2022 Education Recovery Explorer) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Fahle, E. M., Ho, A. D., Shear, B. R., Kalogrides, D., Saliba, J., & Kane, T.J. (2023). Stanford Education Data Archive (Version SEDA 2022 2.0). Retrieved from http://purl.stanford.edu/dt080zr0625.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

SEDA 2022 2.0 Notes

The SEDA 2022 2.0 dataset for the 2019-2022 Education Recovery Explorer differs from other versions of SEDA. The currently available data include districts' average math and reading achievement in 2019 and in 2022, and the change in math and reading achievement between 2019 and 2022, all relative to the national average in grades 3-8 in 2019. Data are available for a subset of states.

Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process, which is distinct from the main version of SEDA, and describe the contents of each file.

Version SEDA 2022 2.0

This page contains two sets of files:

  1. Technical Documentation and Codebooks
  2. Test Score Estimates

Technical Documentation and Codebooks
File Name Download
SEDA2022_documentation_2.0 PDF
SEDA2022_codebook_admindist_2.0 Excel

Test Score Estimates
File Name Form Metric Disaggregated by Subgroups Download
      Administrative District Year Grade Subject Means    
              All Race Gender ECD    
SEDA2022_admindist_poolsub_YS_2.0 Pooled YS X X   X X X   X Stata CSV
SEDA2022_admindist_poolsub_GYS_2.0 Pooled GYS X X   X X X   X Stata CSV
SEDA2022_admindist_poolsub_NAEP_2.0 Pooled NAEP X X   X X X   X Stata CSV

Covariate Data
File Name Form Disaggregated by
    Unit Year Grade
SEDA_cov_admindist_pool Pooled X  

Ancillary Data
File Name Disaggregated by Download
  School District Year    
SEDA_admincrosswalk X   X Stata CSV
SEDA_shapefiles   X   Stata CSV

SEDA 2023 (the data shown on the 2019-2023 Education Recovery Explorer) includes a number of publicly available data files, the technical documentation and data codebooks, listed on the following page. Data files are available in Stata and .csv formats.

In publications, please cite the data as:

Reardon, S. F., Fahle, E. M., Ho, A. D., Shear, B. R., Min, J., Kalogrides, D., & Kane, T. J. (2024). Stanford Education Data Archive (Version SEDA 2023). Retrieved from https://purl.stanford.edu/xt779fj2637.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

SEDA 2023 Notes

The SEDA 2023 dataset for the 2019-2023 Education Recovery Explorer differs from other versions of SEDA. The currently available data include the change in math and reading achievement from 2019 to 2022, from 2022 to 2023, and from 2019 to 2023, relative to the national average in grades 3-8 in 2019. Data are available for administrative school districts and states.

Please review the technical documentation and codebooks that accompany the data sets. These documents review the data construction process, which is distinct from the main version of SEDA, and describe the contents of each file.

Version SEDA 2023

This page contains two sets of files:

  1. Technical Documentation and Codebooks
  2. Test Score Estimates

Technical Documentation and Codebooks
File Name Download
seda2023_documentation_20240130.pdf PDF
SEDA2023_codebook_admindist Excel
seda2023_codebook_state_updated_202402025 Excel

Test Score Estimates
File Name Form Metric Unit Disaggregated by Subgroups Download
      Administrative District State Year Grade Subject Means  
                All Race Gender ECD  
SEDA2023_admindist_poolsub_YS_updated_20240205 Pooled YS X   X   X X X   X Stata CSV
SEDA2023_admindist_poolsub_GYS_updated_20240205 Pooled GYS X   X   X X X   X Stata CSV
SEDA2023_state_poolsub_YS_updated_20240205 Pooled YS   X X   X X X   X Stata CSV
SEDA2023_state_poolsub_GYS_updated_20240205 Pooled GYS   X X   X X X   X Stata CSV

Covariate Data
File Name Form Disaggregated by Download
    Unit Year Grade  
seda2023_cov_admindist_annual Pooled X X   Stata CSV
seda2023_cov_state_annual Pooled X X   Stata CSV