Texas Schools Project

Below you will find documentation for the UT Dallas Education Research Center, Texas Education Agency, and Texas Higher Education Coordinating Board data. To learn more about the data, please review the documents below. As you click on the relevant documents, they will open in a separate window.
(Last updated on April, 2024)

UT Dallas ERC Documentation:

Texas Education Agency

PEIMS Data Documentation:

Testing Data Documentation:

Texas Higher Education Coordinating Board

Texas Public Universities Documentation:

Texas Health Related Institutions Documentation:

Texas Community, Technical, and State Colleges Documentation:

Texas Independent Colleges & Universities Documentation:

Financial Aid Database Documentation:

Appendices for THECB Manuals:

Documentation summarizing differences in the manuals related to Texas Public Universities, Texas Community, Technical, State Colleges and Texas Health-Related Institutions can be found in:

UT Dallas ERC Synthetic Data:

Our synthetic data files contain artificially generated values that mimic the structure of our ERC data files.

If you are interested in using our synthetic data, please send a request and a half-page abstract of your proposed project to  TSP leadership team. Please note that it is expected that researchers provided access to the synthetic data will submit a proposal to the ERC Advisory Board to conduct their project at the UTD-ERC should their project prove feasible. 

Objectives of the synthetic data:

  • Help potential researchers understand the structure of the data
  • Prepare the analytical programs in advance

Description of the synthetic data:

  • All the observations in the real data have been truncated completely before generating synthetic data.
  • The synthetic data is generated based on our current data holdings.
  • The directory tree structure for the synthetic data is same as the real data.
  • All the synthetic data files are STATA data format.
  • Number of observations
    • If number of observation is greater than or equal to 1000, then in synthetic data, the number of observations will be 1% of real data.
    • If number of observation is less than 1000 and greater than or equal to 10, then in synthetic data, the number of observations will be 10% of real data.
    • If number of observation is less than 10, then in synthetic data, the number of observations is same as real data.
  • Each synthetic data file has the same name as the corresponding real data.
  • Every variable has the same variable name, same storage type (numeric, string), and same variable label as the real data.
  • Every categorical variable has the same value labels as the real data.
  • The value of a string variable is “the name of this data file & the name of this variable”.

UT Dallas Non-ERC Public Data:

For questions, contact Mark Lu, data manager.