LMI For All

Documentation & Development

User Tools

Site Tools


Sidebar

Start Pages

Team Pages

Upcoming Events

Apr 24 Modding Day
data:hesa_dlhl

HESA DLHE Courses

  • Submitted by: Luke Bosworth, l.p.bosworth@warwick.ac.uk
  • Submitted on: 15/05/2014
  • Revision; 1

Data File(s)

This data load contains 2 files, which are as follows:

File Content Size (bytes)
35799_Data.txt Main Data 979,291
35799_Field_Order_&Labelling_File.xlsx Relevant description, including fields & columns 63,103
IPS_Terms_and_conditions_V1_7_1.pdf.xlsx Text document - Sets out the terms under conditions under which the data are provided 136,885
35799_Schedule.pdf Text document - Sets out the specific schedule to the agreement for the supply of data between HESA and UKCES 211,899
hesa spec v3.docx Text document - Sets out the specification of the data provided 29,278

Source Dataset

HESA Destination of Leavers survey 2011/12 and 2012/13. Copyright Higher Education Statistics Agency Limited 2014 (HESA).

Destination of Full-time UK and EU domiciled leavers in paid employment only from Higher education institutions 2011/12 by Standard occupational classification (4 digit) Level of qualification obtained (Doctorate, Masters, Other Postgraduate, First degree, Other undergraduate) Qualification required for job Subject of study (2 digit JACS)

Population and restriction definitions
  • Coverage - Destinations of Leavers Survey
  • Domicile - The students' domicile prior to commencement of the course.
  • Activity - Used to identify if the student has entered employment
  • Mode of study - Refers to the method by which the student is being taught i.e.Full-time/Part-time.
  • Rounding and suppression strategy - To be applied to secondary analysis for the purpose of data protection.

Field Definitions

Fields and Columns

  • _SOCDLHE2010 - Standard occupational classification
  • F_LEVEL - Level of qualification obtained
    1. DOC - Doctorate
    2. MAS - Masters
    3. OPG - Other Postgraduate
    4. FID - First degree
    5. OUG - Other undergraduate
  • F_QUALREQ - Qualification required for job
    1. 11 - Yes: the qualification was a formal requirement
    2. 12 - Yes: while the qualification was not a formal requirement it did give me an advantage
    3. 13 - No: the qualification was not required
    4. 14 - Don't know
    5. Unk - Unknown
  • F_XJACS201 - Subject of study
  • TOTAL - Number of cases (NB, this includes decimals since there is an apportionment of courses split between different areas).

HESA data

One of the key objectives of LMI for All is to provide information on entry routes for specific occupations. HESA DLHE data offers potentially valuable insights into the higher education subjects previously studied by entrants into particular occupations, addressing the key question: “what subjects do people study prior to taking up a specific job?”

Users are interested in the profile of subject studied among members of the DLHE population who are in work, broken down by the occupation of their employment destination. The following illustrative example shows how the analysis might be presented by applications linked to LMI for All:

“Recent leavers from higher education working in this occupation studied the following subjects”. It is proposed to limit the top three subjects studied in terms of the proportion of people working in the occupation.

Therefore the core requirement from the DLHE dataset is for data relating to DLHE population (including Qualifiers population marker) in employment by subject category (JACS) and occupation (SOC 2010).

Users are also interested in the level of HE qualification studied / achieved. For example, it is potentially useful to know that x per cent of leavers from HE working in a given occupation had qualified at postgraduate level (compared with an overall average of y per cent).

The focus is on leavers who had pursued a full-time course entry routes into occupations and Part-time learners are often already established within an occupation.

Occupation is classified to 4-digit SOC 2010 Unit group. Some unit groups are poorly-populated in terms of HE leavers, such as routine manual occupations, because they are not typical graduate destinations. For SOC unit groups for which there are fewer than 50 responses in the DLHE the API should deliver analysis relating to the parent 3-digit category. In those instances in which the number of responses remains below 50 we would then move up to the 2-digit category.

Subject studied is classified to the 2-digit Principal subject of study.

The data / analysis made available via LMI for All is at the overall UK level only.

Descriptions of the data

Leavers from Higher Education Institutions classified by:

  • Standard Occupational Classification (SOC 2010) of employment at 4-digit level
  • Principal subject of study
  • Qualifier flag
  • Level of qualification studied
  • Whether qualification required for job.

The population of leavers is restricted using the following criteria:

  • Leavers in paid employment only
  • Leavers who had pursued a full-time course only
  • UK and European domiciled students working in UK at time of survey.

Currently only one iteration of the DLHE survey is classified to SOC 2010 but that the next iteration of the data is due to become available in July 2014. In due course it is hoped to improve the statistical quality of the estimates by combining two years’ worth of data once the new dataset becomes available.

35799_Field_Order_&Labelling_File.xlsx

The file 35799_Field_Order_&Labelling_File.xlsx contains three sheets.

Notes: describes the data provided by HESA to UKCES in general terms, including Field definition and links.

Field_Order: Shows the orders of the fields in the txt file

Field_Labelling: gives the detailed labels used for each of the fields used (e.g. occupations, levels and types of qualifications required and subject/disciplines

Issues

Some occupations have a much stronger association with a specific course subject than others. For example, all doctors will have qualified in a relevant medical subject whereas corporate managers will have pursued a wider variety of subjects, with no single one dominating. The intention for LMI for All is simply to provide information on these patterns rather than presenting definite conclusions about formal entry requirements.

In those cases in which the profile of prior study is highly fragmented we could also apply a minimum threshold (e.g. a subject area is only presented if it accounts for a minimum of 10 per cent of respondents in the SOC category).

A key limitation of the DLHE data is that it only provides information about the initial destinations (six months after completion) of HE leavers rather than their employment activity in the medium to longer term.

Restricting Areas

To ensure that HESA is happy with the presentation of the data via the API initially areas to the data is restricted using an API key giving HESA the opportunity to review the dataset prior to its public release.

HESA will be given the opportunity to review the API guidance documentation relating to the HESA data in draft form prior to the publication of the data.

Permitted Purposes

Data may only be used to generate aggregate statistics for an on-line data portal ‘LMI for All’ which will allow third parties to obtain information on entry routes for specific occupations via an API.

Any data shared from the portal must be rounded to the nearest 5 according to the standard rounding methodology as outlined in special condition 1. Any data released via the API must relate to 50 or more leavers. The Data may not be made available in its entirety via the API.

Publication/Reproduction of the Data

Any reproduction or publishing of Data, subject to the above Permitted Purposes, must adhere to the HESA Services Standard Rounding Methodology.

All statistics published should be at a level of anonymisation and aggregation which will ensure that no Personal Data or Sensitive Personal Data are published, and thereby ensure the confidentiality of individuals.

HESA Services Standard Rounding Methodology:

  1. 0, 1, 2 must be rounded to 0
  2. All other numbers must be rounded to the nearest multiple of 5
  3. Percentages based on 52 or fewer individuals must be suppressed
  4. Averages based on 7 or fewer individuals must be suppressed
  5. Full-Time Equivalent data does not require rounding.
data/hesa_dlhl.txt · Last modified: 2014-07-02 14:46 by Luke Bosworth