LMI For All

Documentation & Development

User Tools

Site Tools


Sidebar

Start Pages

Team Pages

Upcoming Events

Apr 24 Modding Day
data:data_integration

Region

We plan to use these as basis:

id region code
1 London UKI
2 South East (England) UKJ
3 East of England UKH
4 South West (England) UKK
5 West Midlands (England) UKG
6 East Midlands (England) UKF
7 Yorkshire and the Humber UKE
8 North West (England) UKD
9 North East (England) UKC
10 Wales UKL
11 Scotland UKM
12 Northern Ireland UKN

Status of datasets:

DatasetRemarks
LFScorrect
ESScorrect
WFunknown what is unknown? IT MATCHES THE REST? RAW 11/07/2014
ONETNA
EarningsNeed to check if recoding is straightforward Recoding of what? AGAIN IT MATCHES THE REST? RAW 11/07/2014

Gender

We plan to use these as basis:

id region
1 Male
2 Female

Status of datasets:

DatasetRemarks
LFScorrect
ESSNA
WFcorrect
ONETNA
Earningscorrect

SOC

We plan to use SOC2010 as basis:

Status of datasets:

DatasetRemarks
LFScorrect, older data has been recalculated 2000→2010
ESSneed to check if this is soc2010 I think so but Dave to answer definitively
WFcorrect
ONETcorrect, but will be 27.000 titles in future
Earningscorrect

Note: we need the 27.000 jobtitles including mapping to soc2010

Industry

We need a standard as a basis. Will it be the 79, with exception for WF that uses 75??? IT IS ALL BASED ON THE 75 INDUSTRIES NOW? RAW 11/07/2014

Status of datasets:

DatasetRemarks
LFShas 88 rows will be aggregated to 75 DONE RAW 11/07/2014
ESShas 79 rows could be aggregated to 75 (Dave to check feasibility) DONE RAW 11/07/2014
WFhas 75 rows
ONETNA
Earningshas 79 rows will be aggregated to 75DONE RAW 11/07/2014

Qualifications

We have NQF8 and NQF5, they map as follows:

NQF8NQF5
11
21
32
42
52
63
74
85
96

Employment is classified into 9 broad qualification categories and 6 aggregated qualification groups:

id QCF id QCF aggregated
1 QCF8 Doctorate 1 QCF 7-8 Higher degree
2 QCF7 Other higher degree
3 QCF6 First degree 2 QCF 4-6 First degree & other HE
4 QCF5 Foundation degree;Nursing;Teaching
5 QCF4 HE below degree level
6 QCF3 A level & equivalent 3 QCF 3 A level & equivalent
7 QCF2 GCSE(A-C) & equivalent 4 QCF 2 GCSE(A-C) & equivalent
8 QCF1 GCSE(below grade C) & equivalent 5 QCF 1 GCSE(below grade C) & equivalent
9 No qualification 6 No qualification

Earning uses NQF5 with labels NOW USES NQF8 RAW 11/07/2014

idqualification
1 degree or equivalent
2 Higher edu
3 GCE A Level or equiv
4 GCSE grades A-C or equiv
5 Other qualifications
6 No qualification

How will we do this? It looks like most data is using NQF5 and only LFS (and possibly WF) is using the richer NQF8… Probably better to standardize on NQF5-0 and ignore NQF8-0 IN FACT WE HAVE STANDARDISED ON NQF8-0 RAW 11/07/2014

DatasetRemarks
LFSboth NQF5 and NQF8
ESSNQF5
WFunknown, looks like NQF8
ONETNA
EarningsNQF5
data/data_integration.txt · Last modified: 2014-07-30 10:52 by Luke Bosworth