readme lcfs

Upload: two1frets

Post on 10-Mar-2016

243 views

Category:

Documents


0 download

DESCRIPTION

Data compilation

TRANSCRIPT

  • The UK Data Archive

    The archive consists of data from four sources: the British Household Panel

    Survey (BHPS), the Expenditure and Food Survey (formerly the Food and

    Expenditure Survey, the FES), the Labour Force Survey (LFS) and the Survey

    of Personal Incomes (SPI).

    Access to the microdata in any of these datasets is restricted to registered

    users. Researchers wishing to use the data must register a project with the

    UK data archive at http://www.esds.ac.uk/aandp/access/access.asp and regis-

    ter use of each dataset. When access has been granted users can then write to

    me, Ben Etheridge at [email protected]; I can then send the cleaned datasets

    used in this project. In practice, access should be a formality for academic

    researchers.

    The rest of the archive contains all the stata and matlab codes and docu-

    mentation needed to process the data and reproduce results used in the paper.

    Details of Each Dataset

    BHPS

    Data and documentation for the main BHPS dataset can be downloaded from

    the UK data archive. Derived net household income data can be downloaded

    here (code SN3909). The codes supplied can be applied to the raw, downloaded

    data or researchers can e-mail me to obtain the compiled ready-to-use dataset,

    panellong.dta. We advise the latter route.

    Readme_BHPS.txt in the BHPS folder contains more information on run-

    ning the codes once the raw/compiled data have been obtained.

    Contents of Compiled Dataset - panellong.dta:

    obs: 132,309

    vars: 53

    size: 22,624,839

    -

    storage display

    variable name type format value label variable label

    1

  • -

    hid long %12.0g household identication number

    pno byte %8.0g person number

    eaage byte %8.0g aeaage whether working age

    jbhas byte %8.0g ajbhas did paid work last week

    jbsoc int %8.0g ajbsoc occupation (soc): current main

    job

    jbsic int %8.0g ajbsic industry (sic) of employer:

    current job

    jbsemp byte %8.0g ajbsemp employee or self-employed:

    current job

    jbhrs byte %8.0g ajbhrs no. of hours normally worked

    per week

    jshrs byte %8.0g ajshrs s/emp: hours normally worked

    per week

    jbstat byte %8.0g ajbstat current labour force status

    race byte %8.0g arace ethnic group membership

    pid long %12.0g cross-wave person identier

    xrwght double %10.0g x-sectional respondent weight

    xewght double %10.0g x-sectional enumerated

    individual weight

    mastat byte %8.0g amastat marital status

    age int %8.0g aage age at date of interview

    qfedhi byte %8.0g aqfedhi highest educational

    qualication

    qfachi byte %8.0g aqfachi highest academic qualication

    paygu double %10.0g apaygu usual gross pay per month:

    current job

    jsprof double %10.0g ajsprof monthly self employed prot

    jspayg double %10.0g ajspayg monthly self employed gross pay

    yrl double %10.0g ayrl annual labour income

    (1.9.90-1.9.91)

    paygui byte %8.0g apaygui imputation ag - apaygu

    (derived)

    yrli byte %8.0g ayrli imputation ag - ayrl

    (derived)

    hgr2r byte %8.0g ahgr2r relationship to reference person

    hgsex byte %8.0g ahgsex sex - hh grid

    butype byte %8.0g abutype benet unit type

    buno byte %8.0g abuno benet unit number

    hoh byte %8.0g ahoh conventional head of household

    indicator

    hdemp double %10.0g hd empl earn

    2

  • spemp double %10.0g sp empl earn

    hdse double %10.0g hd se earn

    spse double %10.0g sp se earn

    grpay double %10.0g hh gross labour earnings

    hhp oat %9.0g hh pension income

    hhneti oat %9.0g hh net income

    xpfood int %8.0g axpfood total weekly food and grocery

    bill

    hhwght double %10.0g household weight

    hhsize byte %8.0g ahhsize number of persons in household

    region byte %8.0g aregion region / metropolitan area

    nch02 byte %8.0g anch02 number children in household

    aged 0-2

    nch34 byte %8.0g anch34 number children in household

    aged 3-4

    nch511 byte %8.0g anch511 number children in household

    aged 5-11

    nch1215 byte %8.0g anch1215 number children in household

    aged 12-15

    nch1618 byte %8.0g anch1618 number dependent children in hh

    16+

    nkids byte %8.0g ankids number of children in household

    ncouple byte %8.0g ancouple number of couples in household

    nwage byte %8.0g anwage number in household of working

    age

    year oat %9.0g

    xhwght double %10.0g household weight

    jbsic92 int %8.0g industry(sic92) of employer:

    current job

    racel byte %8.0g mracel ethnic group membership (long

    version)

    FES

    Data and documentation can be downloaded from e.g: here. The codes supplied

    can be applied to the raw individual income data or researchers can e-mail

    Ben Etheridge to obtain the compiled dataset, merged_inc_ind.dta. The raw

    household expenditure data are extremely detailed and have been compiled and

    cleaned over many years by researchers at the IFS. To protect this intellectual

    capital we provide the household data at a more advanced stage of processing,

    after they have been aggregated to total household non-durable expenditure and

    after some data trimming. These data are contained in merged_hh_resid.dta.

    For all data, we advise researchers to use the dataset created for this project.

    Readme_FES.txt in the FES folder contains more information on running

    3

  • the codes once the raw/compiled data have been obtained.

    Contents of Compiled Dataset - merged_inc_ind.dta:

    obs: 370,818

    vars: 37

    size: 66,747,240

    -

    storage display

    variable name type format value label variable label

    -

    persno double %9.0g Internal IFS Person Number

    hhref double %9.0g Household reference

    year long %9.0g Year of interview

    week double %9.0g Week of interview

    W oat %9.0g Wage

    L oat %9.0g Hours worked

    YE_ind oat %9.0g Income from employment

    YS_ind oat %9.0g Gross income from

    self-employment

    Y_ind_L oat %9.0g Labour income

    Y_ind_Lp oat %9.0g Labour income plus private

    transfers

    Y_ind_A oat %9.0g Financial asset income

    Y_ind_Ap oat %9.0g Financial asset income plus

    real (non-imputed) housing

    asset income

    Y_ind oat %9.0g Gross income

    Y_ind_D oat %9.0g Net income

    Y_ind_O oat %9.0g ONS-comparable income

    Y_ind_mTa oat %9.0g Gross income minus taxes

    Y_ind_pTr oat %9.0g Gross income plus transfers

    yrm oat %9.0g Year and month

    numads double %9.0g HH:NumAdults

    numhhkid double %9.0g HH:NumChildren

    grosfac oat %9.0g Sampling weights

    age long %9.0g Ad:Age

    sex long %9.0g sex_l Ad:Sex

    marstat long %38.0g marstat Ad:Marital Status

    occ double %12.0g occ Ad:Occupation

    emppos long %45.0g emppos Ad:Employment Position

    hours double %9.0g Ad:Hours of Work

    4

  • region oat %9.0g ssr Standard Statistical Region

    (approx from 2001)

    jbhas oat %24.0g newjh Employment status

    hoh_red oat %21.0g hoh_label

    Head-of-household type

    spouse oat %9.0g is a spouse

    num_ads_25_60 oat %9.0g Number of adults aged 25-60

    num_ads_25_6~FT oat %9.0g Number of adults aged 25-60

    working full time

    num_ads_25_6~PT oat %9.0g Number of adults aged 25-60

    working part time

    hh_weights oat %9.0g Mod'ed-OECD equiv. factor

    weighta oat %9.0g Better Samp. weights after

    2001q2

    educ_att oat %11.0g educ_label

    Educational attainment

    Contents of Compiled Dataset - merged_hh_resid.dta:

    obs: 125,540

    vars: 96

    size: 51,220,320

    -

    storage display

    variable name type format value label variable label

    -

    hhref double %8.0g Household reference

    week oat %8.0g Week of interview

    yrm double %9.0g Year and month

    year long %9.0g Year of interview

    region oat %9.0g ssr Standard Statistical Region

    (approx from 2001)

    hoh_red oat %21.0g hoh_lab el

    Head-of-household type

    age_bands oat %9.0g age_lab el_2

    age_h oat %9.0g Age of head

    educ_h oat %9.0g Education of head

    marstat_h oat %38.0g marstat _l

    Marital status of head

    hh_weights oat %9.0g Mod'ed-OECD equiv. factor

    price oat %9.0g CPI price deator

    5

  • num_ads_25_60 oat %9.0g Number of adults aged 25-60

    num_ads_25_6~FT oat %9.0g Number of adults aged 25-60

    working full time

    num_ads_25_6~PT oat %9.0g Number of adults aged 25-60

    working part time

    W_h oat %9.0g Wage of head

    W_s oat %9.0g Wage of spouse

    L_h oat %9.0g Hours of head

    L_s oat %9.0g Hours of spouse

    Y_L oat %9.0g Labour income

    Y_Lp oat %9.0g Labour income plus private

    transfers

    Y_A oat %9.0g Financial asset income

    Y_Ap oat %9.0g Financial asset income plus

    real (non-imputed) housing

    asset income

    Y oat %9.0g Gross income

    Y_D oat %9.0g Net income

    C oat %9.0g Non-durable consumption

    C_p oat %9.0g Non-durable consumption plus

    real housing costs

    C_COMP_ONS_NOM oat %9.0g ONS-comparable consumption

    samp_select oat %9.0g RED sample selection ind.

    numads double %9.0g HH:NumAdults

    numhhkid double %9.0g HH:NumChildren

    grosfac oat %9.0g Sampling weights

    sex long %9.0g sex_l Ad:Sex

    occ double %12.0g occ Ad:Occupation

    weighta oat %9.0g Better Samp. weights after

    2001q2

    Y_h oat %9.0g Head labour earnings

    Y_s oat %9.0g Spouse labour earnings

    P_h oat %9.0g Head working

    P_s oat %9.0g Spouse working

    FT_h oat %9.0g Head working full time

    FT_s oat %9.0g Spouse working full time

    Y_O oat %9.0g ONS-comparable income

    Y_mTa oat %9.0g Gross income minus taxes

    Y_pTr oat %9.0g Gross income plus transfers

    lY_h oat %9.0g Log head labour earnings

    lY_s oat %9.0g Log spouse labour earnings

    lW_h oat %9.0g Log wage of head

    lW_s oat %9.0g Log wage of spouse

    6

  • lL_h oat %9.0g Log hours of head

    lL_s oat %9.0g Log hours of spouse

    lY_L oat %9.0g Log labour income

    lY_Lp oat %9.0g Log labour income plus private

    transfers

    lY_A oat %9.0g Log nancial asset income

    lY_Ap oat %9.0g Log nancial asset income plus

    real (non-imputed) housing

    asset income

    lY oat %9.0g Log gross income

    lY_D oat %9.0g Log net income

    lY_mTa oat %9.0g Log gross income minus taxes

    lY_pTr oat %9.0g Log gross income plus transfers

    lC oat %9.0g Log non-durable consumption

    lC_p oat %9.0g Log non-durable consumption

    plus real housing costs

    Y_L_eq oat %9.0g Equiv'd labour income

    Y_Lp_eq oat %9.0g Equiv'd labour income plus

    private transfers

    Y_A_eq oat %9.0g Equiv'd nancial asset income

    Y_Ap_eq oat %9.0g Equiv'd nancial asset income

    plus real (non-imputed) housing

    asset income

    Y_eq oat %9.0g Equiv'd gross income

    Y_D_eq oat %9.0g Equiv'd net income

    Y_mTa_eq oat %9.0g Equiv'd gross income minus taxes

    Y_pTr_eq oat %9.0g Equiv'd gross income plus

    transfers

    C_eq oat %9.0g Equiv'd non-durable consumption

    C_p_eq oat %9.0g Equiv'd non-durable consumption

    plus real housing costs

    lY_L_eq oat %9.0g Equiv'd log labour income

    lY_Lp_eq oat %9.0g Equiv'd log labour income plus

    private transfers

    lY_A_eq oat %9.0g Equiv'd log nancial asset

    income

    lY_Ap_eq oat %9.0g Equiv'd log nancial asset

    income plus real (non-imputed)

    housing asset income

    lY_eq oat %9.0g Equiv'd log gross income

    lY_D_eq oat %9.0g Equiv'd log net income

    lY_mTa_eq oat %9.0g Equiv'd log gross income minus

    taxes

    7

  • lY_pTr_eq oat %9.0g Equiv'd log gross income plus

    transfers

    lC_eq oat %9.0g Equiv'd log non-durable

    consumption

    lC_p_eq oat %9.0g Equiv'd log non-durable

    consumption plus real housing

    costs

    lnumhh oat %9.0g Log size of household

    lW_h_resid oat %9.0g Residual log wage of head

    lW_s_resid oat %9.0g Residual log wage of spouse

    lL_h_resid oat %9.0g Residual log hours of head

    lL_s_resid oat %9.0g Residual log hours of spouse

    lY_h_resid oat %9.0g Residual log head labour

    earnings

    lY_L_resid oat %9.0g Residual log labour income

    lY_Lp_resid oat %9.0g Residual log labour income plus

    private transfers

    lY_A_resid oat %9.0g Residual log nancial asset

    income

    lY_Ap_resid oat %9.0g Residual log nancial asset

    income plus real (non-imputed)

    housing asset income

    lY_resid oat %9.0g Residual log gross income

    lY_D_resid oat %9.0g Residual log net income

    lC_resid oat %9.0g Residual log non-durable

    consumption

    lC_p_resid oat %9.0g Residual log non-durable

    consumption plus real housing

    costs

    lY_mTa_resid oat %9.0g Residual log gross income minus

    taxes

    lY_pTr_resid oat %9.0g Residual log gross income plus

    transfers

    LFS

    Information about the LFS can be found here. The LFS is not a core database

    in the analysis and use of the data depends on much eort supplied by other

    researchers at the IFS. Therefore, we provide fewer microdata from this survey.

    Readme_LFS.txt in the LFS folder contains more information on the data

    and results from the LFS.

    Contents of Compiled Dataset - lfs_7508.dta:

    8

  • obs: 11,528,898

    vars: 5

    size: 161,404,572

    -

    storage display

    variable name type format value label variable label

    -

    year int %9.0g Year of interview

    sex byte %4.0g Sex

    age int %9.0g Age (at interview)

    econstat byte %14.0g econstat Economic status

    weight oat %8.0g LFS Weight

    SPI

    Data and documentation can be downloaded here. The codes supplied can be

    applied to the raw, downloaded data or researchers can e-mail me to obtain the

    compiled ready-to-use datasets - e.g. spi1996.dta.

    Readme_SPI.txt in the SPI folder contains more information on the data

    and results from the SPI.

    Contents of Compiled Dataset - spi1996.dta:

    obs: 61,020

    vars: 51

    size: 9,397,080

    -

    storage display

    variable name type format value label variable label

    -

    sex byte %8.0g indicator of sex

    marital byte %8.0g indicator of marital status

    sref long %12.0g statistical reference number

    st byte %8.0g sex and married status

    mcatype byte %8.0g married couples allowance

    indicator

    wbadue int %8.0g widows bereavement allowance due

    apadue int %8.0g additional personal allowance

    due

    bpadue int %8.0g blind persons allowance due

    pay long %12.0g pay from employment (net ben &

    9

  • foreign)

    taxterm long %12.0g taxable pay on termination of

    employment

    epb long %12.0g gross taxable exps, pays and

    benets

    srp int %8.0g total state pensions

    pension long %12.0g occupational, personal, other

    pensions

    ubisja int %8.0g unemployment ben, income sup,

    jobseeker

    incpben int %8.0g taxable incapacity benet

    ossben int %8.0g other taxable soc. sec. benets

    incprop long %12.0g net income from property (uk

    and other)

    incbbs long %12.0g interest from uk banks etc

    gdivs long %12.0g gross dividends, uk companies &

    unit trs

    oth20pc long %12.0g other interest & divs charged @

    20%

    mothinc long %12.0g other income not shown elsewhere

    omortal int %8.0g home loans outside of miras

    oldmrlf long %12.0g relief for maint paid under old

    rules

    covnts long %12.0g covenanted payments to charities

    fact double %10.0g factor to gross sample to uk

    level

    ti long %12.0g total income

    mainsrc byte %8.0g main source indicator

    mcatran int %8.0g married allowance transferred

    to spouse

    exps long %12.0g deductions for expenses from

    employment

    newmrlf int %8.0g relief for maint paid under new

    rules

    giftaid long %12.0g gift aid payments to charities

    mothded long %12.0g other charges, deductions,

    reliefs

    lossbf long %12.0g losses brought forward against

    self emp

    capall long %12.0g capital allowances for self emp

    prots long %12.0g gross adjsuted prots, self emp

    dships byte %8.0g directorship indicator

    mcadue int %8.0g married couples allowance due

    10

  • eidf long %12.0g earned income deductions (full

    relief)

    iidf long %12.0g invest income deductions (full

    relief)

    pas int %8.0g personal allowance, aged, non,

    super

    mcas int %8.0g sum of mca, wna, bpa

    othalls int %8.0g other allowances due to case

    taxinc long %12.0g amount of taxable income

    mar byte %8.0g marginal rate of tax indicator

    totax long %12.0g total tax liability less tax

    credits

    tei long %12.0g total earned income

    tii long %12.0g total investment income

    gorcode byte %8.0g government oce region

    industry byte %8.0g

    pensrlf long %12.0g rar+gppens+nppens+ravc+supann

    aged byte %8.0g

    11