A tale of 3 cities: model selection in over-, exact, and under-specified equations

Model selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-speci...

Full description

Bibliographic Details
Main Authors: Castle, J, Hendry, D
Format: Working paper
Published: University of Oxford 2011
_version_ 1797093916820897792
author Castle, J
Hendry, D
author_facet Castle, J
Hendry, D
author_sort Castle, J
collection OXFORD
description Model selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-specific (Gets) selection is to eliminate the irrelevant variables while retaining the relevant. In an exact specification, the theory formulation is precisely correct and can always be retained by 'forcing' during selection, but is nevertheless embedded in a broader model where possible omissions, breaks, non-linearity, or data contamination are checked. The most realistic case is where some aspects of the relevant DGP are correctly included, but some are omitted, leading to under-specification. We review the analysis of model selection procedures which allow for many relevant effects, but inadvertently omit others, yet irrelevant variables are also included in the GUM, and exploit the ability of automatic procedures to handle more variables than observations, and consequentially tackle perfect collinearity. Considering all of the possibilities - where it is not known which one obtains in practice - reveals that model selection can excel relative to just fitting a prior specification, yet has very low costs when an exact specification is correctly postulated initially.
first_indexed 2024-03-07T04:06:59Z
format Working paper
id oxford-uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b6
institution University of Oxford
last_indexed 2024-03-07T04:06:59Z
publishDate 2011
publisher University of Oxford
record_format dspace
spelling oxford-uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b62022-03-27T06:38:38ZA tale of 3 cities: model selection in over-, exact, and under-specified equationsWorking paperhttp://purl.org/coar/resource_type/c_8042uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b6Bulk import via SwordSymplectic ElementsUniversity of Oxford2011Castle, JHendry, DModel selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-specific (Gets) selection is to eliminate the irrelevant variables while retaining the relevant. In an exact specification, the theory formulation is precisely correct and can always be retained by 'forcing' during selection, but is nevertheless embedded in a broader model where possible omissions, breaks, non-linearity, or data contamination are checked. The most realistic case is where some aspects of the relevant DGP are correctly included, but some are omitted, leading to under-specification. We review the analysis of model selection procedures which allow for many relevant effects, but inadvertently omit others, yet irrelevant variables are also included in the GUM, and exploit the ability of automatic procedures to handle more variables than observations, and consequentially tackle perfect collinearity. Considering all of the possibilities - where it is not known which one obtains in practice - reveals that model selection can excel relative to just fitting a prior specification, yet has very low costs when an exact specification is correctly postulated initially.
spellingShingle Castle, J
Hendry, D
A tale of 3 cities: model selection in over-, exact, and under-specified equations
title A tale of 3 cities: model selection in over-, exact, and under-specified equations
title_full A tale of 3 cities: model selection in over-, exact, and under-specified equations
title_fullStr A tale of 3 cities: model selection in over-, exact, and under-specified equations
title_full_unstemmed A tale of 3 cities: model selection in over-, exact, and under-specified equations
title_short A tale of 3 cities: model selection in over-, exact, and under-specified equations
title_sort tale of 3 cities model selection in over exact and under specified equations
work_keys_str_mv AT castlej ataleof3citiesmodelselectioninoverexactandunderspecifiedequations
AT hendryd ataleof3citiesmodelselectioninoverexactandunderspecifiedequations
AT castlej taleof3citiesmodelselectioninoverexactandunderspecifiedequations
AT hendryd taleof3citiesmodelselectioninoverexactandunderspecifiedequations