A tale of 3 cities: model selection in over-, exact, and under-specified equations
Model selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-speci...
Main Authors: | , |
---|---|
Format: | Working paper |
Published: |
University of Oxford
2011
|
_version_ | 1797093916820897792 |
---|---|
author | Castle, J Hendry, D |
author_facet | Castle, J Hendry, D |
author_sort | Castle, J |
collection | OXFORD |
description | Model selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-specific (Gets) selection is to eliminate the irrelevant variables while retaining the relevant. In an exact specification, the theory formulation is precisely correct and can always be retained by 'forcing' during selection, but is nevertheless embedded in a broader model where possible omissions, breaks, non-linearity, or data contamination are checked. The most realistic case is where some aspects of the relevant DGP are correctly included, but some are omitted, leading to under-specification. We review the analysis of model selection procedures which allow for many relevant effects, but inadvertently omit others, yet irrelevant variables are also included in the GUM, and exploit the ability of automatic procedures to handle more variables than observations, and consequentially tackle perfect collinearity. Considering all of the possibilities - where it is not known which one obtains in practice - reveals that model selection can excel relative to just fitting a prior specification, yet has very low costs when an exact specification is correctly postulated initially. |
first_indexed | 2024-03-07T04:06:59Z |
format | Working paper |
id | oxford-uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b6 |
institution | University of Oxford |
last_indexed | 2024-03-07T04:06:59Z |
publishDate | 2011 |
publisher | University of Oxford |
record_format | dspace |
spelling | oxford-uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b62022-03-27T06:38:38ZA tale of 3 cities: model selection in over-, exact, and under-specified equationsWorking paperhttp://purl.org/coar/resource_type/c_8042uuid:c6813f78-b9ff-4a83-9dac-cf3405bfa8b6Bulk import via SwordSymplectic ElementsUniversity of Oxford2011Castle, JHendry, DModel selection from a general unrestricted model (GUM) can potentially confront three very different environments: over-, exact, and under-specification of the data generation process (DGP). In the first, and most-studied setting, the DGP is nested in the GUM, and the main role of general-to-specific (Gets) selection is to eliminate the irrelevant variables while retaining the relevant. In an exact specification, the theory formulation is precisely correct and can always be retained by 'forcing' during selection, but is nevertheless embedded in a broader model where possible omissions, breaks, non-linearity, or data contamination are checked. The most realistic case is where some aspects of the relevant DGP are correctly included, but some are omitted, leading to under-specification. We review the analysis of model selection procedures which allow for many relevant effects, but inadvertently omit others, yet irrelevant variables are also included in the GUM, and exploit the ability of automatic procedures to handle more variables than observations, and consequentially tackle perfect collinearity. Considering all of the possibilities - where it is not known which one obtains in practice - reveals that model selection can excel relative to just fitting a prior specification, yet has very low costs when an exact specification is correctly postulated initially. |
spellingShingle | Castle, J Hendry, D A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title | A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title_full | A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title_fullStr | A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title_full_unstemmed | A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title_short | A tale of 3 cities: model selection in over-, exact, and under-specified equations |
title_sort | tale of 3 cities model selection in over exact and under specified equations |
work_keys_str_mv | AT castlej ataleof3citiesmodelselectioninoverexactandunderspecifiedequations AT hendryd ataleof3citiesmodelselectioninoverexactandunderspecifiedequations AT castlej taleof3citiesmodelselectioninoverexactandunderspecifiedequations AT hendryd taleof3citiesmodelselectioninoverexactandunderspecifiedequations |