Learning and value function approximation in complex decision processes

Learning and value function approximation in complex decision processes

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998.

Մատենագիտական մանրամասներ
Հիմնական հեղինակ:	Van Roy, Benjamin
Այլ հեղինակներ:	John N. Tsitsiklis.
Ձևաչափ:	Թեզիս
Լեզու:	eng
Հրապարակվել է:	Massachusetts Institute of Technology 2005
Խորագրեր:	Electrical Engineering and Computer Science
Առցանց հասանելիություն:	http://hdl.handle.net/1721.1/9960

Նմանատիպ նյութեր

Approximate solution methods for partially observable Markov and semi-Markov decision processes
‌: Yu, Huizhen, Ph. D. Massachusetts Institute of Technology
Հրապարակվել է: (2007)

Making discrete decisions based on continuous values
‌: Sherman, Benjamin (Benjamin Marc)
Հրապարակվել է: (2017)

Multiple machine maintenance : applying a separable value function approximation to a variation of the multiarmed bandit
‌: Lin, Haixia, 1977-
Հրապարակվել է: (2014)

Realization and approximation of stationary stochastic processes
‌: Avniel, Yehuda
Հրապարակվել է: (2005)

Training hierarchical networks for function approximation
‌: Miranda, Brando, M. Eng. Massachusetts Institute of Technology
Հրապարակվել է: (2018)

Approximate value iteration approaches to constrained dynamic portfolio problems
‌: Wang, Alexander C. (Alexander Che-Wei)
Հրապարակվել է: (2006)

Representation and transfer learning using information-theoretic approximations
‌: Qiu, David.
Հրապարակվել է: (2020)

Low power digital filtering using adaptive approximate processing
‌: Ludwig, Jeffrey Thomas, 1968-
Հրապարակվել է: (2008)

Improving clinical decision making with natural language processing and machine learning
‌: Forsyth, Alexander William
Հրապարակվել է: (2017)

Networks, decisions, and outcomes : coordination with local information and the value of temporal data for learning influence networks
‌: Zoumpoulis, Spyridon Ilias
Հրապարակվել է: (2014)

Sparse approximations, iterative methods, and faster algorithms for matrices and graphs
‌: Cohen, Michael Benjamin
Հրապարակվել է: (2018)

Stochastic approximation for non-expansive maps : application to Q-learning algorithms
‌: Abounadi, Jinane, 1966-
Հրապարակվել է: (2005)

Local approximations of deep learning models for black-box adversarial attacks
‌: Sun, Michael(Michael Z.)
Հրապարակվել է: (2019)

Feature point detection and curve approximation for early processing of free-hand sketches
‌: Sezgin, Tevfik Metin, 1978-
Հրապարակվել է: (2014)

On approximating projection games
‌: Manurangsi, Pasin
Հրապարակվել է: (2016)

In-situ wafer uniformity estimation using principal component analysis and function approximation methods
‌: White, David A. (David Allan), 1966-
Հրապարակվել է: (2005)

View-dependent precomputed light transport using non-linear Gaussian function approximations
‌: Green, Paul Elijah
Հրապարակվել է: (2007)

Approximating the maximum acyclic subgraph
‌: Newman, Alantha
Հրապարակվել է: (2014)

Analysis of approximation and uncertainty in optimization
‌: Mastin, Dana Andrew
Հրապարակվել է: (2015)

Mechanism design with approximate types
‌: Zhu, Zeyuan Allen
Հրապարակվել է: (2012)

Approximate string joins with abbreviations
‌: Tao, Wenbo, Ph. D. Massachusetts Institute of Technology
Հրապարակվել է: (2018)

Simulation-based optimization of Markov decision processes
‌: Marbach, Peter, 1966-
Հրապարակվել է: (2005)

Interactions between learning and decision making
‌: Tulabandhula, Theja
Հրապարակվել է: (2015)

Approximation algorithms for disjoint paths problems
‌: Kleinberg, Jon M
Հրապարակվել է: (2005)

Perturbation stability for approximate MAP inference
‌: Lang, Hunter(Hunter J.)
Հրապարակվել է: (2019)

Approximate inference in Gaussian graphical models
‌: Malioutov, Dmitry M., 1981-
Հրապարակվել է: (2009)

Logical reasoning for approximate and unreliable computation
‌: Carbin, Michael (Michael James)
Հրապարակվել է: (2015)

Fast approximate hierarchical solution of MDPs
‌: Barry, Jennifer L. (Jennifer Lynn)
Հրապարակվել է: (2010)

Signal approximation using the bilinear transform
‌: Venkataraman, Archana, Ph. D. Massachusetts Institute of Technology
Հրապարակվել է: (2009)

Accuracy-aware optimization of approximate programs
‌: Misailović, Saša
Հրապարակվել է: (2016)

Approximation algorithms for stochastic scheduling problems
‌: Dean, Brian C. (Brian Christopher), 1975-
Հրապարակվել է: (2006)

Coherent approximation of distributed expert assessments
‌: Jones, Peter B., Ph.D. Massachusetts Institute of Technology
Հրապարակվել է: (2011)

Energy-efficient approximate computation in Topaz
‌: Achour, Sara
Հրապարակվել է: (2015)

Communication complexity of permutation-invariant functions
‌: Kamath, Pritish
Հրապարակվել է: (2015)

Kirchhoff approximation for rough surface scattering
‌: Mou, Alex.
Հրապարակվել է: (2024)

Feature-based methods for large scale dynamic programming
‌: Van Roy, Benjamin
Հրապարակվել է: (2005)

Average-case complexity of detecting cliques
‌: Rossman, Benjamin (Benjamin E.)
Հրապարակվել է: (2011)

A VLSI systolic array processor for complex singular value decomposition
‌: Niessen, Christopher Charles
Հրապարակվել է: (2006)

Generative temporal planning with complex processes
‌: Kennell, Jonathan, 1980-
Հրապարակվել է: (2005)

Modeling and optimizing quality for networks of approximate processors
‌: Secor, Matthew J. (Matthew Joelson)
Հրապարակվել է: (2005)