Batch-iFDD for representation expansion in large MDPs

Matching pursuit (MP) methods are a promising class of feature construction algorithms for value function approximation. Yet existing MP methods require creating a pool of potential features, mandating expert knowledge or enumeration of a large feature pool, both of which hinder scalability. This pa...

Bibliographic Details
Main Authors:	Geramifard, Alborz; Walsh, Thomas J.; Roy, Nicholas; How, Jonathan P.
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format:	Article
Language:	en_US
Published:	Association for Uncertainty in Artificial Intelligence (AUAI) 2015
Online Access:	http://hdl.handle.net/1721.1/97035 https://orcid.org/0000-0001-8576-1930 https://orcid.org/0000-0002-2508-1957 https://orcid.org/0000-0002-8293-0492

Similar Items

Practical reinforcement learning using representation learning and safe exploration for large scale Markov decision processes
by: Geramifard, Alborz, 1980-
Published: (2012)

Combining dynamic abstractions in large MDPs
by: Steinkraus, Kurt, et al.
Published: (2005)

UAV Cooperative Control with Stochastic Risk Models
by: Geramifard, Alborz, et al.
Published: (2013)

Markov decision processes in artificial intelligence : MDPs, beyond MDPs and applications /
by: Sigaud, Olivier, et al.
Published: (2010)

Transience in countable MDPs
by: Kiefer, SM, et al.
Published: (2021)

Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning
by: Geramifard, Alborz, et al.
Published: (2013)

Parity objectives in countable MDPs
by: Kiefer, S, et al.
Published: (2017)

Büchi objectives in countable MDPs
by: Kiefer, S, et al.
Published: (2019)

Social Interactions as Recursive MDPs
by: Tejwani, Ravi, et al.
Published: (2022)

Reinforcement learning with misspecified model classes
by: Joseph, Joshua Mason, et al.
Published: (2015)

Planning with hidden parameter polynomial MDPs
by: Costen, C, et al.
Published: (2023)

Invariant causal prediction for block MDPs
by: Zhang, A, et al.
Published: (2020)

Fast approximate hierarchical solution of MDPs
by: Barry, Jennifer L. (Jennifer Lynn)
Published: (2010)

Incorporating Rich Social Interactions Into MDPs
by: Tejwani, Ravi, et al.
Published: (2022)

Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery
by: Geramifard, Alborz, et al.
Published: (2013)

Actor-Critic Policy Learning in Cooperative Planning
by: Redding, Joshua, et al.
Published: (2013)

Blind transport format detection for WCDMA (FDD)
by: Sathiavageeswaran Karthik.
Published: (2008)

Strategy complexity of parity objectives in countable MDPs
by: Kiefer, S, et al.
Published: (2020)

Planning for risk-aversion and expected value in MDPs
by: Rigter, M, et al.
Published: (2022)

Solving Dec-MDPs with options and intention recognition
by: Cruz, Gabriel, M. Eng. Massachusetts Institute of Technology
Published: (2016)

An Empirical Study on Channel Reciprocity in TDD and FDD Systems
by: Huixin Xu, et al.
Published: (2024-01-01)

Optimization of resource allocation in FDD massive MIMO systems
by: Jun Cai, et al.
Published: (2024-02-01)

Analysis of scalable channel estimation in FDD massive MIMO
by: Xing Zhang, et al.
Published: (2023-03-01)

Solving Finite-Horizon Discounted Non-Stationary MDPS
by: Bouchra El Akraoui, et al.
Published: (2023-06-01)

Adaptive Envelope MDPs for Relational Equivalence-based Planning
by: Gardiol, Natalia H., et al.
Published: (2008)

Probabilistic Bisimulations for PCTL Model Checking of Interval MDPs
by: Vahid Hashemi, et al.
Published: (2014-03-01)

NP-Hardness of checking the unichain condition in average cost MDPs
by: Tsitsiklis, John N.
Published: (2012)

Rlpy: A Value-Function-Based Reinforcement Learning Framework for Education and Research
by: Dann, Christoph, et al.
Published: (2016)

An intelligent cooperative control architecture
by: Redding, Josh, et al.
Published: (2010)

On Scalability of FDD-Based Cell-Free Massive MIMO Framework
by: Beenish Hassan, et al.
Published: (2023-08-01)

FDD in Building Systems Based on Generalized Machine Learning Approaches
by: William Nelson, et al.
Published: (2023-02-01)

Adaptive training‐feedback scheme for FDD in massive MIMO systems
by: Yi Huang, et al.
Published: (2023-03-01)

Mixed observability MDPs for shared autonomy with uncertain human behaviour
by: Costen, C, et al.
Published: (2021)

Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty
by: Adulyasak, Yossiri, et al.
Published: (2018)

3G, HSDPA, HSUPA and FDD versus TDD networking : smart antennas and adaptive modulation /
by: Hanzo, Lajos, 1952-, et al.
Published: (2008)

CSI Feedback Model Based on Multi-Source Characterization in FDD Systems
by: Fei Pan, et al.
Published: (2023-09-01)

Effects of Mitochondrial-Derived Peptides (MDPs) on Mitochondrial and Cellular Health in AMD
by: Sonali Nashine, et al.
Published: (2020-04-01)

Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles
by: Golowich, Noah, et al.
Published: (2024)

Downlink training design for FDD massive MIMO systems in the presence of colored noise
by: Naser, Marwah Abdulrazzaq, et al.
Published: (2020)

Downlink Training Design for FDD Massive MIMO Systems in the Presence of Colored Noise
by: Marwah Abdulrazzaq Naser, et al.
Published: (2020-12-01)