Q-learning with nearest neighbors

Q-learning with nearest neighbors

© 2018 Curran Associates Inc.All rights reserved. We consider model-free reinforcement learning for infinite-horizon discounted Markov Decision Processes (MDPs) with a continuous state space and unknown transition kernel, when only a single sample path under an arbitrary policy of the system is ava...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক:	Shah, Devavrat, Xie, Qiaomin
অন্যান্য লেখক:	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
বিন্যাস:	প্রবন্ধ
ভাষা:	English
প্রকাশিত:	2021
অনলাইন ব্যবহার করুন:	https://hdl.handle.net/1721.1/137946

অনুরূপ উপাদানগুলি

Nearest Neighbors for Matrix Estimation Interpreted as Blind Regression for Latent Variable Model
অনুযায়ী: Li, Yihua, অন্যান্য
প্রকাশিত: (2021)

Simultaneous nearest neighbor search
অনুযায়ী: Kleinberg, Robert, অন্যান্য
প্রকাশিত: (2017)

Efficient discriminative learning of parametric nearest neighbor classifiers
অনুযায়ী: Zhang, Z, অন্যান্য
প্রকাশিত: (2012)

Nearest-neighbor methods in learning and vision : theory and practice /
অনুযায়ী: Shakhnarovich, Gregory, অন্যান্য
প্রকাশিত: (2005)

Nearest neighbor queries in spatial database
অনুযায়ী: Liu, Danzhou.
প্রকাশিত: (2008)

Approxiamate Nearest Neighbor Search in High Dimensions
অনুযায়ী: Andoni, Alexandr, অন্যান্য
প্রকাশিত: (2021)

Secure k -ish Nearest Neighbors Classifier
অনুযায়ী: Shaul, Hayim, অন্যান্য
প্রকাশিত: (2021)

Scalable Nearest Neighbor Search for Optimal Transport
অনুযায়ী: Backurs, Arturs, অন্যান্য
প্রকাশিত: (2022)

Nearest-neighbor forecast of U.S. interest rates
অনুযায়ী: Barkoulas, John, অন্যান্য
প্রকাশিত: (2003)

Random kernel k-nearest neighbors regression
অনুযায়ী: Patchanok Srisuradetchai, অন্যান্য
প্রকাশিত: (2024-07-01)

L-SCANN: Logarithmic Subcentroid and Nearest Neighbor
অনুযায়ী: Tohari Ahmad, অন্যান্য
প্রকাশিত: (2016-12-01)

Nearest neighbor search : the old, the new, and the impossible
অনুযায়ী: Andoni, Alexandr
প্রকাশিত: (2010)

Approximate nearest neighbor and its many variants
অনুযায়ী: Mahabadi, Sepideh
প্রকাশিত: (2013)

Predicting Unroll Factors Using Nearest Neighbors
অনুযায়ী: Stephenson, Mark, অন্যান্য
প্রকাশিত: (2005)

Approximate nearest neighbor problem in high dimensions
অনুযায়ী: Andoni, Alexandr
প্রকাশিত: (2006)

Nearest neighbor Markov dynamics on Macdonald processes
অনুযায়ী: Borodin, Alexei, অন্যান্য
প্রকাশিত: (2018)

Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality
অনুযায়ী: Har-Peled, Sariel, অন্যান্য
প্রকাশিত: (2021)

Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality
অনুযায়ী: Har-Peled, Sariel, অন্যান্য
প্রকাশিত: (2022)

Dicke superradiance requires interactions beyond nearest neighbors
অনুযায়ী: Mok, Wai-Keong, অন্যান্য
প্রকাশিত: (2023)

Reverse k Nearest Neighbor Search over Trajectories
অনুযায়ী: Wang, Sheng, অন্যান্য
প্রকাশিত: (2020)

Consistency of the $k$-nearest neighbors rule for functional data
অনুযায়ী: Younso, Ahmad
প্রকাশিত: (2023-01-01)

A pre-averaged pseudo nearest neighbor classifier
অনুযায়ী: Dapeng Li
প্রকাশিত: (2024-08-01)

Nearest-Neighbor Interactions and Their Influence on the Structural Aspects of Dipeptides
অনুযায়ী: Gunajyoti Das, অন্যান্য
প্রকাশিত: (2013-01-01)

Linear estimation for 2-D nearest-neighbor models
প্রকাশিত: (2003)

New LSH-based Algorithm for Approximate Nearest Neighbor
অনুযায়ী: Andoni, Alexandr, অন্যান্য
প্রকাশিত: (2005)

Efisiensi Big Data Menggunakan Improved Nearest Neighbor
অনুযায়ী: Aditya Hari Bawono, অন্যান্য
প্রকাশিত: (2019-12-01)

Clustering algorithm for imbalanced data based on nearest neighbor
অনুযায়ী: Sen WU, অন্যান্য
প্রকাশিত: (2020-09-01)

Probabilistic Nearest Neighbors Based Locality Preserving Projections for Unsupervised Metric Learning
অনুযায়ী: Alaor Cervati Neto, অন্যান্য
প্রকাশিত: (2024-05-01)

Machine learning classification based on k-Nearest Neighbors for PolSAR data
অনুযায়ী: JODAVID A. FERREIRA, অন্যান্য
প্রকাশিত: (2024-04-01)

Efficient Hamiltonian programming in qubit arrays with nearest-neighbor couplings
অনুযায়ী: Tsunoda, T, অন্যান্য
প্রকাশিত: (2020)

Mediated Interactions beyond the Nearest Neighbor in an Array of Superconducting Qubits
অনুযায়ী: Yanay, Yariv, অন্যান্য
প্রকাশিত: (2022)

QUANTUM FLUCTUATIONS IN THE AXIAL NEXT-NEAREST-NEIGHBOR ISING-MODEL
অনুযায়ী: Harris, A, অন্যান্য
প্রকাশিত: (1995)

Online fortune telling system using nearest neighbor relationship
অনুযায়ী: Cheng, Shao Chian.
প্রকাশিত: (2009)

Fruits recognition based on texture features and K-Nearest Neighbor
অনুযায়ী: Kamal Ariffin, Nur Izzani, অন্যান্য
প্রকাশিত: (2018)

Random k conditional nearest neighbor for high-dimensional data
অনুযায়ী: Jiaxuan Lu, অন্যান্য
প্রকাশিত: (2025-01-01)

Bagging Nearest Neighbor and its Enhancement for Machinery Predictive Maintenance
অনুযায়ী: Muhammad Irfan Arisani, অন্যান্য
প্রকাশিত: (2024-08-01)

An Ensemble for Automatic Time Series Forecasting With K-Nearest Neighbors
অনুযায়ী: Maria P. Frias, অন্যান্য
প্রকাশিত: (2025-01-01)

Engineering optimization by constrained differential evolution with nearest neighbor comparison
অনুযায়ী: Pham Hoang Anh
প্রকাশিত: (2016-06-01)

Detecting treatment interference under K-nearest-neighbors interference
অনুযায়ী: Alzubaidi Samirah H., অন্যান্য
প্রকাশিত: (2024-06-01)

Applying a randomized nearest neighbors algorithm to dimensionality reduction
অনুযায়ী: Jayaraman, Gautam, 1981-
প্রকাশিত: (2006)