Abstract

Previous article Next article Adaptive Strategies for Certain Classes of Controlled Markov ProcessesE. I. GordienkoE. I. Gordienkohttps://doi.org/10.1137/1129064PDFBibTexSections ToolsAdd to favoritesExport CitationTrack CitationsEmail SectionsAbout[1] V. N. Fomin, , A. L. Fradkov and , V. A. Yakubovich, Adaptive Control of Dynamic Objects, Nauka, Moscow, 1981, (In Russian.) 0522.93002 Google Scholar[2] V. G. Sragovich, Theory of Adaptive Systems, Nauka, 1976Moscow, (In Russian.) 0333.93005 Google Scholar[3] Yu. V. Popov, Adaptive systems for the control of certain classes of random processes of general type, Studies in the theory of adaptive systems (Russian), Vyčisl. Centr, Akad. Nauk SSSR, Moscow, 1976, 119–142, 223, (In Russian.) 58:33328 Google Scholar[4] G. A. Agasandyan, Adaptive system for homogeneous processes with continuous sets of states and controls, Theory Prob. Appl., 24 (1979), 515–528 0409.93030 Google Scholar[5] E. I. Gordienko, Adaptive optimal control of some Markov processes, Dokl. Akad. Nauk SSSR, 261 (1981), 271–275, (In Russian.) 83b:93041 0494.93027 Google Scholar[6] Ye. B. Dynkin and , A. A. Yushkevich, Controlled Markov processes, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], Vol. 235, Springer-Verlag, Berlin, 1979xvii+289 80k:90037 CrossrefGoogle Scholar[7] J. Doob, Stochastic processes, John Wiley & Sons Inc., New York, 1953viii+654 15,445b 0053.26802 Google Scholar[8] V. V. Kalashnikov, Qualitative Analysis of the Behavior of Complex Systems by the Method of Probe Functions, Nauka, Moscow, 1978, (In Russian.) 0451.93002 Google Scholar[9] L. G. Glubenko and , E. S. Shtatland, On controlled Markov processes with discrete timeTheory of Probability and Mathematical Statistics, Vol. 7, Naukova Dumka, Kiev, 1972, 51–64, (In Russian.) Google Scholar[10] V. V. Petrov, Sums of independent random variables, Springer-Verlag, New York, 1975x+346 52:9335 0322.60042 CrossrefGoogle Scholar[11] P. Ganssler and , W. Stute, Empirical processes: a survey of results for independent and identically distributed random variables, Ann. Probab., 7 (1979), 193–243 80d:60002 CrossrefGoogle Scholar[12] Patrick Billingsley and , Flemming Topsøe, Uniformity in weak convergence, Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 7 (1967), 1–16 35:326 0147.15701 CrossrefGoogle Scholar[13] A. N. Kolmogorov and , V. M. Tikhomirov, $\varepsilon$-entropy and $\varepsilon$-capacity of sets in function spaces, Uspehi Mat. Nauk, 14 (1959), 3–86 22:2890 Google Scholar[14] R. M. Dudley, The speed of mean Glivenko-Cantelli convergence, Ann. Math. Statist, 40 (1968), 40–50 38:5270 0184.41401 CrossrefGoogle Scholar Previous article Next article FiguresRelatedReferencesCited ByDetails Asymptotically Optimal Strategies for Adaptive Zero-Sum Discounted Markov GamesJ. Adolfo Minjárez-Sosa and Oscar Vega-AmayaSIAM Journal on Control and Optimization, Vol. 48, No. 3 | 15 April 2009AbstractPDF (230 KB)Empirical estimation in average Markov control processesApplied Mathematics Letters, Vol. 21, No. 5 | 1 May 2008 Cross Ref Average Optimality for Adaptive Markov Control Processes with Unbounded Costs and Unknown Disturbance DistributionMarkov Processes and Controlled Markov Chains | 1 Jan 2002 Cross Ref Approximation of average cost optimal policies for general Markov decision processes with unbounded costsMathematical Methods of Operations Research, Vol. 45, No. 2 | 1 Jun 1997 Cross Ref Recurrence conditions for Markov decision processes with Borel state space: A surveyAnnals of Operations Research, Vol. 28, No. 1 | 1 Dec 1991 Cross Ref Nonparametric estimation and adaptive control in a class of finite Markov decision chainsAnnals of Operations Research, Vol. 28, No. 1 | 1 Dec 1991 Cross Ref Density estimation and adaptive control of markov processes: Average and discounted criteriaActa Applicandae Mathematicae, Vol. 20, No. 3 | 1 Sep 1990 Cross Ref Nonparametric adaptive control of discrete-time partially observable stochastic systemsJournal of Mathematical Analysis and Applications, Vol. 137, No. 2 | 1 Feb 1989 Cross Ref Continuous dependence of stochastic control models on the noise distributionApplied Mathematics & Optimization, Vol. 17, No. 1 | 1 Jan 1988 Cross Ref Adaptive policies for discrete-time stochastic control systems with unknown disturbance distributionSystems & Control Letters, Vol. 9, No. 4 | 1 Oct 1987 Cross Ref Adaptive control of stochastic systems with unknown noise distribution--Discounted reward criterion1986 25th IEEE Conference on Decision and Control | 1 Dec 1986 Cross Ref Volume 29, Issue 3| 1985Theory of Probability & Its Applications427-645 History Submitted:12 April 1981Published online:17 July 2006 InformationCopyright © Society for Industrial and Applied MathematicsPDF Download Article & Publication DataArticle DOI:10.1137/1129064Article page range:pp. 504-518ISSN (print):0040-585XISSN (online):1095-7219Publisher:Society for Industrial and Applied Mathematics

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.