E-Books durchsuchen

Mathematical Learning Models — Theory and Algorithms [1983]

1: The Minimax Risk for the Two-Armed Bandit Problem

Elektronische Ausgabe
12: Bandit Problems with Random Discounting

Elektronische Ausgabe
26: Stochastic Approximation on a Bounded Convex Set

Elektronische Ausgabe
33: Learning Automaton for Finite Semi-Markov Decision Processes

Elektronische Ausgabe
43: A Local Asymptotic Minimax Optimality of an Adaptive Robbins Monro Stochastic Approximation Procedure

Elektronische Ausgabe
50: Dynamic Allocation Indices for Bayesian Bandits

Elektronische Ausgabe
68: The Role of Dynamic Allocation Indices in the Evaluation of Suboptimal Strategies for Families of Bandit Processes

Elektronische Ausgabe
78: On the Discretization Technique for Optimal Discounted Control of the Wiener Process

Elektronische Ausgabe
86: Asymptotic Properties of Learning Models

Elektronische Ausgabe
93: On the infinitesimal characterization of monotone stopping problems in continuous time

Elektronische Ausgabe
101: Numerical Investigation of the Two Armed Bandit

Elektronische Ausgabe
108: Uniform Bounds for a Dynamic Programming Model under Adaptive Control Using Exponentially Bounded Error Probabilities

Elektronische Ausgabe
115: Stochastic Regression Models and Consistency of the Least Squares Identification Scheme

Elektronische Ausgabe
126: Recursive Identification Techniques

Elektronische Ausgabe
138: An Optimization Problem for Matrices with Application to Decision Models

Elektronische Ausgabe
145: On a Class of Learning Algorithms with Symmetric Behavior under Success and Failure

Elektronische Ausgabe
156: Convergence of a General Stochastic Approximation Process under Convex Constraints and some Applications

Elektronische Ausgabe
168: On Kersting’s Theorem on Weak Convergence of Recursions

Elektronische Ausgabe
175: On Continuous Time Learning Models

Elektronische Ausgabe
182: Convergence of Stochastic Approximation Algorithms with Non-Additive Dependent Disturbances and Applications

Elektronische Ausgabe
191: Sequential probability ratio tests for homogeneous Markov chains

Elektronische Ausgabe
203: Allocation Rules for Sequential Clinical Trials

Elektronische Ausgabe
213: Non-Deterministic Modelling and its Application in Adaptive Optimal Control

Elektronische Ausgabe

Schnellzugriff

Ausleihen & Bestellen

Schnellzugriff

Recherchieren & Entdecken

Schnellzugriff

Lernen & Arbeiten

Schnellzugriff

Publizieren & Archivieren

Schnellzugriff

Über die TIB

Schnellzugriff

Forschung & Entwicklung

Mathematical Learning Models — Theory and Algorithms [1983]