Bayesian Actor-Critic Algorithms (Englisch)
- Neue Suche nach: Ghavamzadeh, M.
- Neue Suche nach: Engel, Y.
- Neue Suche nach: Ghavamzadeh, M.
- Neue Suche nach: Engel, Y.
- Neue Suche nach: Ghahramani, Zoubin
In:
International conference on machine learning (ICML 2007)
;
297-304
;
2007
-
ISBN:
- Aufsatz (Konferenz) / Print
-
Titel:Bayesian Actor-Critic Algorithms
-
Beteiligte:
-
Kongress:24th, International conference on machine learning (ICML 2007) ; 2007 ; Corvallis, Or.
-
Erschienen in:
-
Verlag:
- Neue Suche nach: ACM]
-
Erscheinungsort:[New York
-
Erscheinungsdatum:01.01.2007
-
Format / Umfang:8 pages
-
Anmerkungen:"Co-located with the International Conference on Inductive Logic Programming (ILP 2007). Includes bibliographical references and author index. Also available on the World Wide Web (PDF files)
-
ISBN:
-
Medientyp:Aufsatz (Konferenz)
-
Format:Print
-
Sprache:Englisch
-
Schlagwörter:
-
Datenquelle:
© Metadata Copyright the British Library Board and other contributors. All rights reserved.
Inhaltsverzeichnis Konferenzband
Die Inhaltsverzeichnisse werden automatisch erzeugt und basieren auf den im Index des TIB-Portals verfügbaren Einzelnachweisen der enthaltenen Beiträge. Die Anzeige der Inhaltsverzeichnisse kann daher unvollständig oder lückenhaft sein.
- 1
-
Quantum Clustering AlgorithmsAimeur, E. / Brassard, G. / Gambs, S. et al. | 2007
- 9
-
Learning Random Walks to Rank Nodes in GraphsAgarwal, A. / Chakrabarti, S. et al. | 2007
- 17
-
Uncovering Shared Structures in Multiclass ClassificationAmit, Y. / Fink, M. / Srebro, N. / Ullman, S. et al. | 2007
- 25
-
Two-view Feature Generation Model for Semi-supervised LearningAndo, R. / Zhang, T. et al. | 2007
- 33
-
Scalable Training of L1-regularized Log-linear ModelsAndrew, G. / Gao, J. et al. | 2007
- 41
-
Multiclass Core Vector MachineAsharaf, S. / Murty, M. N. / Shevade, S. K. et al. | 2007
- 49
-
The Rendezvous Algorithm: Multiclass Semi-Supervised Learning with Markov Random WalksAzran, A. et al. | 2007
- 57
-
Focussed Crawling with Scalable Ordinal Regression SolversBabaria, R. / Jagarlapudi, S. N. / Kumar, K. S. / Kaveri, S. R. / Bhattacharyya, C. / Murty, M. N. et al. | 2007
- 65
-
Learning Distance Function by Coding SimilarityHillel, A. B. / Weinshall, D. et al. | 2007
- 73
-
Structural Alignment based Kernels for Protein Structure ClassificationBhattacharya, S. / Bhattacharyya, C. / Chandra, N. R. et al. | 2007
- 81
-
Discriminative Learning for Differing Training and Test DistributionsBickel, S. / Bruckner, M. / Scheffer, T. et al. | 2007
- 89
-
Solving MultiClass Support Vector Machines with LaRankBordes, A. / Bottou, L. / Gallinari, P. / Weston, J. et al. | 2007
- 97
-
Efficiently Computing Minimax Expected-Size Confidence RegionsBryan, B. / McMahan, H. B. / Schafer, C. M. / Schneider, J. et al. | 2007
- 105
-
Multiple Instance Learning for Sparse Positive BagsBunescu, R. C. / Mooney, R. J. et al. | 2007
- 113
-
Cluster Analysis of Heterogeneous Rank DataBusse, L. M. / Orbanz, P. / Buhmann, J. M. et al. | 2007
- 121
-
Feature Selection in Kernel SpaceCao, B. / Shen, D. / Sun, J.-T. / Yang, Q. / Chen, Z. et al. | 2007
- 129
-
Learning to Rank: From Pairwise Approach to Listwise ApproachCao, Z. / Qin, T. / Liu, T.-Y. / Tsai, M.-F. / Li, H. et al. | 2007
- 137
-
Local Similarity Discriminant AnalysisCazzanti, L. / Gupta, M. et al. | 2007
- 145
-
Direct Convex Relaxations of Sparse SVMChan, A. B. / Vasconcelos, N. / Lanckriet, G. R. G. et al. | 2007
- 153
-
Minimum Reference Set Based Feature Selection for Small Sample ClassificationsChen, X.-w. / Jeong, J. C. et al. | 2007
- 161
-
Learning to Compress Images and VideoCheng, L. / Vishwanathan, S. V. N. et al. | 2007
- 169
-
Magnitude-Preserving Ranking AlgorithmsCortes, C. / Mohri, M. / Rastogi, A. et al. | 2007
- 177
-
Full Regularization Path for Sparse Principal Component Analysisd Aspremont, A. / Bach, F. R. / El Ghaoui, L. et al. | 2007
- 185
-
Kernel Selection for Semi-Supervised Kernel MachinesDai, G. / Yeung, D.-Y. et al. | 2007
- 193
-
Boosting for Transfer LearningDai, W. / Yang, Q. / Xue, G.-R. / Yu, Y. et al. | 2007
- 201
-
Intractability and Clustering with ConstraintsDavidson, I. / Ravi, S. S. et al. | 2007
- 209
-
Information-Theoretic Metric LearningDavis, J. V. / Kulis, B. / Jain, P. / Sra, S. / Dhillon, I. S. et al. | 2007
- 217
-
An Integrated Approach to Feature Invention and Model Construction for Drug Activity PredictionDavis, J. / Costa, V. S. / Ray, S. / Page, D. et al. | 2007
- 225
-
Percentile Optimization in Uncertain MDP with Application to Efficient ExplorationDelage, E. / Mannor, S. et al. | 2007
- 233
-
Unsupervised Prediction of Citation InfluencesDietz, L. / Bickel, S. / Scheffer, T. et al. | 2007
- 241
-
Non-Isometric Manifold Learning: Analysis and an AlgorithmDollar, P. / Rabaud, V. / Belongie, S. et al. | 2007
- 249
-
Hierarchical Maximum Entropy Density EstimationDudik, M. / Blei, D. M. / Schapire, R. E. et al. | 2007
- 257
-
CarpeDiem: an Algorithm for the Fast Evaluation of SSL ClassifiersEsposito, R. / Radicioni, D. P. et al. | 2007
- 265
-
Manifold-adaptive dimension estimationFarahmand, A. m. / Szepesvari, C. / Audibert, J.-Y. et al. | 2007
- 273
-
Combining Online and Offline Knowledge in UCTGelly, S. / Silver, D. et al. | 2007
- 281
-
Robust Non-linear Dimensionality Reduction using Successive 1-Dimensional Laplacian EigenmapsGerber, S. / Tasdizen, T. / Whitaker, R. et al. | 2007
- 289
-
Gradient Boosting for Kernelized Output SpacesGeurts, P. / Wehenkel, L. / d Alche-Buc, F. et al. | 2007
- 297
-
Bayesian Actor-Critic AlgorithmsGhavamzadeh, M. / Engel, Y. et al. | 2007
- 305
-
Exponentiated Gradient Algorithms for Log-Linear Structured PredictionGloberson, A. / Koo, T. / Carreras, X. / Collins, M. et al. | 2007
- 313
-
Best of Both: A Hybridized Centroid-Medoid Clustering HeuristicGrira, N. / Houle, M. E. et al. | 2007
- 321
-
Recovering Temporally Rewiring Networks: A model-based approachGuo, F. / Hanneke, S. / Fu, W. / Xing, E. P. et al. | 2007
- 329
-
Efficient Inference with Cardinality-based Clique PotentialsGupta, R. / Diwan, A. A. / Sarawagi, S. et al. | 2007
- 337
-
Sparse Probabilistic ClassifiersHerault, R. / Grandvalet, Y. et al. | 2007
- 345
-
Supervised Clustering of Streaming Data for Email Batch DetectionHaider, P. / Brefeld, U. / Scheffer, T. et al. | 2007
- 353
-
A Bound on the Label Complexity of Agnostic Active LearningHanneke, S. et al. | 2007
- 361
-
Learning Nonparametric Kernel Matrices from Pairwise ConstraintsHoi, S. C. H. / Jin, R. / Lyu, M. R. et al. | 2007
- 369
-
Parameter Learning for Relational Bayesian NetworksJaeger, M. et al. | 2007
- 377
-
Bayesian Compressive Sensing and Projection OptimizationJi, S. / Carin, L. et al. | 2007
- 385
-
Constructing Basis Functions from Directed Graphs for Value Function ApproximationJohns, J. / Mahadevan, S. et al. | 2007
- 393
-
Most Likely Heteroscedastic Gaussian Process RegressionKersting, K. / Plagemann, C. / Pfaff, P. / Burgard, W. et al. | 2007
- 401
-
Neighbor Search with Global Geometry: A Minimax Message Passing AlgorithmKim, K.-H. / Choi, S. et al. | 2007
- 409
-
A Recursive Method for Discriminative Mixture LearningKim, M. / Pavlovic, V. et al. | 2007
- 417
-
Infinite Mixtures of TreesKirshner, S. / Smyth, P. et al. | 2007
- 425
-
Local Dependent ComponentsKlami, A. / Kaski, S. et al. | 2007
- 433
-
Statistical Predicate InventionKok, S. / Domingos, P. et al. | 2007
- 441
-
Kernelizing PLS, Degrees of Freedom, and Efficient Model SelectionKramer, N. / Braun, M. L. et al. | 2007
- 449
-
Nonmyopic Active Learning of Gaussian Processes: An Exploration--Exploitation ApproachKrause, A. / Guestrin, C. et al. | 2007
- 457
-
On One Method of Non-Diagonal Regularization in Sparse Bayesian LearningKropotov, D. / Vetrov, D. et al. | 2007
- 465
-
Online Kernel PCA with Entropic Matrix UpdatesKuzmin, D. / Warmuth, M. K. et al. | 2007
- 473
-
An Empirical Evaluation of Deep Architectures on Problems with Many Factors of VariationLarochelle, H. / Erhan, D. / Courville, A. / Bergstra, J. / Bengio, Y. et al. | 2007
- 481
-
The Hierarchical Gaussian Process Latent Variable ModelLawrence, N. D. / Moore, A. J. et al. | 2007
- 489
-
Learning a Meta-Level Prior for Feature Relevance from Multiple Related TasksLee, S.-I. / Chatalbashev, V. / Vickrey, D. / Koller, D. et al. | 2007
- 497
-
Scalable Modeling of Real Graphs using Kronecker MultiplicationLeskovec, J. / Faloutsos, C. et al. | 2007
- 505
-
Support Cluster MachineLi, B. / Chi, M. / Fan, J. / Xue, X. et al. | 2007
- 513
-
A Transductive Framework of Distance Metric Learning by Spectral Dimensionality ReductionLi, F. / Yang, J. / Wang, J. et al. | 2007
- 521
-
Adaptive Dimension Reduction Using Discriminant Analysis and $K$-means ClusteringLi, T. / Ding, C. et al. | 2007
- 529
-
Large-scale RLSC Learning Without AgonyLi, W. / Lee, K.-H. / Leung, K.-S. et al. | 2007
- 537
-
A Novel Orthogonal NMF-Based Belief Compression for POMDPsLi, X. / Cheung, W. K. W. / Liu, J. / Wu, Z. et al. | 2007
- 545
-
A Permutation-augmented Sampler for DP Mixture ModelsLiang, P. / Jordan, M. I. / Taskar, B. et al. | 2007
- 553
-
Quadratically Gated Mixture of Experts for Incomplete Data ClassificationLiao, X. / Li, H. / Carin, L. et al. | 2007
- 561
-
Trust Region Newton Methods for Large-Scale Logistic RegressionLin, C.-J. / Weng, R. C.-H. / Keerthi, S. et al. | 2007
- 569
-
Relational Clustering by Symmetric Convex CodingLong, B. / Zhang, Z. / Wu, X. / Yu, P. S. et al. | 2007
- 577
-
Discriminant Analysis in Correlation Similarity Measure SpaceMa, Y. / Lao, S. / Takikawa, E. / Kawade, M. et al. | 2007
- 585
-
Adaptive Mesh Compression in 3D Computer Graphics using Multiscale Manifold LearningMahadevan, S. et al. | 2007
- 593
-
Simple, Robust, Scalable Semi-supervised Learning via Expectation RegularizationMann, G. S. / McCallum, A. et al. | 2007
- 601
-
Automatic Shaping and Decomposition of Reward FunctionsMarthi, B. et al. | 2007
- 609
-
Asymmetric BoostingMasnadi-Shirazi, H. / Vasconcelos, N. et al. | 2007
- 617
-
Linear and Nonlinear Generative Probabilistic Class Models for Shape ContoursMcNeill, G. / Vijayakumar, S. et al. | 2007
- 625
-
Bottom-Up Learning of Markov Logic Network StructureMihalkova, L. / Mooney, R. J. et al. | 2007
- 633
-
Mixtures of Hierarchical Topics with Pachinko AllocationMimno, D. / Li, W. / McCallum, A. et al. | 2007
- 641
-
Three New Graphical Models for Statistical Language ModellingMnih, A. / Hinton, G. et al. | 2007
- 649
-
Fast and Effective Kernels for Relational Learning from TextsMoschitti, A. / Zanzotto, F. M. et al. | 2007
- 657
-
Dimensionality Reduction and GeneralizationMosci, S. / Rosasco, L. / Verri, A. et al. | 2007
- 665
-
Unsupervised Estimation for Noisy-Channel ModelsMylonakis, M. / Sima an, K. / Hwa, R. et al. | 2007
- 673
-
Revisiting Probabilistic Models for Clustering with ConstraintsNelson, B. / Cohen, I. et al. | 2007
- 681
-
Comparisons of Sequence Labeling Algorithms and ExtensionsNguyen, N. / Guo, Y. et al. | 2007
- 689
-
Multi-Task Learning for Sequential Data via iHMMs and the Nested Dirichlet ProcessNi, K. / Carin, L. / Dunson, D. et al. | 2007
- 697
-
Regression on Manifolds using Kernel Dimension ReductionNilsson, J. / Sha, F. / Jordan, M. I. et al. | 2007
- 705
-
Learning State-Action Basis Functions for Hierarchical MDPsOsentoski, S. / Mahadevan, S. et al. | 2007
- 713
-
A Fast Linear Separability Test by Projection of Positive Points on SubspacesYogananda, A. P. / Murty, M. N. / Gopal, L. et al. | 2007
- 721
-
Multi-armed Bandit Problems with Dependent ArmsPandey, S. / Chakrabarti, D. / Agarwal, D. et al. | 2007
- 729
-
Learning for Efficient Retrieval of Structured Data with Noisy QueriesParker, C. / Fern, A. / Tadepalli, P. et al. | 2007
- 737
-
Analyzing Feature Generation for Value-Function ApproximationParr, R. / Painter-Wakefield, C. / Li, L. / Littman, M. et al. | 2007
- 745
-
Reinforcement Learning by Reward-weighted Regression for Operational Space ControlPeters, J. / Schaal, S. et al. | 2007
- 751
-
Tracking Value Function Dynamics to Improve Reinforcement Learning with Piecewise Linear Function ApproximationPhua, C. W. / Fitch, R. et al. | 2007
- 759
-
Self-taught Learning: Transfer Learning from Unlabeled DataRaina, R. / Battle, A. / Lee, H. / Packer, B. / Ng, A. Y. et al. | 2007
- 767
-
Online Discovery of Similarity MappingsRakhlin, A. / Abernethy, J. / Bartlett, P. L. et al. | 2007
- 775
-
More Efficiency in Multiple Kernel LearningRakotomamonjy, A. / Bach, F. R. / Canu, S. / Grandvalet, Y. et al. | 2007
- 783
-
Graph Clustering With Network Structure IndicesRattigan, M. J. / Maier, M. / Jensen, D. et al. | 2007
- 791
-
Restricted Boltzmann Machines for Collaborative FilteringSalakhutdinov, R. / Mnih, A. / Hinton, G. et al. | 2007
- 799
-
Sample Compression Bounds for Decision TreesShah, M. et al. | 2007
- 807
-
Pegasos: Primal Estimated sub-GrAdient SOlver for SVMShalev-Shwartz, S. / Singer, Y. / Srebro, N. et al. | 2007
- 815
-
A Dependence Maximization View of ClusteringSong, L. / Smola, A. / Gretton, A. / Borgwardt, K. M. et al. | 2007
- 823
-
Supervised Feature Selection via Dependence EstimationSong, L. / Smola, A. / Gretton, A. / Borgwardt, K. M. / Bedo, J. et al. | 2007
- 831
-
Sparse Eigen Methods by DC ProgrammingSriperumbudur, B. / Torres, D. / Lanckriet, G. et al. | 2007
- 839
-
Learning to Solve Game TreesStern, D. / Herbrich, R. / Graepel, T. et al. | 2007
- 847
-
Robust Mixtures in the Presence of Measurement ErrorsSun, J. / Kaban, A. / Raychaudhury, S. et al. | 2007
- 855
-
A Kernel-based Causal Learning AlgorithmSun, X. / Janzing, D. / Scholkopf, B. / Fukumizu, K. et al. | 2007
- 863
-
Piecewise Pseudolikelihood for Efficient Training of Conditional Random FieldsSutton, C. / McCallum, A. et al. | 2007
- 871
-
On the Role of Tracking in Stationary EnvironmentsSutton, R. S. / Koop, A. / Silver, D. et al. | 2007
- 879
-
Cross-Domain Transfer for Reinforcement LearningTaylor, M. E. / Stone, P. et al. | 2007
- 887
-
Incremental Bayesian Networks for Structure PredictionTitov, I. / Henderson, J. et al. | 2007
- 895
-
Classifying Matrices with a Spectral RegularizationTomioka, R. / Aihara, K. et al. | 2007
- 903
-
Approximate Maximum Margin Algorithms with Rules Controlled by the Number of MistakesTsampouka, P. / Shawe-Taylor, J. et al. | 2007
- 911
-
Simpler Core Vector Machines with Enclosing BallsTsang, I. W. / Kocsor, A. / Kwok, J. T. et al. | 2007
- 919
-
Entire Regularization Paths for Graph DataTsuda, K. et al. | 2007
- 927
-
Discriminative Gaussian Process Latent Variable Models for ClassificationUrtasun, R. / Darrell, T. et al. | 2007
- 935
-
Experimental Perspectives on Learning from Imbalanced DataVan Hulse, J. D. / Khoshgoftaar, T. M. / Napolitano, A. et al. | 2007
- 943
-
Learning from Interpretations: A Rooted Kernel for Ordered HypergraphsWachman, G. / Khardon, R. et al. | 2007
- 951
-
A Kernel Path Algorithm for Support Vector MachinesWang, G. / Yeung, D.-Y. / Lochovsky, F. et al. | 2007
- 959
-
Dirichlet Aggregation: Unsupervised Learning towards an Optimal Metric for Proportional DataWang, H.-Y. / Zha, H. / Qin, H. et al. | 2007
- 967
-
Transductive Regression Piloted by Inter-Manifold RelationsWang, H. / Yan, S. / Huang, T. / Liu, J. / Tang, X. et al. | 2007
- 975
-
Multifactor Gaussian Process Models for Style-Content SeparationWang, J. M. / Fleet, D. J. / Hertzmann, A. et al. | 2007
- 983
-
Hybrid Huberized Support Vector Machines for Microarray ClassificationWang, L. / Zhu, J. / Zou, H. et al. | 2007
- 991
-
On Learning with Dissimilarity FunctionsWang, L. / Yang, C. / Feng, J. et al. | 2007
- 999
-
Winnowing SubspacesWarmuth, M. K. et al. | 2007
- 1007
-
What Is Decreased by the Max-sum Arc Consistency Algorithm?Werner, T. et al. | 2007
- 1015
-
Multi-Task Reinforcement Learning: A Hierarchical Bayesian ApproachWilson, A. / Fern, A. / Tadepalli, P. / Ray, S. et al. | 2007
- 1023
-
Beamforming using the Relevance Vector MachineWipf, D. / Nagarajan, S. et al. | 2007
- 1031
-
Learning to Combine Distances for Complex RepresentationsWoznica, A. / Kalousis, A. / Hilario, M. et al. | 2007
- 1039
-
Local Learning ProjectionsWu, M. / Yu, K. / Yu, S. / Scholkopf, B. et al. | 2007
- 1047
-
On Learning Linear Ranking Functions for Beam SearchXu, Y. / Fern, A. et al. | 2007
- 1055
-
Modeling Changing Dependency Structure in Multivariate Time SeriesXuan, X. / Murphy, K. et al. | 2007
- 1063
-
The Matrix Stick-Breaking Process for Flexible Multi-Task LearningXue, Y. / Dunson, D. / Carin, L. et al. | 2007
- 1071
-
Map Building without Localization by Dimensionality Reduction TechniquesYairi, T. et al. | 2007
- 1079
-
Asymptotic Bayesian Generalization Error When Training and Test Distributions Are DifferentYamazaki, K. / Kawanabe, M. / Watanabe, S. / Sugiyama, M. / Muller, K.-R. et al. | 2007
- 1087
-
Least Squares Linear Discriminant AnalysisYe, J. et al. | 2007
- 1095
-
Discriminant Kernel and Regularization Parameter Learning via Semidefinite ProgrammingYe, J. / Chen, J. / Ji, S. et al. | 2007
- 1103
-
Robust Multi-Task Learning with t-ProcessesYu, S. / Tresp, V. / Yu, K. et al. | 2007
- 1111
-
On the Value of Pairwise Constraints in Classification and ConsistencyZhang, J. / Yan, R. et al. | 2007
- 1119
-
Maximum Margin Clustering Made PracticalZhang, K. / Tsang, I. W. / Kwok, J. T. et al. | 2007
- 1127
-
Nonlinear Independent Component Analysis with Minimal Nonlinear DistortionZhang, K. / Chan, L. et al. | 2007
- 1135
-
Optimal Dimensionality of Metric Space for ClassificationZhang, W. / Xue, X. / Sun, Z. / Guo, Y.-F. / Lu, H. et al. | 2007
- 1143
-
Conditional Random Fields for Multi-agent Reinforcement LearningZhang, X. / Aberdeen, D. / Vishwanathan, S. V. N. et al. | 2007
- 1151
-
Spectral Feature Selection for Supervised and Unsupervised LearningZhao, Z. / Liu, H. et al. | 2007
- 1159
-
Spectral Clustering with Multiple ViewsZhou, D. / Burges, C. J. C. et al. | 2007
- 1167
-
On the Relation Between Multi-Instance Learning and Semi-Supervised LearningZhou, Z.-H. / Xu, J.-M. et al. | 2007
- 1175
-
Dynamic Hierarchical Markov Random Fields and their Application to Web Data ExtractionZhu, J. / Nie, Z. / Zhang, B. / Wen, J.-R. et al. | 2007
- 1183
-
Transductive Support Vector Machines for Structured VariablesZien, A. / Brefeld, U. / Scheffer, T. et al. | 2007
- 1191
-
Multiclass Multiple Kernel LearningZien, A. / Ong, C. S. et al. | 2007