Emilie Kaufmann
|
Books And Theses
Articles
Finding good policies in average-reward Markov Decision Processes without prior knowledge,
NeurIPS 2024, Vancouver (Canada), Canada
NeurIPS 2024, Vancouver (Canada), Canada
Optimistic PAC Reinforcement Learning: the Instance-Dependent View,
ALT 2023, Singapore (SG), Singapore
ALT 2023, Singapore (SG), Singapore
An ε-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond,
NeurIPS 2023, New Orleans, United States
NeurIPS 2023, New Orleans, United States
Adaptive Algorithms for Relaxed Pareto Set Identification,
NeurIPS 2023, La Nouvelle Orléans, LA, United States
NeurIPS 2023, La Nouvelle Orléans, LA, United States
Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs,
NeurIPS 2022, New Orleans, United States
NeurIPS 2022, New Orleans, United States
Optimal Thompson Sampling strategies for support-aware CVaR bandits,
ICML 2021, Virtual, United States
ICML 2021, Virtual, United States
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces,
AISTATS 2021, San Diego / Virtual, United States
AISTATS 2021, San Diego / Virtual, United States
Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited,
ALT 2021, Paris / Virtual, France
ALT 2021, Paris / Virtual, France
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity,
NeurIPS 2020, Vancouver, France
NeurIPS 2020, Vancouver, France
A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players,
AISTATS 2020, Palermo, Italy
AISTATS 2020, Palermo, Italy
Solving Bernoulli Rank-One Bandits with Unimodal Thompson Sampling,
ALT 2020, San Diego, United States
ALT 2020, San Diego, United States
Analyse non asymptotique d'un test séquentiel de détection de rupture et application aux bandits non stationnaires Non-asymptotic analysis of a sequential rupture detection test and its application to non-stationary bandits,
GRETSI 2019 - XXVIIème Colloque francophone de traitement du signal et des images 2019, Lille, France
GRETSI 2019 - XXVIIème Colloque francophone de traitement du signal et des images 2019, Lille, France
A simple dynamic bandit algorithm for hyper-parameter tuning,
Workshop on Automated Machine Learning at International Conference on Machine Learning 2019, Long Beach, United States
Workshop on Automated Machine Learning at International Conference on Machine Learning 2019, Long Beach, United States
General parallel optimization without a metric,
Algorithmic Learning Theory 2019, Chicago, United States
Algorithmic Learning Theory 2019, Chicago, United States
Adaptive black-box optimization got easier: HCT only needs local smoothness,
European Workshop on Reinforcement Learning 2018, Lille, France
European Workshop on Reinforcement Learning 2018, Lille, France
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling,
NeurIPS 2018, Montréal, Canada
NeurIPS 2018, Montréal, Canada
Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence,
ALT 2018, Lanzarote, Spain
ALT 2018, Lanzarote, Spain
Multi-Player Bandits Revisited Modèles de Bandits Multi-Joueurs Revisités,
Algorithmic Learning Theory 2018, Lanzarote, Spain
Algorithmic Learning Theory 2018, Lanzarote, Spain
Aggregation of Multi-Armed Bandits Learning Algorithms for Opportunistic Spectrum Access Agrégation d'algorithmes d'apprentissage pour les bandits multi-bras appliquée à l'accès opportuniste au spectre,
IEEE WCNC - IEEE Wireless Communications and Networking Conference 2018, Barcelona, Spain
IEEE WCNC - IEEE Wireless Communications and Networking Conference 2018, Barcelona, Spain
Multi-Armed Bandit Learning in IoT Networks: Learning helps even in non-stationary settings Apprentissage de Bandit Multi-Bras dans les réseaux Internet des Objets: l'apprentissage est utile même dans des cas non-stationnaires,
CROWNCOM 2017 - 12th EAI International Conference on Cognitive Radio Oriented Wireless Networks 2017, Lisbon, Portugal
CROWNCOM 2017 - 12th EAI International Conference on Cognitive Radio Oriented Wireless Networks 2017, Lisbon, Portugal
Learning the distribution with largest mean: two bandit frameworks,
ESAIM: Proceedings and Surveys 2017
ESAIM: Proceedings and Surveys 2017
Thompson Sampling for one-dimensial exponential family bandits,
NIPS 2013 - Neural Information Processing Systems Conference 2013, Lake Tahoe, United States
NIPS 2013 - Neural Information Processing Systems Conference 2013, Lake Tahoe, United States
Thompson sampling for one-dimensional exponential family bandits,
Advances in Neural Information Processing Systems 2013, United States
Advances in Neural Information Processing Systems 2013, United States
On Bayesian Upper Confidence Bounds for Bandit Problems,
AISTATS 2012, La Palma, Iles Canaries, Spain
AISTATS 2012, La Palma, Iles Canaries, Spain
Journal articles
Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits,
Journal of Machine Learning Research 2022
Journal of Machine Learning Research 2022
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals,
Journal of Machine Learning Research 2021
Journal of Machine Learning Research 2021
Machine learning applications in drug development,
Computational and Structural Biotechnology Journal 2020
Computational and Structural Biotechnology Journal 2020
A spectral algorithm with additive clustering for the recovery of overlapping communities in networks,
Theoretical Computer Science 2018
Theoretical Computer Science 2018
On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models,
Journal of Machine Learning Research 2016
Journal of Machine Learning Research 2016
Hdr
Editorship
Misc
Generated using
minifold
and minifold-lincs
at 2025-03-31 17:01.