Skip to content
You signed in with another tab or window. to refresh your session.
You signed out in another tab or window. to refresh your session.
You switched accounts on another tab or window. to refresh your session.
Here are
102 public repositories
matching this topic...
👁 SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
A hyperparameter optimization framework, inspired by Optuna.
PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
Yahoo! news article recommendation system by linUCB
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
A lightweight python library for bandit algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
More about the exploration-exploitation tradeoff with harder bandits
Privacy-Preserving Bandits (MLSys'20)
A curated list on papers about combinatorial multi-armed bandit problems.
Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning
A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit—designed for reinforcement learning applications
Deep contextual bandits in PyTorch: Neural Bandits, Neural Linear, and Linear Full Posterior Sampling with comprehensive benchmarking on synthetic and real datasets
This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
Personalized and Interactive Music Recommendation with Bandit approach
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.
Improve this page
Add a description, image, and links to the
bandit-algorithms
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
bandit-algorithms
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.