VOOZH

URL: https://github.com/topics/bandit-algorithms

⇱ bandit-algorithms · GitHub Topics · GitHub

#

bandit-algorithms

Here are 102 public repositories matching this topic...

👁 SMPyBandits

SMPyBandits / SMPyBandits

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

python open-source research internet-of-things simulations multi-arm-bandits multi-armed-bandit learning-theory bandit-algorithms cognitive-radio

Updated
Jupyter Notebook

c-bata / goptuna

A hyperparameter optimization framework, inspired by Optuna.

bayesian-optimization evolution-strategies blackbox-optimization bandit-algorithms

Updated
Go

WilliamLwj / PyXAB

PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms

data-science machine-learning algorithm reinforcement-learning optimization machine-learning-algorithms hyperparameter-optimization hyperparameter-tuning optimization-algorithms online-learning automl blackbox-optimization bandit-algorithms lipschitz-bandit x-armed-bandit continuous-armed-bandit

Updated
Python

KKeishiro / Yahoo_recommendation

Yahoo! news article recommendation system by linUCB

recommendation-system contextual-bandit bandit-algorithms linucb

Updated
Python

gdmarmerola / interactive-intro-rl

Big Data's open seminars: An Interactive Introduction to Reinforcement Learning

machine-learning reinforcement-learning bandit-algorithms

Updated
Jupyter Notebook

sshkhr / Practical_RL

My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow

reinforcement-learning tensorflow deep-reinforcement-learning pytorch policy-gradient evolutionary-algorithms markov-decision-processes td-learning monte-carlo-sampling bandit-algorithms

Updated
Jupyter Notebook

Alanthink / banditpylib

A lightweight python library for bandit algorithms

bandit-algorithms

Updated
Python

niffler92 / Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

Updated
Python

kulinshah98 / Multi-Armed-Bandit-Algorithms

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

epsilon-greedy multi-armed-bandits upper-confidence-bounds bandit-algorithms stochastic-bandit-algorithms adversarial-bandit-algorithms exp3-algorithm

Updated
Python

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

paper speaker-recognition online-learning speaker-diarization contextual-bandits bandit-algorithms interspeech self-supervised-learning acml interspeech2020 online-speaker-diarization

Updated
Cuda

gdmarmerola / advanced-bandit-problems

More about the exploration-exploitation tradeoff with harder bandits

machine-learning multi-armed-bandit bandit-algorithms

Updated
Jupyter Notebook

mmalekzadeh / privacy-preserving-bandits

Privacy-Preserving Bandits (MLSys'20)

machine-learning reinforcement-learning recommender-system recommendation bandit-learning differential-privacy contextual-bandits bandit-algorithm federated-learning bandit-algorithms privacy-preserving-machine-learning online-machine-learning criteo-dataset differentially-private privacy-preserving-bandits

Updated
Jupyter Notebook

ZIYU-DEEP / Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

A curated list on papers about combinatorial multi-armed bandit problems.

thompson-sampling multi-armed-bandit combinatorial-optimization bandit-algorithms combinatorial-bandit

Updated

sparsh-ai / reco-bandit

Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning

recommender-system contextual-bandits bandit-algorithms

Updated
Jupyter Notebook

singhsidhukuldeep / contextual-bandits

A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms—including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit—designed for reinforcement learning applications

python machine-learning reinforcement-learning algorithms epsilon-greedy multi-armed-bandit contextual-bandits bandit-algorithms linucb

Updated
Python

babaniyi / Deep-contextual-bandits

Deep contextual bandits in PyTorch: Neural Bandits, Neural Linear, and Linear Full Posterior Sampling with comprehensive benchmarking on synthetic and real datasets

bandits bandit-algorithms multiarmed-bandits

Updated
Python

rssalessio / reading-list

This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.

learning machine-learning statistics reinforcement-learning deep-learning optimization reading-list bandit-algorithms

Updated

gokceuludogan / interactive-music-recommendation

Personalized and Interactive Music Recommendation with Bandit approach

music-recommendation bandit-algorithms exploration-exploitation bayes-ucb

Updated
Jupyter Notebook

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated
Python

ngutowski / algossim

This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.

recommendation-system artificial-intelligence-algorithms contextual-bandits bandit-algorithms

Updated
Python

Improve this page

Add a description, image, and links to the bandit-algorithms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bandit-algorithms topic, visit your repo's landing page and select "manage topics."

You can’t perform that action at this time.