Parallel and synchronized UCB2 for online recommendation systems

Mahmuda Rahman, Jae C Oh

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

As users' preferences shift continuously, recommendation system has to learn quickly from them. It is an interesting online learning problem as recommender does not have any prior knowledge about the distribution of items over the users. In this work, we generate a small recommendation set from a large number of items, with an intention that at least one of recommended items would satisfy the user and thus minimize user abandonment. We used multiarmed bandit algorithm for this purpose and avail multiple instances of Upper Confidence Bound2 (UCB2). Although UCB2 is theoretically proved to have a better regret bound than UCB1, unlike UCB1, it has not been used for parallel execution. We designed an efficient algorithm which runs multiple instances of UCB2 in parallel. Our algorithm suitably handles parameter synchronization, reward update and exploration decisions across multiple instances of UCB2 and ensures that they are capable of covering different types of users. While applied to real data, our method shows comparable performance over a recommendation system that runs multiple instances of UCB1 in parallel. We compared our results with Ranked Bandit Algorithm and Independent Bandit Algorithm.

Original languageEnglish (US)
Title of host publicationProceedings - 2015 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages413-416
Number of pages4
Volume1
ISBN (Electronic)9781467396172
DOIs
StatePublished - Feb 2 2016
Event2015 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology Workshops, WI-IAT Workshops 2015 - Singapore, Singapore
Duration: Dec 6 2015Dec 9 2015

Other

Other2015 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology Workshops, WI-IAT Workshops 2015
CountrySingapore
CitySingapore
Period12/6/1512/9/15

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint Dive into the research topics of 'Parallel and synchronized UCB2 for online recommendation systems'. Together they form a unique fingerprint.

Cite this