Synerise Monad: A Foundation Model for Behavioral Event Data

Barbara Rychalska, Szymon Łukasik, Jacek Dąbrowski

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The complexity of industry-grade event-based datalakes grows dynamically each passing hour. Companies actively gather behavioral information on their customers, recording multiple types of events, such as clicks, likes, page views, card transactions, add-to-basket, or purchase events. In response to this, the Synerise Monad platform has been proposed. The primary focus of Monad is to produce Universal Behavioral Representations (UBRs) - large vectors encapsulating the behavioral patterns of each user. UBRs do not lose knowledge about individual events, in contrast to aggregated features or averaged embeddings. They are based on award-winning algorithms developed at Synerise - Cleora and EMDE - and allow to process real-life datasets composed of billions of events in record time. In this paper, we introduce a new aspect of Monad: private foundation models for behavioral data, trained on top of UBRs. The foundation models are trained in purely self-supervised manner and allow to exploit general knowledge about human behavior, which proves especially useful when multiple downstream models must be trained and time constraints are tight, or when labeled data is scarce. Experimental results show that the Monad foundation models can reduce training time by half and require 3x less data to reach optimal results, often achieving state-of-the-art results.

Original languageEnglish
Title of host publicationSIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages3344-3348
Number of pages5
ISBN (Electronic)9781450394086
DOIs
Publication statusPublished - 19 Jul 2023
Externally publishedYes
Event46th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2023 - Taipei, Taiwan, Province of China
Duration: 23 Jul 202327 Jul 2023

Publication series

NameSIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference46th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2023
Country/TerritoryTaiwan, Province of China
CityTaipei
Period23/07/2327/07/23

Keywords

  • behavioral modeling
  • big data
  • foundation models
  • graph learning
  • representation learning

Fingerprint

Dive into the research topics of 'Synerise Monad: A Foundation Model for Behavioral Event Data'. Together they form a unique fingerprint.

Cite this