TY - GEN
T1 - The SYNERISE dataset
T2 - Workshop on the 19th ACM Conference on Recommender Systems, RecSysChallenge 2025
AU - Dabrowski, Jacek
AU - Janicka, Maria
AU - Sienkiewicz, Łukasz
AU - Stomfai, Gergely
AU - Dietmar, Jannach
AU - Barile, Francesco
AU - Polignano, Marco
AU - Pomo, Claudio
AU - Srivastava, Abhishek
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/9/21
Y1 - 2025/9/21
N2 - Research in the area of recommender systems heavily relies on offline experimentation with historical data. The validity of such research efforts may however be limited by the quality and representativeness of publicly available datasets. To address these limitations, we introduce the Synerise dataset as a new, large-scale e-commerce dataset derived from real-world logs. This dataset provides rich, time-stamped user-item interactions alongside detailed item metadata - including categories, descriptions, and prices - and incorporates user search and navigation behavior for a more holistic understanding of user intent. In the paper, we provide a description of the dataset and how it can be used for model evaluation in different research questions. Furthermore, we provide an overview of the ACM RecSys 2025 Challenge, which introduced the novel task of Universal Behavioral Modeling, and which was based on the Synerise dataset. The dataset can be downloaded at https://recsys.synerise.com.
AB - Research in the area of recommender systems heavily relies on offline experimentation with historical data. The validity of such research efforts may however be limited by the quality and representativeness of publicly available datasets. To address these limitations, we introduce the Synerise dataset as a new, large-scale e-commerce dataset derived from real-world logs. This dataset provides rich, time-stamped user-item interactions alongside detailed item metadata - including categories, descriptions, and prices - and incorporates user search and navigation behavior for a more holistic understanding of user intent. In the paper, we provide a description of the dataset and how it can be used for model evaluation in different research questions. Furthermore, we provide an overview of the ACM RecSys 2025 Challenge, which introduced the novel task of Universal Behavioral Modeling, and which was based on the Synerise dataset. The dataset can be downloaded at https://recsys.synerise.com.
KW - Dataset
KW - Deep Relational Learning
KW - Evaluation
KW - Recommender Systems
KW - Sequential Recommendation
UR - https://www.scopus.com/pages/publications/105020568843
U2 - 10.1145/3758126.3758188
DO - 10.1145/3758126.3758188
M3 - Conference contribution
AN - SCOPUS:105020568843
T3 - Proceedings of the Workshop on the ACM RecSys Challenge 2025
SP - 1
EP - 6
BT - Proceedings of the Workshop on the ACM RecSys Challenge 2025
PB - Association for Computing Machinery, Inc
Y2 - 22 September 2025 through 26 September 2025
ER -