A recommender system is only as effective as its understanding of user propensities. The SM-RS dataset links contextual impressions with self-reported preferences, enabling the development of personalized, multi-objective recommendations.
Recommender systems (RS) have long focused on delivering accurate results, aiming to align recommendations with user profiles. However, as user expectations evolve, beyond-accuracy metrics such as novelty, diversity, and fairness have gained prominence.
The SM-RS dataset, which we introduced at SIGIR ’24 with Patrik Dokoupil and Ladislav Peska, addresses this need by integrating single- and multi-objective recommendation strategies with contextual user impressions and self-reported preferences.
Traditional RS datasets lack nuanced information about user preferences for beyond-accuracy objectives like diversity or novelty. SM-RS fills this gap by:
- Introducing ground truth propensities. The dataset captures self-declared user preferences toward relevance, novelty, and diversity.
- Providing contextual impressions. SM-RD logs detailed user interactions with displayed recommendations, offering insights into decision-making processes.
- Enabling multi-objective recommendation. This dataset enables fine-grained optimization for both single- and multi-objective recommenders.
SM-RS Features
- User behavior and contextual impressions.
- Contains 55,000 displayed items and 11,000 user selections.
- Tracks the impact of impressions on user decision-making.
- Includes normalized marginal gains (NMG) for relevance, novelty, and diversity.
- Self-reported propensities.
- Gathers user feedback on the importance of beyond-accuracy metrics after each iteration.
- Enables within-session analysis of propensity changes.
- Additional components.
- MovieLens-derived metadata enriched with visuals and descriptions.
- JSON-based structure for user profiles, interactions, and propensity data.
- Pre-computed relevance, novelty, and diversity metrics.
Possible use cases
The dataset supports diverse applications, including:
- Unbiased evaluation of Multi-Objective Recommenders (MORS).
- Reduces bias inherent in single-objective training datasets.
- Evaluates the interplay between relevance, novelty, and diversity.
- User behavior analysis.
- Maps user actions to self-reported propensities, deepening understanding of decision patterns.
- Explores the influence of contextual impressions.
- Pareto optimization and decision-making.
- Evaluates algorithms selecting recommendations from the Pareto frontier.
- Assesses user satisfaction with proportionality in multi-objective trade-offs.
- Beyond-accuracy criteria exploration.
- Serves as a foundation to innovate on newer beyond-accuracy metrics.
Possible tasks
The dataset supports diverse tasks, including:
- Propensity prediction.
- Predict user preferences toward relevance, novelty, and diversity.
- Baseline methods include locally and globally normalized NMG computations.
- Proportional recommendation construction.
- Create recommendations matching user-defined propensities.
- Metrics include mean absolute error (MAE) and KL-divergence.
- Impression-aware re-ranking.
- Optimize single-objective recommendations by incorporating contextual impressions.
- Evaluated using nDCG@10 and precision@5.
Future directions
SM-RS sets a new benchmark for recommender systems by combining contextual impressions with multi-objective strategies. Nevertheless, it has some limitations:
- Small user base. 227 participants may restrict scalability for certain algorithms.
- Short-term context. Limited longitudinal insights due to brief study duration.
Future research can expand SM-RS by increasing participant diversity, extending study periods, and integrating additional domains beyond movies.