Search and Recommendation

Published on Tuesday, October 13, 2015 and tagged with recsys.

Search and recommendation are closely related concepts.

@alansaid @xamat @mitultiwari I'm not really a #recsys guy – am more of a search guy. Though #recsys is just search with a null query. :-)
— Daniel Tunkelang (@dtunkelang) October 13, 2013

IR folks: how is the "zero query" scenario not just rec sys?
— I am K (@vanessa_murdock) October 13, 2015

What is search, but query-directed recommendation? Or recommendation, but zero-query search?

My students asked me about this relationship in my recommender systems class last spring. Trying to answer this question brought me to a formulation that seemed to help them, and perhaps it will be more broadly illuminating as well. For some of you this may well be old hat.

In information seeking, there are several things that may influence the suitability of an item. These include:

the item’s properties
the user’s preferences and interaction history
a query or other articulated information need provided by the user
the context at the point and time of recommendation

Further, most common recommendation and retrieval algorithms operate by first computing a score for each candidate item, and then ranking items by score. There are exceptions, of course, and many algorithms post-process either the score or the recommendation list (e.g. for diversification). But at the core of many algorithms is a scoring function:

\[ s(i;u,q,x): I \times U \times Q \times X \to \mathbb{R} \]

The purpose of this function is to score an item \(i\) for a user \(u\) who articulated a query \(q\) in context \(x\). We could also envision a direct retrieval function \(r(u,q,x): U \times Q \times X \to I^n\).
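To make the score-then-rank pattern concrete, here is a minimal Python sketch that builds a retrieval function from any scoring function by scoring each candidate and keeping the top \(n\). The names `retrieve` and `toy_score`, the use of plain strings for items, users, queries, and contexts, and the toy scoring rule are all assumptions for illustration; a real system would not usually score the whole catalog on every request.

```python
from typing import Callable, Iterable, List

# Hypothetical stand-ins for the sets I, U, Q, X: plain strings for simplicity.
Item = str
User = str
Query = str
Context = str

ScoreFn = Callable[[Item, User, Query, Context], float]


def retrieve(s: ScoreFn, items: Iterable[Item], u: User, q: Query,
             x: Context, n: int) -> List[Item]:
    """Score every candidate with s, then return the n highest-scoring items."""
    ranked = sorted(items, key=lambda i: s(i, u, q, x), reverse=True)
    return ranked[:n]


def toy_score(i: Item, u: User, q: Query, x: Context) -> float:
    """Toy scorer (illustrative only): count occurrences of query words in the item title.

    It ignores u and x entirely, so it behaves like a non-personalized search scorer.
    """
    words = i.lower().split()
    return float(sum(words.count(w) for w in q.lower().split()))


catalog = ["intro to search engines", "cooking for two", "search and recommendation"]
top = retrieve(toy_score, catalog, u="alice", q="search", x="mobile", n=2)
```

Here `retrieve` plays the role of \(r\) with \(s\) passed in explicitly, so swapping in a different scoring function changes the task without touching the ranking step.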

Under this formulation, the difference between traditional recommendation, traditional search, and other retrieval settings is simply the variables taken into account when computing the scores. We can define several kinds of tasks:

If \(s\) depends only on \(u\) and \(i\), we get a traditional recommender, producing suggestions for the user.
If \(s\) depends only on \(q\) and \(i\), we have a traditional non-personalized search engine.
If \(s\) depends on \(u\), \(x\), and \(i\), we have a context-aware recommender system.
If \(s\) depends on all its parameters, \(u\), \(q\), \(x\), and \(i\), we have a context-aware personalized search system, e.g. Google.
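One way to see the four cases side by side is to write each as a function with the same signature that simply ignores the arguments it does not use. The Python sketch below does this with made-up data; the helper names (`preference`, `text_match`, `context_fit`), the toy tables, and the choice to combine signals by adding them are all assumptions for illustration, not a claim about how any real system works.

```python
from typing import Dict, Set

# Hypothetical toy data; every name and number here is made up for illustration.
PREFS: Dict[str, Dict[str, float]] = {
    "alice": {"jazz album": 0.9, "rock album": 0.2},
}
EVENING_ITEMS: Set[str] = {"jazz album"}


def preference(i: str, u: str) -> float:
    """User signal: a stored preference for the item, or 0.0 if unknown."""
    return PREFS.get(u, {}).get(i, 0.0)


def text_match(i: str, q: str) -> float:
    """Query signal: fraction of query words that appear in the item name."""
    q_words = q.lower().split()
    if not q_words:
        return 0.0
    i_words = set(i.lower().split())
    return sum(w in i_words for w in q_words) / len(q_words)


def context_fit(i: str, x: str) -> float:
    """Context signal: a fixed boost for some items in the evening."""
    return 0.5 if x == "evening" and i in EVENING_ITEMS else 0.0


# Each scorer accepts (i, u, q, x) and ignores whatever it does not depend on.
def recommender(i: str, u: str, q: str, x: str) -> float:
    return preference(i, u)


def search(i: str, u: str, q: str, x: str) -> float:
    return text_match(i, q)


def context_aware_recommender(i: str, u: str, q: str, x: str) -> float:
    return preference(i, u) + context_fit(i, x)


def personalized_search(i: str, u: str, q: str, x: str) -> float:
    return text_match(i, q) + preference(i, u) + context_fit(i, x)
```

Because all four share one signature, any of them can be handed to the same ranking code; the task is determined entirely by which arguments the scorer reads.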

We discussed this formulation in a couple of different class sessions, and it seemed to click with students. I’m thinking that I’ll work it in more deeply next time I teach the class, perhaps even as the overall framing: our goal, students, is to study this function and different ways of computing it, assessing its effectiveness, and building it into a user experience.