Online Recommendation Systems: What it Really Means to be 'Somebody Like You' - Part 1

PokitBlog is starting a new trend called Technical Tuesdays #techtuesdays (Throwback Thursdays are so yesterday). In honor of our amazing technical and data science teams and the insightful content they have to share, we're kicking this one off with a post from our own Denise Gosnell, PhD, engineer and data scientist extraordinaire, as recently seen quoted in USA Today. Here's Part 1 of What it Really Means to be 'Somebody Like You'. And don't forget to join us again next Tuesday for Part 2 of this post on #techtuesdays.

Part 1

The other day, I hopped on a favorite online media streaming service and started looking for something to watch. As I clicked from movie to movie, I noticed something – the recommendations were getting smarter with each click. As a frequent online shopper, I've become accustomed to the product recommendation pane that typically appears at the bottom of a page, but this particular one intrigued me. As I browsed around the site, the recommendation section changed from “people who viewed this item also viewed...” to “people who viewed this product ultimately watched...” and the suggestions were right on.

My data-centric mind turning, I wondered, how are they doing this?

Being my nerdy self, I took to the drawing board to get a general idea of the logic behind this morphing algorithm. I wondered if they were using a graph database to make these suggestions. And if so, how would that work?

Let’s look at what is happening here from a graph perspective. [mental check: have you gone through our intro to graph theory? It’s a quick one - head over here and then come back.]

Using graph theory, this is what your interaction graph looks like when you start browsing movies on an online streaming service:

[Figure: your interaction graph]

The power of an online streaming recommendation system comes from the thousands of people who have previously looked at the same movies you are currently looking at. Consider two customers who looked at the same movies as you did, and let’s visualize their interaction graph:

[Figure: the interaction graph for Alice and Bob]

In this example, your online browsing experience is directly influenced by the sequence of movies previously selected by people like you [in this case, our Alice and Bob]. The data trails from previous customers give us the golden list: the movies watched by other people who also clicked on the movies you are interested in. In real time, the whole interaction comes together like this:

[Figure: the whole interaction in real time]

As you click through your favorite movies, the power of graph databases enables dynamic and online processing of your data to create real time recommendations specifically for you.
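
To make the idea concrete, here is a minimal sketch of that "people like you" walk in plain Python. It is not the streaming service's actual algorithm (I can only guess at that), and it uses dictionaries rather than a real graph database. The movie titles, the `clicks` data, and the function names `build_graph` and `recommend` are all made up for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical click trails: each viewer maps to the movies they clicked on.
clicks = {
    "you":   ["Movie A", "Movie B"],
    "alice": ["Movie A", "Movie B", "Movie C"],
    "bob":   ["Movie B", "Movie C", "Movie D"],
}


def build_graph(clicks):
    """Return both directions of the viewer-movie interaction graph."""
    viewer_to_movies = {viewer: set(movies) for viewer, movies in clicks.items()}
    movie_to_viewers = defaultdict(set)
    for viewer, movies in viewer_to_movies.items():
        for movie in movies:
            movie_to_viewers[movie].add(viewer)
    return viewer_to_movies, movie_to_viewers


def recommend(viewer, clicks, top_n=3):
    """Rank unseen movies by how many viewer -> movie -> peer -> movie paths reach them."""
    viewer_to_movies, movie_to_viewers = build_graph(clicks)
    seen = viewer_to_movies.get(viewer, set())
    scores = Counter()
    for movie in seen:                                    # hop 1: you -> a movie you clicked
        for peer in movie_to_viewers[movie] - {viewer}:   # hop 2: that movie -> someone like you
            for candidate in viewer_to_movies[peer] - seen:  # hop 3: their other movies
                scores[candidate] += 1
    # Highest score first; ties broken alphabetically so the result is stable.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [movie for movie, _ in ranked[:top_n]]


print(recommend("you", clicks))  # ['Movie C', 'Movie D']
```

Movie C comes out on top because three separate paths lead to it (one through Movie A and Alice, two through Movie B via Alice and Bob), while Movie D is reachable only through Bob. Every new click adds edges to the graph, so rerunning the walk is what makes the suggestions appear to get smarter as you browse.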

While this is just an initial glimpse into the wide world of recommendation systems, it is natural to begin to think of ways to improve the system. For example, if we throw a few thousand more people like you into the mix, we have more data and thus more personalized recommendations. We can also choose to include varying windows of historical information in the system; the possibilities and techniques are endless.
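
As one illustration of the "windows of historical information" idea, the sketch below only counts clicks that fall inside a chosen time window. Again, this is an assumption-laden toy: the `log` entries, the dates, and the function name `recommend_in_window` are hypothetical, and scoring each candidate by its number of recent peer clicks is just one simple choice among many.

```python
from collections import Counter
from datetime import datetime, timedelta

# Hypothetical click log: (viewer, movie, time of click).
log = [
    ("you",   "Movie A", datetime(2015, 6, 1)),
    ("alice", "Movie A", datetime(2015, 5, 30)),
    ("alice", "Movie C", datetime(2015, 5, 31)),
    ("bob",   "Movie A", datetime(2014, 1, 10)),
    ("bob",   "Movie D", datetime(2014, 1, 11)),
]


def recommend_in_window(viewer, log, now, window):
    """Recommend unseen movies using only clicks made within `window` of `now`."""
    cutoff = now - window
    recent = [(v, m) for v, m, t in log if t >= cutoff]
    seen = {m for v, m in recent if v == viewer}
    # Peers are other viewers who recently clicked something the viewer also clicked.
    peers = {v for v, m in recent if m in seen and v != viewer}
    scores = Counter(m for v, m in recent if v in peers and m not in seen)
    # Highest score first; ties broken alphabetically.
    return sorted(scores, key=lambda m: (-scores[m], m))


now = datetime(2015, 6, 2)
print(recommend_in_window("you", log, now, timedelta(days=30)))   # ['Movie C']
print(recommend_in_window("you", log, now, timedelta(days=730)))  # ['Movie C', 'Movie D']
```

With a 30-day window, Bob's year-old clicks are ignored and only Alice counts as somebody like you. Widening the window to two years brings Bob (and Movie D) back in, which shows the trade-off: more history means more data, but not all of it is equally fresh.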

Why would we, PokitDok, be educating you about graph systems set against a movie-watching backdrop? To find out, consider this a scene setter and stay tuned for a health-inspired Part 2, where we will demonstrate some of our latest breakthroughs in graph theory technology along with what they could mean for the future of healthcare.

About Denise Gosnell, PhD

Dr. Gosnell, a driving member of the PokitDok Data Science team since 2014, has brought her research in applied graph theory to help architect the graph database while also serving as an analytics thought leader. Her work with the Data Science team aims to extract insight from the trenches of hidden data in healthcare and build products to bring the industry into the 21st century. She also helps organize the local chapter of Charleston Data Analytics, a Meetup PokitDok now sponsors, and has represented PokitDok's Data Science team at numerous conferences, including PyData, KDD (Knowledge Discovery & Data Mining), and the inaugural GraphDay.

Prior to PokitDok, she earned her Ph.D. in Computer Science from the University of Tennessee, where she founded a branch of Sheryl Sandberg's Lean In Circle. The goal of this impressive organization is to guide women interested in computer science careers, as TechCrunch noted, and Denise has done that and more.
