Bundled Product Recommendations

April 23, 2024

Amit Pande

Principal Data Scientist

Samaneh Karimi

Lead Data Scientist

David Relyea

Director of Data Science, Item Recommendations

Nowadays, our guests expect us to deliver personalized experiences for them. One of the ways to make life easier for our guests is to bundle multiple complimentary items and recommend products so they can add needed items to their carts with a single click. Bundled product recommendations enhance a guest’s shopping experience by allowing them to quickly view and hopefully purchase products that are typically bought together, like a television with a TV mount or a razor with its associated blades. On Target.com, bundled recommendations typically contain the title ‘Frequently Bought Together’ or ‘Compatible Items.’ 
 
Figure 1 illustrates a product bundle created for a car seat. When a user clicks on the car seat, a bundle with a baby car mirror, a car seat organizer, and a seat protector is displayed. 
 
Image titled "Compatible Items" with four products pictured side-by-side, (from left to right) child car seat, car mirror, seat protector, and seat organizer, with a "add all 4 to cart" call to action in a red button at the bottom of the image
Figure 1: An example product bundle created for a car seat. 
 
For the remainder of this post, we’ll refer to the primary product viewed by the guest as the “driver product” — the product around which the bundle was built — and all the recommended products in the bundle as the “go-together products”.  
 
Bundling Algorithm Evolution 
 
Over the past few years, the Item Personalization team has created several bundling algorithms. Our simplest algorithm uses a technique called Associate Rule Mining (ARM) on product attributes. This algorithm provides recommendations but suffers from many shortcomings; it’s not good at determining which products are driver products (Figure 2), it will occasionally recommend bundled products which are either overly similar to one another (Figure 3) or to the driver product, and it doesn’t provide recommendations for a large portion of our catalog. 
 
Four products pictured side-by-side in a bundle, from left to right, two different kinds of baby strollers highlighted with red dotted lines, a car seat carrier, and a stroller cover
Figure 2: The driver product is not identified in this example bundle. 
Four products pictured side-by-side in a bundle, from left to right, a small printer, two different types of instant film outlined in dotted red lines, and a third type of instant film
Figure 3: Similar products are recommended in this example bundle.
 
Bundling using Product Graphs and Graph Neural Networks (GNN)  
 
To address these shortcomings, the team turned to graph-based methods. Figure 4 shows each step for generating bundled recommendations from these methods. In the rest of this section, these steps are explained in detail. 
 
Diagram showing the GNN-based bundling model, with three stacked rectangles on the left that read "Customer interaction data," "Product price," and "Item attributes," which all point to a flow that shows four steps marked "graph creation," "adding similarity relations to the graph," "Selecting edges for interference," and "predicting bundles" that flow to a final box that says "recommended bundles"
Figure 4: Overall architecture of the Graph Neural Network-based bundling method. 
Step 1: Graph Creation  
 
We start by representing the product catalog as a collection of separate product graphs for each product category (e.g., “Kitchen”). Each graph is created with nodes representing products in the category and edges representing relationships between the products. Each edge between two products considers both the count and order of joint purchases and/or adds-to-cart of the product pair, so the graph edges are both weighted and directed. These directed edges let the algorithm disambiguate driver and go-together products. 
 
Figure 5 illustrates an example of graph creation. The guest has added four products to their cart: bodysuits, baby socks, a mixing bowl, and a whisk, respectively in that order. Based on this cart, we form an edge in the Clothing category graph (if one didn’t already exist) from bodysuits to baby socks, {(bodysuits, socks)} ∈ Clothing, and we then increase its weight by one. We do the same in the Kitchen category graph from the mixing bowl to the whisk, {(mixing bowl, whisk)} ∈ Kitchen. 
 
Screenshot showing an example of an online cart created by a Target guest. Items in the card include a whisk, mixing bowl set, baby socks, and a baby bodysuit
Figure 5: Example of products added to cart in a session by a guest. 
 
After the graph is populated with all guest joint purchases and adds-to-cart (over a specific time period), its edge’s weights are normalized and the graph is reduced by removing edges whose weights fall below a threshold. In Figure 6, the product graph for the recommendations of Figure 1 is shown. All products in Figure 1 are represented as nodes in this graph, and the edges from the car seat to the other three products show that we can recommend the other three products for the car seat.  
 
Product bundle showing the image of a car seat on the left with arrows and weighted numbers pointing to three separate products, a seat organizer (weighted 0.25), a car mirror (weighted 0.50), and a seat protector (weighted 0.25)
Figure 6: The bundle in Figure 1 represented as a weighted and directed graph. 
 
We initially used this graph as our bundle recommendation algorithm, pulling product pairs from the graph based on the strength of the edges and then applying price and diversity filters to the product pairs. This Complimentary Items (CI) graph approach produced better results than the ARM algorithm, but it still has massive gaps because we have a large catalog, so the odds that any two products have been frequently purchased or added to the cart together are low. 
 
Step 2: Leveraging Graph Neural Networks and Product Similarity for Link Prediction  
 
To generate more product relationships in the graph, we leverage two different techniques. First, we pad out the graph by leveraging product linkages predicted by our Similar Items algorithm (currently used to recommend substitute products for guests). Intuitively, we do this because similar driver products should have similar products in their bundle recommendations. Figure 7 provides an example of this intuition. The bottom car seat has three accessories as part of its bundle recommendation. The top car seat is similar to the bottom car seat, so it could potentially also have these three accessories in its bundle recommendation. Hence, the list of possible recommended products for the top product’s bundle will include these accessories. A complete description of the Similar Items algorithm would not fit here; in short, it creates a distinct graph for each category where the edges represent how similar products are to one another. For each category, we combine the Bundle graph and the Item Similarity graph by taking their sum to form a padded graph. 
 
Diagram showing five different products (two different car seats, a seat organizer, a car mirror, and a seat protector) using arrows to show the relations between the products based on similarity and complimentary relations. The two car seats have two green arrows pointing to one another, while dotted blue lines go from the first car seat to the other products showing a predicted complimentary reaction
Figure 7: Selecting graph edges for inference based on similarity and complimentary relations. 
 
Second, we train a GNN using the SEAL [1] framework. The SEAL framework is a GNN-based approach that takes node attributes and the subgraph information as input and predicts the probability of the link existence. To make training data, we use the existing edges in the padded graph as “positive” samples and an equal number of randomly chosen made-up edges as “negative” samples. We then use this data to predict if two nodes in the padded graph are connected at two edges (where at least one edge was in the original bundle graph). The GNN learns to predict which product relationships seem plausible, even if guests have never purchased those specific products together and the items’ embeddings did not immediately suggest similarity. After this step, we end up with a larger graph with both real and predicted product relationships.  
 
Finally, we use connections in the resulting graph to generate a final list of potential products for each driver product’s bundle. Depending on where the algorithm is being used, the top K predicted products (K=4 for the Frequently Bought Together headline on a product’s page) are shown to the guest.  
 
RESULTS 
 
We deployed the model to production and A/B tested it over live traffic. We observed over 35% improvement over the existing co-purchase-based model in terms of click-based revenue and over 50% improvement in coverage of products with recommendations. We have skipped the technical details here for brevity, but they are available in our paper [2] for those who may wish to explore this in more detail.  
 
 
 
ACKNOWLEDGMENTS  
The authors would like to thank Saloni Agarwal and Aparupa Das Gupta for their work on the bundling algorithm [2] at Target. 
 
 
References 
[1] Muhan Zhang and Yixin Chen. 2018. Link prediction based on graph neural networks. Advances in Neural Information Processing Systems 31 (2018), 5165– 5175.  
[2] Agarwal, S., Das Gupta, A., & Pande, A. (2022, April). Improving Bundles Recommendation Coverage in Sparse Product Graphs. In Companion Proceedings of the Web Conference 2022 (pp. 1037-1045). 

RELATED POSTS

Real-Time Personalization Using Microservices

By Amit Pande, Prathyusha Kanmanth Reddy, and Pushkar Chennu, May 11, 2023
How Target's Personalization team uses microservices to improve our guest experience

Elevating Guest Repurchasing Behavior Using Buy It Again Recommendations

By Amit Pande and Rankyung Park, November 2, 2023
Target’s Data Science team shares an inside look at our Buy It Again model.