{"id":3451,"date":"2018-11-01T22:12:20","date_gmt":"2018-11-01T16:42:20","guid":{"rendered":"http:\/\/www.intelligencenode.com\/blog\/?p=3451"},"modified":"2026-01-20T15:52:16","modified_gmt":"2026-01-20T10:22:16","slug":"how-to-classify-match-products-with-machine-learning","status":"publish","type":"post","link":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/","title":{"rendered":"How to Classify, Match Product With Machine Learning"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Billions of products are sold online, and there are numerous stores selling them. Identifying and matching products for various purposes like price comparison, becomes a challenge as there are no obvious global unique identifiers. This is where AI and Machine Learning come in.<\/span><\/p>\n<h2>So Many Products and No Way to Match Them Across Stores<\/h2>\n<p><span style=\"font-weight: 400;\">There are many situations where accurately identifying a product match is essential. Stores might want to compare competitor prices for the exact same products. Customers use price comparison tools to get the best deals. A store like Amazon that allows different sellers to offer the same products wants to be sure that they are the same products before listing the sellers in a single, unique product page.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">How do we approach this product matching problem? <\/span><\/p>\n<h2>The Sea of Confusion<\/h2>\n<p><span style=\"font-weight: 400;\">Product titles\/descriptions do not have a standardized format. Each store and different sellers within a store(say, eBay) might have a different title and description for the same product. Attribute listings also differ in format, images for the same product are also very different.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Of course, there are standardized unique identifiers like UPC, MPN, GTIN etc. However, these may not be mentioned in the product page in all stores selling them. The attributes themselves might be described differently &#8211; for instance 7\u201d and 7 inch.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Images may be included, but they can differ in perspective, clarity, tone etc. The Brand Name may also be referred to in different ways &#8211; GE and General Electric.<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-3453\" src=\"\/blog\/wp-content\/uploads\/2018\/11\/Screen-Shot-2018-11-01-at-9.20.07-PM.png\" alt=\"product name\" width=\"794\" height=\"392\" \/><\/p>\n<p><span style=\"font-weight: 400;\">It is impossible for a human to visit different seller pages and check the product pages to ensure that they are matching the same products. If the process is to be automated, how does that product matching system make sense of it all? Well, the big guns have got it all figured out, and we\u2019re going to let you in on the secret of how it\u2019s done.<\/span><\/p>\n<h2>AI To The Rescue &#8211; Machine Learning for Product Matching<\/h2>\n<p><span style=\"font-weight: 400;\">In machine learning solutions for product matching first, the solution provider has to build a database of billions of products. This is done by collecting information through web crawls and feeds. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">The system then has to come up with a universal taxonomy. This is a challenge because different retailers use different classifications for their products, and the same product might be listed in more than one category. For instance, a particular shoe model might be listed under sports shoe and under men&#8217;s walking shoe. The Product Match system first has to design a standardized taxonomy, irrespective of how a particular store classifies its products.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">There are standard classification models like Google Taxonomy, GS1, and Amazon. However, <a href=\"\/solutions\/product-matching\/\">a product match solution<\/a> might devise its own taxonomy. This universal taxonomy is designed by identifying patterns and signals from titles, breadcrumbs where available, product descriptions and attributes, and from images.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Once the taxonomy is in place, then comes particular product matches. Here, there is a need for precise comparisons to ensure that it is the same unique product, despite differences in titles, images, descriptions etc. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">First, there is a search for UPC, GTIN or other unique identifiers on the product page. Then, the product title has to be compared. No two product titles are the same across different stores for the same product, for example:<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">Google Pixel 2 GSM\/CDMA Google Unlocked (Clearly White, 64GB, US warranty) &#8211; Amazon.com:\u00a0 [ <a href=\"https:\/\/www.intelligencenode.com\/blog\/product-matching-ai-for-ecommerce\/\">product matching<\/a> example\\<\/span><\/i><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-3454 size-full\" src=\"\/blog\/wp-content\/uploads\/2018\/11\/Screen-Shot-2018-11-01-at-9.27.00-PM.png\" alt=\"Google pixel 2 white - Amazon\" width=\"625\" height=\"363\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Google Pixel 2 64GB Clearly White (Unlocked) Smartphone &#8211; eBay.com:<\/span><\/i><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-3455 size-full\" src=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/Screen-Shot-2018-11-01-at-9.29.19-PM.png\" alt=\"Google Pixel 2 ebay product matching\" width=\"633\" height=\"463\" \/><\/p>\n<p><span style=\"font-weight: 400;\">Neural networks and deep learning techniques are used to identify and learn from similarities, to identify and learn from differences, and to create word-level embedding to create a system of representation for common words. This involves teaching the system to identify different references to a unique entity &#8211; &#8216;hp&#8217; and Hewlett Packard, 7\u201d and 7 Inch and so on, to come up with one unique representation for each entity.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">A product can be identified using its title, its description, images, and attributes or specifications list. In many cases, the product title itself will yield a lot of information and the product matching system has to learn to sort the product name (for instance brand model) from the attributes.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Samsung Galaxy Note 8 is the phone model, and the title provides additional information like the memory size, US version, Factory Unlocked Refurbished, etc.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">All this information has to be extracted and sorted and put into the appropriate slots &#8211; Phone model, version, memory size, etc. Different product matching techniques might be used to help the system learn to parse and classify the sets of information. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">The next comparison might be the same product with more information in the form of tags in the title, description that contains memory and screen size information, and a specs table. These help add more knowledge about the product, and the machine will be better able to identify an exact product match or mismatch in the next comparison.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The standard identifying signals are similar results or positive matches for unique identification numbers (UPC or MPN), Classification, Brand, Title, Attributes, and Image.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For each comparison, the system goes through many steps, checks or safety valves. There is a search for a unique identification number, a test for keyword similarities, brand normalization and match (for example, HP is the same as Hewlett Packard), Attribute normalization and match (7\u201d is the same as 7 inch, 7 in., 7 inches), image matching, etc. There is also a check for variation in attributes:<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">Apple iPhone 8 Plus 5.5&#8243;, 64GB, Fully Unlocked, Gold:<\/span><\/i><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-13418 size-full\" src=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-plus-8-5-5.jpg\" alt=\"Iphone 8 Plus\" width=\"491\" height=\"398\" srcset=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-plus-8-5-5.jpg 491w, https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-plus-8-5-5-300x243.jpg 300w\" sizes=\"auto, (max-width: 491px) 100vw, 491px\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Apple iPhone 8 4.7&#8243;, 64GB, Fully Unlocked, Gold:<\/span><\/i><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-13420 size-full\" src=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-8-4.jpg\" alt=\"I phone 8 4\" width=\"476\" height=\"387\" srcset=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-8-4.jpg 476w, https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/iphone-8-4-300x244.jpg 300w\" sizes=\"auto, (max-width: 476px) 100vw, 476px\" \/><\/p>\n<p><span style=\"font-weight: 400;\">For the best product match result, there has to be at least 99% of positive results. Otherwise, it is a mismatch, even if it is a variation within what is essentially the same product (iPhone 8 5.5\u201d and iPhone 8 4.7\u201d)<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This is a complicated process, and different product match solutions may employ different techniques and training methods. But the advantage is that neural networks and machine learning systems learn over time, even from their mistakes, and so get better with each use.<\/span><\/p>\n<p><em>Find the product you want with 99% product matching accuracy. Get instant access to the world&#8217;s largest GTIN, UPC and EAN product database with our product matching solution. <a href=\"https:\/\/info.intelligencenode.com\/book-a-demo\">Get a Demo.<\/a><\/em><\/p>\n<p><!--HubSpot Call-to-Action Code --><span id=\"hs-cta-wrapper-dbf1550b-fda9-477d-aa94-d0f4ee33ff0a\" class=\"hs-cta-wrapper\"><span id=\"hs-cta-dbf1550b-fda9-477d-aa94-d0f4ee33ff0a\" class=\"hs-cta-node hs-cta-dbf1550b-fda9-477d-aa94-d0f4ee33ff0a\"><!-- [if lte IE 8]>\n\n\n<div id=\"hs-cta-ie-element\"><\/div>\n\n\n<![endif]--><a href=\"https:\/\/cta-redirect.hubspot.com\/cta\/redirect\/857779\/dbf1550b-fda9-477d-aa94-d0f4ee33ff0a\"><img loading=\"lazy\" decoding=\"async\" id=\"hs-cta-img-dbf1550b-fda9-477d-aa94-d0f4ee33ff0a\" class=\"hs-cta-img alignnone\" style=\"border-width: 0px;\" src=\"https:\/\/no-cache.hubspot.com\/cta\/default\/857779\/dbf1550b-fda9-477d-aa94-d0f4ee33ff0a.png\" alt=\"Product matching demo banner\" width=\"895\" height=\"200\" \/><\/a><\/span><script charset=\"utf-8\" src=\"https:\/\/js.hscta.net\/cta\/current.js\"><\/script><script type=\"text\/javascript\"> hbspt.cta.load(857779, 'dbf1550b-fda9-477d-aa94-d0f4ee33ff0a', {}); <\/script><\/span><!-- end HubSpot Call-to-Action Code --><\/p>\n<h2><span style=\"font-weight: 400;\">References:<\/span><\/h2>\n<ol>\n<li><em><a href=\"https:\/\/www.intelligencenode.com\/blog\/the-tech-powering-product-matching-accuracy\/\">Product Matching in eCommerce<\/a> using deep learning &#8211; Medium<\/em><\/li>\n<li><em>Product Matching: You&#8217;ve not heard of it, but its powering your <a href=\"https:\/\/www.intelligencenode.com\/blog\/why-retailers-must-use-price-comparison-engine-for-their-ecommerce\/\">price comparison engine<\/a> &#8211; Semantics3<\/em><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Billions of products are sold online, and there are numerous stores selling them. Identifying and matching products for various purposes like price comparison, becomes a challenge as there are no&#8230;<\/p>\n","protected":false},"author":2,"featured_media":3452,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"content-type":"","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[914,16],"tags":[1011,1148],"class_list":["post-3451","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","category-ecommerce","tag-product-match","tag-product-matching"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.4 (Yoast SEO v27.5) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Product Matching With Machine Learning<\/title>\n<meta name=\"description\" content=\"Automate your product matching process and improve data accuracy with our powerful matching engine.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to Classify, Match Product With Machine Learning\" \/>\n<meta property=\"og:description\" content=\"Automate your product matching process and improve data accuracy with our powerful matching engine.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/intelligencenode\" \/>\n<meta property=\"article:published_time\" content=\"2018-11-01T16:42:20+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-01-20T10:22:16+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"550\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Intelligence Node\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@bigdataNODE\" \/>\n<meta name=\"twitter:site\" content=\"@bigdataNODE\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Intelligence Node\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Product Matching With Machine Learning","description":"Automate your product matching process and improve data accuracy with our powerful matching engine.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/","og_locale":"en_US","og_type":"article","og_title":"How to Classify, Match Product With Machine Learning","og_description":"Automate your product matching process and improve data accuracy with our powerful matching engine.","og_url":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/","og_site_name":"Blog","article_publisher":"https:\/\/www.facebook.com\/intelligencenode","article_published_time":"2018-11-01T16:42:20+00:00","article_modified_time":"2026-01-20T10:22:16+00:00","og_image":[{"width":1024,"height":550,"url":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","type":"image\/jpeg"}],"author":"Intelligence Node","twitter_card":"summary_large_image","twitter_creator":"@bigdataNODE","twitter_site":"@bigdataNODE","twitter_misc":{"Written by":"Intelligence Node","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#article","isPartOf":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/"},"author":{"name":"Intelligence Node","@id":"https:\/\/www.intelligencenode.com\/blog\/#\/schema\/person\/dec96545f790884e8a05f794934695f1"},"headline":"How to Classify, Match Product With Machine Learning","datePublished":"2018-11-01T16:42:20+00:00","dateModified":"2026-01-20T10:22:16+00:00","mainEntityOfPage":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/"},"wordCount":1057,"image":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","keywords":["product match","product matching"],"articleSection":["Artificial Intelligence","Ecommerce"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/","url":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/","name":"Product Matching With Machine Learning","isPartOf":{"@id":"https:\/\/www.intelligencenode.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#primaryimage"},"image":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","datePublished":"2018-11-01T16:42:20+00:00","dateModified":"2026-01-20T10:22:16+00:00","author":{"@id":"https:\/\/www.intelligencenode.com\/blog\/#\/schema\/person\/dec96545f790884e8a05f794934695f1"},"description":"Automate your product matching process and improve data accuracy with our powerful matching engine.","breadcrumb":{"@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#primaryimage","url":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","contentUrl":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","width":1024,"height":550,"caption":"product matching blog banner"},{"@type":"BreadcrumbList","@id":"https:\/\/www.intelligencenode.com\/blog\/how-to-classify-match-products-with-machine-learning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.intelligencenode.com\/blog\/"},{"@type":"ListItem","position":2,"name":"How to Classify, Match Product With Machine Learning"}]},{"@type":"WebSite","@id":"https:\/\/www.intelligencenode.com\/blog\/#website","url":"https:\/\/www.intelligencenode.com\/blog\/","name":"Blog","description":"Intelligence Node Blog - Tips to Maximize Ecommerce Growth","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.intelligencenode.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.intelligencenode.com\/blog\/#\/schema\/person\/dec96545f790884e8a05f794934695f1","name":"Intelligence Node","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/7480bd94ee02b87b4ebf5881cdc6b554b7bad668d9932aab5765809e15ab9a2d?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/7480bd94ee02b87b4ebf5881cdc6b554b7bad668d9932aab5765809e15ab9a2d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/7480bd94ee02b87b4ebf5881cdc6b554b7bad668d9932aab5765809e15ab9a2d?s=96&d=mm&r=g","caption":"Intelligence Node"}}]}},"jetpack_featured_media_url":"https:\/\/www.intelligencenode.com\/blog\/wp-content\/uploads\/2018\/11\/blog-banner-product-matching.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/posts\/3451","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/comments?post=3451"}],"version-history":[{"count":15,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/posts\/3451\/revisions"}],"predecessor-version":[{"id":13421,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/posts\/3451\/revisions\/13421"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/media\/3452"}],"wp:attachment":[{"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/media?parent=3451"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/categories?post=3451"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.intelligencenode.com\/blog\/wp-json\/wp\/v2\/tags?post=3451"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}