When I wrote down the title of this blog post, I first wondered if is worth while writing on. So I performed a quick google search on “the difference between segmentation and clustering” to get an impression. It yields a lot of links to … segmentation and clustering, but very few of them clearly give a distinction between the two.
Let us first look at some definitions we can find :
Clustering :
“the process of organizing objects into groups whose members are similar in some way” (here)
“a number of different algorithms and methods for grouping objects of similar kind into respective categories” (here)
“a way to form ‘natural groupings’ or clusters of patterns” (here)
“the assignment of a set of observations into subsets (called clusters) so that observations in the same cluster are similar in some sense” (http://en.wikipedia.org/wiki/Cluster_analysis)
Segmentation
“Customer segmentation is the practice of dividing a customer base into groups of individuals that are similar in specific ways relevant to marketing, such as age, gender, interests, spending habits, and so on” (here)
“A marketing technique that targets a group of customers with specific characteristics” (here)
“A market segment is a sub-set of a market made up of people or organizations sharing with one or more characteristics that cause them to demand similar product and/or services based on qualities of those products such as price or function. A true market segment meets all of the following criteria: it is distinct from other segments (different segments have different needs), it is homogeneous within the segment (exhibits common needs); it responds similarly to a market stimulus, and it can be reached by a market intervention.” (here)
Segmentation groups objects into similar groups The resulting groups contain members that are more similar to each other than they are to other groups(here).
It looks like there is a lot of similarity between the two, but, as I will explain, actually there is no similarity whatsoever !
Let’s begin with segmentation.
My definition is simple : “dividing something into pieces according to some criteria”. And we call each piece a segment.
When we deal with a group of customers, we talk about customer segmentation or market segmentation. It’ s just splitting the whole customer base into groups of customers that have some characteristic in common. It can be gender : men and women.
But it can also be age_category : people under 40 and people over 40, or net income : people who earn less than 10,000$ and those who earn more.
That’s all that is to segmentation : decide on your criteria (actually on the borders between the segments) and assign each customer, or whatever you want to segment, to its segment.
So what about clustering ?
Wikipedia gives your a few dozen of definitions for all sorts of clusters. The meaningful word that occurs the most is “group”. So a cluster is a group.
Even better is when we seach in google for pictures of clusters : We find for example a beautiful picture of the pleiades :
,
With “cluster analysis” as search term the first picture is :
A lot more simple than the pleiades, but gives the same impression : groups of points ! Or otherwise put : density differences in a two-dimensional space (pictures are essentially two-dimensional).
To conclude : my definition of clustering : finding regions in a (one-, two-, or multi-dimensional) space with a different density of items than the neighboring regions.
And “finding regions” means : finding the borders between the regions, because it is only if you know where the borders are that you can say where the region is situated.
Why then the seeming similarity between segmentation and clustering ?
In order to practice segmentation you have to decide on the borders between the segments.
That is simple if you only have to deal with one or two characteristics. In Marketing we sometimes have hundreds of characteristics of our customers. That makes it a bit more complex and often people use advanced statistical methods (like K-means clustering or Kohonen maps) to find these borders for them in the multidimensional space of their customer database. Eventually they will use these borders for actually segmenting their customer base.
Conclusion : clustering is finding borders between groups, segmenting is using borders to form groups.
And one Final remark :
Segmentation is always possible, even in an extremely homogeneous collection of items. You just decide where you will cut between the groups.
Finding clusters in this extremely homogeneous collection is impossible, since by definition there are no density differences, and hence no clusters to find !
Related articles by Zemanta
- market segmentation (slideshare.net)
- Clustering large datasets (metalev.org)
- Agglomerative Hierarchical & Divisive Clustering (slideshare.net)
[…] The Difference between Segmentation and Clustering (zyxo.wordpress.com) […]
By: Video by Google on K-Means Clustering | Herbert Mühlburger's Weblog on August 4, 2010
at 12:44 pm
You explained very well difference between Segmentation and Clustering. I was very confused about it. Thanks
By: Megha Agarwal on September 6, 2010
at 6:33 am
Excellent !!!!
By: Zain on July 1, 2011
at 7:14 pm
You just cleared up my confusion. Great work!
By: Ogre on November 21, 2011
at 9:02 am
Simply Superb…!!!!
By: vivek on December 14, 2011
at 4:39 am
I loved the explaination its a brilliant and concise way of explaination
By: sumiya on October 17, 2012
at 4:47 am
Well, so how can we implement segmentation?
By: in2008 on January 26, 2013
at 3:23 pm
Difficult to say from here. First of all you have to know what’s the purpose of that segmentation. Because segmentation is a tool, not an goal. So what do you want to do with that tool?
Zyxo
By: zyxo on January 26, 2013
at 4:15 pm
Hi! I want to better understand customer profiles to apply retention policies.
By: in2008 on February 8, 2013
at 5:34 pm
As I onderstand you want to identify customers who are going to churn. For me that is just another targeting model : from your history database you select people who churned and people who did not. With some algorithm like decision tree, logistic regression or whatever you can calculate a model to predict the probability of churning, based on whatever data you have at the moment before the churn.
Hope that helps.
Zyxo
By: zyxo on February 20, 2013
at 4:54 pm
Very Nice
By: Senthil on September 25, 2013
at 9:44 am
put it really simple, segmentation is an application to marketing of clustering statistical techniques, which are applied in similar ways to many other fields. if you wonder about their difference/similarities simply means that you never studied multivariate statistics. moreover, there are many ways to cluster units, finding similarities is just one of many. again, look for a good book in multivariate statistics.
By: georgie on February 14, 2014
at 3:26 pm
Georgie,
If you carefully read my post you will see that we totally agree, only not in the fact that I never studied multivariate statistics 🙂
Zyxo
By: zyxo on February 14, 2014
at 6:47 pm
[…] The Difference between Segmentation and Clustering (zyxo.wordpress.com) […]
By: Video by Google on K-Means Clustering | blog.muehlburger.at on June 9, 2014
at 10:42 am
What about difference between segmentation and classification in data mining tasks? Can we say that segmentation is classification when objects are costumers and criteria for dividing is relevant to Marketing?
By: Anaj on January 27, 2016
at 2:22 pm
Hi, Anaj,
Thanks for visiting my blog.
About segmentation and classification, I figure the two are synonyms (segment, class, category, subgroup …). Perhaps “segmentation” is more used in the marketing context.
Zyxo
By: zyxo on January 29, 2016
at 7:38 pm
Can someone please tell me the difference between Market Segmentation and Customer Classification?
By: Emily on February 22, 2016
at 1:53 pm
Hi, Emily,
As I see it, the two are largely synonyms, although there might be a difference:
market segmentation deals with the market, which means you divide your market into segments according to some criteria.
See for example this definition from wikipedia: “dividing a broad target market into subsets of consumers, businesses, or countries that have, or are perceived to have, common needs, interests, and priorities”
Customer classification means you take each customer and puts them in a particular class, category, market segment or whatever you like to call it.
By: zyxo on February 25, 2016
at 1:10 pm