
Machine Unlearning #3 (Clustering)

Machine Unlearning is a series broken up into tiny, one-minute readable pieces to humor our ever-shortening attention span. Sharing the links to every single piece right below:


We have already gone through classification and prediction. Now let us see what clustering is. Another popular learning technique, clustering is different from the other two since it is an unsupervised learning technique. What does that mean?

Let us revisit the classification technique. We show the machine an orange and explain the features of the orange to it. Similarly, each different fruit and its features are shown to the machine during the training phase. Once it has learned enough, we use the machine to label a randomly picked fruit.

In clustering, such training does not take place. We present the system with a basket full of different fruits (Apples, Oranges, Bananas, Cherries, and Mangoes) and expect the system to sort them. How would the system go about this task?

Well, some features come into play here as well. The fruits in the basket differ from each other in color, shape, length, or size. The system might pick one of these features at random. Let’s consider the color of the fruit. The system starts by sorting the fruits based on their color. In our basket, apples and cherries get sorted together since they are both red. Similarly, bananas and mangoes get grouped together since they are both yellow. There would be a third group consisting only of oranges.

Then, the system would look at a different feature for the next round, say, the shape of the fruit. It looks at the red group of apples and cherries and checks whether all fruits in the group are of the same shape. Clearly, they aren’t. Thus, the sphere-shaped cherries get sorted together while the others (apples) get sorted as a second group. The red group is now split into apples and cherries. Similarly, the yellow group would also be split into two groups, one of bananas and one of mangoes. In our simple example, after round two, we are left with five unique groups (clusters) of fruits. This is how clustering is performed. The items in one cluster are very similar to one another, while they differ from the items in other clusters.
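For the curious, the two rounds above can be sketched in a few lines of Python. This is only an illustration of the basket example, not how any particular clustering library works: the feature values (such as "dimpled" for the apple's shape) and the helper names `split_by` and `cluster` are made up for this post. Real clustering algorithms usually measure how far apart items are on numeric features instead of matching labels exactly.

```python
from collections import defaultdict

# Hypothetical basket. The feature values are assumptions chosen
# only to mirror the example in the text.
BASKET = [
    {"name": "apple", "color": "red", "shape": "dimpled"},
    {"name": "apple", "color": "red", "shape": "dimpled"},
    {"name": "cherry", "color": "red", "shape": "sphere"},
    {"name": "banana", "color": "yellow", "shape": "curved"},
    {"name": "mango", "color": "yellow", "shape": "oval"},
    {"name": "orange", "color": "orange", "shape": "sphere"},
    {"name": "orange", "color": "orange", "shape": "sphere"},
]


def split_by(groups, feature):
    """Split every group into subgroups that share one value of `feature`."""
    result = []
    for group in groups:
        buckets = defaultdict(list)
        for fruit in group:
            buckets[fruit[feature]].append(fruit)
        result.extend(buckets.values())
    return result


def cluster(fruits, features):
    """Start with one big group and refine it, one feature per round."""
    groups = [list(fruits)]
    for feature in features:
        groups = split_by(groups, feature)
    return groups


def names(groups):
    """Return the fruit names in each group, sorted for easy reading."""
    return sorted(sorted(fruit["name"] for fruit in group) for group in groups)


# Round one: color only. Round two: color, then shape.
print(names(cluster(BASKET, ["color"])))
print(names(cluster(BASKET, ["color", "shape"])))
```

Notice that nobody told the program which fruit is which. It only compared features, which is what makes this unsupervised.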

As in the previous cases, let us now check how we employ clustering in our own lives.

Let us assume that you, an Indian, are in Dubai looking for a job. When the Arab interviewer asks you where you are from, you introduce yourself as an Indian. You get employed, you greet your employer with a Marhaba, and earn your Dirhams at the end of every month. When you meet another Indian in Dubai, you are elated. You greet them with a Namaste, get excited about the upcoming India-Pakistan cricket match, and probably make plans to celebrate Dussehra together with the Indian community in Dubai. However, the moment you arrive at the Dubai Indian Dussehra Party, you cease to be Indians and become Marathis, Tamilians, Rajasthanis, Assamese, or people of whichever state you are from. The differences between Indians become more pronounced. The Dussehra of the Delhiite becomes Durga Puja for the Bengalis or Vijayadashami for the Kannadigas. The people from the south of India collectively become idli-devouring, Telugu-speaking Madrasis for the northerners.

Things become even more complicated when you board the flight to come home for a vacation. Naturally, most passengers on the flight bound for your state would be people from the same state working in Dubai. As you interact with them, more differences start appearing. You gradually become less Keralite and more Thekkan (from the south of Kerala) or Vadakkan (from the north of Kerala). The Vadakkans, who were lamenting the northerners' bias against South Indians, themselves start looking down on the Thekkans, calling them self-centered and selfish. The Thekkans retaliate by making fun of the north’s sing-song accent.

As more and more features (country, state, and region) come into play, we get more and more divided along those lines. Like regression, clustering is not inherently devious. It is natural that people have their differences and feel an affinity for the group they fit into. Trouble starts when people forget the bigger picture and start placing their group above the others.

The rising resentment of natives toward immigrants, especially in first world countries, is an example of the same. According to them, the resources of a country are the natural right of the people born in that country. Anyone coming to their land from outside is often considered a parasite or a freeloader. In a world where the place of our birth is just a matter of chance, how futile is this petty mindset! People migrate from their homelands in the hope of a better life. They go to a culture alien to them, work hard, and strive for a decent living. Harassing them by calling them freeloaders while sitting in the comfort of your privilege is disgusting at best.








