Skip to main content

Machine Unlearning #3 (Clustering)

Machine Unlearning is a series broken up into tiny, one-minute readable pieces to humor our ever-shortening attention span. Sharing the links to every single piece right below:


We have already gone through classification and prediction. Now let us see what clustering is. Another popular learning technique, clustering is different from the other two since it is an unsupervised learning technique. What does that mean?

Let us revisit the classification technique. We show the machine an Orange and explains the features of the Orange to it. Similarly, each different fruit and its features are shown to the machine during the training phase. Once it has learned enough, we use the machine to label a randomly picked fruit.

In clustering, such training does not take place. We present the system with a basket full of different fruits (Apples, Oranges, Bananas, Cherries, and Mangoes) and expect the system to sort them. How would the system go about this task?

Well, some features come to play here as well. The fruits in the basket differ from each other on the basis of color, shape, length, or size. The system might pick one of these features in random. Let’s consider the color of the fruit. The system starts sorting the fruits based on their color first. In our basket, apples and cherries get sorted together since they are both red. Similarly, bananas and mangoes get grouped together since they are both yellow in color. There would be a third group consisting only of oranges. 

Then, the system would look at a different feature for the next round, say - the shape of the fruit. It looks at the red group of apples and cherries and checks if all fruits in the group are of the same shape. Clearly, they aren’t. Thus, the sphere-shaped cherries get sorted together while the others (apples) get sorted as a second group. The red fruits group is now split into apples and cherries. Similarly, the yellow group would also be split into two groups - consisting of bananas and mangoes. In our simple example, after round two, we are left with five unique groups (clusters) of fruits. This is how clustering is performed. The items in one cluster would be very similar to one another, while they would have differences with items of another cluster.

As in the previous cases, let us now check how we employ clustering in our own lives.

Let us assume that you, an Indian, is in Dubai looking for a job. When the Arab interviewer asks you where you are from, you introduce yourself as an Indian. You get employed, you greet your employer with a Marhaba, and earn your Dirhams at the end of every month. When you meet another Indian in Dubai, you are elated. You greet them Namaste, get excited about the upcoming India Pakistan cricket match, and probably make plans to celebrate Dussehra together with the Indian community in Dubai. However, the moment you arrive at the Dubai Indian Dussehra Party, you cease to be Indians and become Marathis, Tamilians, Rajasthanis, Assamese, or whichever state you are from. The differences between Indians become more pronounced. The Dussehra of the Delhiite becomes Durga Puja for the Bengalis or Vijayadashami for the Kannadigas.  The people from the south of India collectively become idli devouring, Telugu speaking Madrasis for the northerners.

Things become even complicated when you board the flight to come home for a vacation. Naturally, most passengers in the flight bound to your state would be people from the same state working in Dubai. As you interact with them, more differences start appearing. You start becoming Keralites less and Thekkans (people from the south of Kerala) or Vadakkans (north of Kerala) more. The Vadakkans, who were lamenting the bias against the South Indians by the Northerners, themselves start looking down at the Thekkans - calling them self centered and selfish. The Thekkans retaliate by making fun of the north’s sing-song accent.

As more and more features (country, state, and region) come into play - we get more and more divided on those lines. Like regression, clustering is not devious inherently. It is natural that people have their differences and express affinities towards the group they fit in. Trouble starts when people forget the bigger picture and start placing their group above the others.

The rising resentment over immigrants by natives, especially of first world countries could be termed as an example of the same. According to them, the resource in a country is a natural right of the people born in the country. Anyone coming from outside to their land is often considered parasites, or freeloaders. In a world where the place of our birth is just a matter of chance, how futile is this petty mindset! People migrate from their homelands in the hopes of a better life. They go to a culture alien to them, work hard, and strive for a decent living. Harassing them by calling them freeloaders while sitting in the comforts of your privilege is disgusting at best.









Comments

Popular posts from this blog

First Rejection

About four years ago, I self-published my first novel, I Want Your Life . Self-publishing was not my initial plan. Like most aspiring writers, I envisioned my book being published under the label of one of the prominent publishing houses in the country, such as Penguin or HarperCollins. However, my editor gave me a reality check and explained that large publishing houses receive several submissions daily, making it highly unlikely for a first-time writer to get noticed. More than the prestige of being published under an established banner, my excitement lay in seeing my story in the form of a book. So, I chose the self-publishing route. After months of back-and-forth with my publishing team - proofreading, editing, revisions, illustrations, book registration, and printing - my book was finally out. It was truly a dream come true. The upside of self-publishing is the sweet certainty of getting published. The downside, however, is that you are responsible for everything, including promot...

IFF(FPC)K

 If you know me personally, you might also know that I am a regular at the International Film Festval of Kerala, held annually in December at Thiruvananthapuram (and if you don’t know me personally, just read the first sentence). A habit that started during my Thiruvananthapuram days in 2016, I look forward and try to attend the week long celebration of cinema - unless life or a killer virus gets in the way. Prologue Since the dates for the next fest are officially announced during the conclusion of the current fest, what follows is an year long wait. Nothing significant happens for the most part of the year, so let us skip ahead to November - when the registration opens for delegates. Yes, we know that there are over ten thousand delegate passes and registration window usually lasts days - still most of my friends (and I) hit the servers with a passion  until our registrations go through. Once that is done, the wait begins for the schedule to be released - where we would come...

Familial Cowswami

Last day, I had the misfortune of interacting with the husband of one of my cousins at a family event. Since we’ve already established my feelings toward him, let me clarify that he is not an evil person. It’s just that our ideologies and outlooks toward life could poetically be described as Yin-Yang—completely opposite, with no common ground. That’s fine; everyone is entitled to their vision, as long as they don’t try to claim their vision as the only correct one and attempt to shove it on others who haven’t asked for advice. After exchanging pleasantries, we got down to the “real talk.” He started by asking me why I am still sticking around here (India) and not moving abroad. To set the context, he has been working in the Middle East ever since I first met him at my cousin’s wedding, around fourteen years ago. His wife and kids, who used to live with him, recently moved to Canada, possibly to settle there once he retires. Of course, moving abroad seems to be the trend (or even the no...