![]() |
VOOZH | about |
After a wait of 3 long hours, it was my turn to enter the interview room. The first question asked to me by the interviewer was βCan you estimate the total number of cigarettes consumed per month in India?β. Having worked on a project for ITC in one of the core courses, I was able to crack the problem with relative ease. I started with the total number of factories of ITC in India. From there, I calculated the number of cigarettes manufactured by ITC in a year with the help of average turnover time. Further, I made good guesses on the % of cigarettes exported and the % share of ITC in India. Finally, I got the number of cigarettes consumed per month in India which convinced the panel.
Questions like these are very common in analytics and management consulting interviews. If you wish to appear for companies of this genre, you should be able to solve guess estimates (or guesstimates as weβll call them from here on) in double-quick time. And hence this article will be very useful. I was fortunate to have got this puzzle. What if I had no clue on the number of ITC factories producing cigarettes?
After this interview, I tried solving many such puzzles to get a comfort level with such problems. In this article, I will walk through some techniques I now use to crack such puzzles.
Guesstimates are one part of the entire data science interview process. We have penned down a comprehensive 7-step framework just for you, in our βAce Data Science Interviewsβ course. Come and learn the various aspects, tips and tricks to crack your next data science interview!
Very often in the role of an Analyst or Consultant, clients expect quick initial scaling or sizing of potential projects. This is the reason such questions are so common in interviews for recruitment of such roles. The interviewer is looking out for four key traits in this interview.
Knowledge of certain techniques used for such guess estimates helps keeping the approach structured in the interview. Letβs address the cigarette estimate problem from the demand side (without using the number of ITC factories) while discussing the key techniques. Following are the 4 key techniques which will help you in such case interviews :
Following are some factors one should keep in mind while solving a guess estimate problem :
Solution: A good proxy in such a problem is the population of India, i.e., 1.2 billion. Following is an effective way to segment this population:
Following were the key considerations in building the segmentation and the intermediate guesses:
Solution: A good proxy in this problem is the world population, i.e., ~7.2 Billion. Following is a possible approach to this problem:
The actual number of Whatsapp installed on Android phone is slightly more than 100 Million. As can be seen from this example that guess estimates can be fairly accurate if we choose good segments and approximations.
Solution: A good proxy in this problem is the number of cities in India i.e. ~1700. The catch in this problem is to analyze where all can we use tennis balls. Once we have the number of tennis balls used monthly, we can easily find the number of tennis ball bought in a month using the lifetime of tennis balls.
Following is an effective way to segment this population:
Following were the key considerations in building the segmentation and the intermediate guesses:
Here is a practical example you can give a shot. Imagine you sitting in an interview and the interviewer asks βEstimate the number of aircrafts in air across the globe at this moment in time.β How will you answer this question ? Write down your approach in the comment box below to get opinion from experts.
Guess estimates are one of the most common case studies asked in data science interviews. With the right tools and techniques, this case study becomes a cake walk.
Did you find the article useful? Share with us any other techniques you incorporate while solving a guess estimate problem. Do let us know your thoughts about this article in the comment section below.
Tavish Srivastava, co-founder and Chief Strategy Officer of Analytics Vidhya, is an IIT Madras graduate and a passionate data-science professional with 8+ years of diverse experience in markets including the US, India and Singapore, domains including Digital Acquisitions, Customer Servicing and Customer Management, and industry including Retail Banking, Credit Cards and Insurance. He is fascinated by the idea of artificial intelligence inspired by human intelligence and enjoys every discussion, theory or even movie related to this idea.
GPT-4 vs. Llama 3.1 β Which Model is Better?
Llama-3.1-Storm-8B: The 8B LLM Powerhouse Surpa...
A Comprehensive Guide to Building Agentic RAG S...
Top 10 Machine Learning Algorithms in 2026
45 Questions to Test a Data Scientist on Basics...
90+ Python Interview Questions and Answers (202...
8 Easy Ways to Access ChatGPT for Free
Prompt Engineering: Definition, Examples, Tips ...
What is LangChain?
What is Retrieval-Augmented Generation (RAG)?
This one is really helpful, as these type of questions are very frequent in Analyst interviews. Great work!!!
Hi Tavish, I liked the way you have presented your points in the article. Its very well organized and makes some interesting observations (especially about the closest proxy). However, I must say that I disagree with you on the relevance of the guesstimate problems for an analytics professional. The main objective of the entire analytics endeavor is to take "guessing" out of the game. I am not sure what questions are being asked in Analytics Consulting interviews in India, but I am sure there are better ways to evaluate an individual's structured approach to problem solving, his/her comfort with numbers (in a statistical sense as opposed to raw arithmetic) and his/her ability to evaluate the efficiency of different algorithms based on the problem at hand. IMHO, Tim Peters said it best when he wrote one of the tenets in the Zen of Python: "When faced with ambiguity, resist the temptation to guess" Guesstimate questions have traditionally been used by Management Consulting companies to test the candidates ability to think on their feet. In contrast, all the Data Scientists/Analysts that I have met are way more critical and deliberate in their thinking. I think that is one of the most valuable traits of an analytics professional. But by all means, this is my personal opinion. Best, Ayush
Ayush, Thank you for your elaborate comment. I agree with you that Guess estimates were traditionally used by Management Consulting companies. They still are equally important for them. For analytics companies in India they have become quite popular in recent past. I say this based on my experience and the conversation I had with people recruiting day in and day out in analytics. The reason they are so popular in Indian analytics companies is that analytics is still in its nascent stage. New hires have to make their own path and influence people in industry who are still hesitant to implement strategies driven on numbers. To access such capability we need people with skills very close to a management consultant, where business problems are not very well defined and client is not very keen on accepting fact based strategy changes. Also expertise in such problems gives candidate a comfort level with segmentation, which is the heart and soul of analytics industry. This is my perception of the Indian analytics industry. I am still open to discussion on the relevance of such case studies in analytics interviews. Talking from my personal experience, I have been asked such question in every interview I have appeared till date. Tavish
Edit
Resend OTP
Resend OTP in 45s