India has 900 million internet users, 22 official languages, and a consumer market growing faster than almost anywhere else on earth. Running primary research here requires a fundamentally different approach than anywhere else. This guide covers what you need to know before commissioning a single survey.
Most global research agencies treat India as a large version of an emerging market. Recruit respondents, translate the questionnaire, deploy the survey, deliver the data. That approach produces India-shaped numbers, not India-accurate insights.
The problem runs deeper than logistics. India speaks 780 languages. The dialect changes, in some regions, every few kilometres. A consumer survey designed for a Mumbai respondent will not work in the same form for a respondent in Coimbatore or Bhilai. The cognitive structure of questions, the social norms around disclosure, the expectations of the interviewer relationship, and the vocabulary for abstract concepts like satisfaction or value all differ in ways that a translation layer cannot fix.
The first decision is how you will collect data. In India, this matters more than in most markets because respondent accessibility varies dramatically by geography, age, income level, and language.
Trained interviewers call respondents and record answers through a digital questionnaire. CATI is the most reliable method for reaching tier 2 and tier 3 India, rural populations, and older respondents. Telephone penetration in India exceeds internet penetration in most non-metro areas. CATI also allows interviewers to clarify questions in the respondent's language, which matters significantly in vernacular research.
The limitation is cost and time. A 20-minute CATI survey across 600 respondents in five states takes significantly longer to field than an equivalent online survey. Quality CATI operations also require trained, supervised interviewers, not autodialers.
Web-based self-completion surveys. Fast, cost-effective, and scalable. The limitation in India is sample bias. Online panels in India skew heavily toward urban, young, English-comfortable respondents. A CAWI study claiming national coverage will typically oversample Mumbai, Delhi, Bangalore, and Hyderabad and undersample everywhere else.
CAWI is appropriate for studies targeting urban professionals, smartphone users, or digitally native demographics. It should not be used as the primary method for pan-India consumer research or for any study where Bharat representation matters.
The most common approach for serious pan-India research. Urban respondents are recruited through online panels and complete surveys via CAWI. Tier 2 and tier 3 respondents are reached through CATI or CAPI with trained interviewers. The hybrid approach produces genuinely representative national samples.
This is the most consequential decision in Indian research design, and the most commonly mishandled.
The standard agency approach to vernacular research is translation. Take the English questionnaire, send it to a translation service, have it reviewed by a bilingual team member, deploy. This produces questionnaires that are linguistically correct but cognitively broken.
Genuine vernacular research treats the questionnaire as a design document, not a source document. The research objectives are translated into each language from scratch, with native-speaking researchers asking: how would a Tamil-speaking woman in Coimbatore think about this topic? What words would she use? What concepts would she recognise?
This is not a language problem. It is a research design problem. Most agencies do not have the capacity to solve it because they do not have native-speaking research designers, only native-speaking translators.
The languages that matter most for consumer research in India, beyond Hindi, are Tamil, Telugu, Bengali, Marathi, Gujarati, Kannada, Malayalam, and Punjabi. Studies that claim national representativeness without vernacular execution in at least these eight languages are not representative of India.
Most research agencies claim Bharat coverage. Genuine Bharat research capability actually requires field teams operating across 500 or more tier 2 and tier 3 cities, trained local moderators who speak the city's predominant language natively, female field teams available for research involving women and domestic decision-making, and verification infrastructure to prove fieldwork happened where it was claimed to have happened.
Field research fraud in India is more common than the industry acknowledges. The verification infrastructure that credible agencies use includes GPS stamping on every field interview, audio recording for IDIs and CATI interviews, and callback validation on 15 to 20 percent of completed respondents.
Before commissioning fieldwork in India, ask every agency: Do you GPS-stamp every field interview? Do you record CATI and IDI interviews? What percentage of completed interviews undergo callback validation? If the answer to any of these is no, or if the agency cannot provide specific numbers, that is your answer.
Large global agencies like Ipsos, Kantar, and Nielsen have India operations but are built for multinational mandates. Their tier 2 and tier 3 execution is often thinner than their pitch suggests. Local Indian agencies range from genuine specialists with deep operational infrastructure to agencies with strong pitch decks and shallow delivery capability.
Specialist primary research execution firms with genuine tier 2 and tier 3 infrastructure, 22-language vernacular capability, and verified fieldwork operations are the right choice for pan-India studies where Bharat representation actually matters.
A pan-India hybrid consumer survey with 400 to 600 respondents takes 4 to 7 weeks from briefing to clean data. IDI programmes deliver transcripts within 24 hours of each session. Retail audit programmes with GPS-stamped digital collection can deliver same-day reporting. Studies requiring vernacular execution across 6 or more languages add 1 to 2 weeks to fieldwork timelines.