Duplicate Emails

Easy

56.5%

Updated 6/1/2025

Asked by 7 Companies

EPAM Systems Microsoft Meta tcs Amazon Google Bloomberg

Topics

Database

Duplicate Emails

What is this problem about?

The Duplicate Emails interview question is a fundamental database query task. You are given a table named Person with columns Id and Email. Your goal is to write a SQL query that identifies all email addresses that appear more than once in the table. This is a common data validation task used to ensure data integrity or identify user accounts with multiple entries.

Why is this asked in interviews?

This question is frequently used by companies like Microsoft and Meta to test basic SQL knowledge. It evaluates a candidate's understanding of database interview pattern concepts like grouping, aggregation, and filtering. Specifically, it checks if you know how to use the GROUP BY and HAVING clauses together, which is essential for any backend or data engineering role.

Algorithmic pattern used

This is a standard SQL Aggregation problem.

Use GROUP BY Email to group rows with the same email.
Use the COUNT() aggregate function to count the number of occurrences in each group.
Use the HAVING clause to filter for groups where the count is strictly greater than 1. SELECT Email FROM Person GROUP BY Email HAVING COUNT(Email) > 1;

Example explanation

Suppose the Person table looks like this:

Id	Email
1	a@b.com
2	c@d.com
3	a@b.com

Grouping by Email creates two groups: a@b.com (2 rows) and c@d.com (1 row).
The HAVING COUNT(Email) > 1 filter excludes c@d.com.
The result is a@b.com.

Common mistakes candidates make

Using WHERE instead of HAVING: Trying to use WHERE COUNT(Email) > 1, which fails because WHERE filters rows before grouping, while HAVING filters groups after aggregation.
Incorrect Column Selection: Selecting columns that are not part of the GROUP BY clause without using an aggregate function.
Complexity: Writing complex subqueries or self-joins when a simple GROUP BY is sufficient.

Interview preparation tip

Always remember the order of operations in SQL: FROM -> JOIN -> WHERE -> GROUP BY -> HAVING -> SELECT -> ORDER BY. Understanding that HAVING is specifically for aggregated results is key to passing database-focused interviews.

Title	Difficulty	Topics	LeetCode
Project Employees I	Easy	Database	Solve
Fix Names in a Table	Easy	Database	Solve
Not Boring Movies	Easy	Database	Solve
Primary Department for Each Employee	Easy	Database	Solve
Queries Quality and Percentage	Easy	Database	Solve

Duplicate Emails

Asked by 7 Companies

Topics

Duplicate Emails

What is this problem about?

Why is this asked in interviews?

Algorithmic pattern used

Example explanation

Common mistakes candidates make

Interview preparation tip

Similar Questions