Sql server, .net and c# video tutorial: Part 4 - Delete duplicate rows in sql

Suggested Videos:
Part 1 - How to find nth highest salary in sql
Part 2 - SQL query to get organization hierarchy
Part 3 - How does a recursive CTE work

In this video, we will discuss deleting all duplicate rows except one from a sql server table.

Let me explain what we want to achieve. We will be using the Employees table for this demo.

SQL Script to create Employees table

Create table Employees
(
    ID int,
    FirstName nvarchar(50),
    LastName nvarchar(50),
    Gender nvarchar(50),
    Salary int
)

Insert into Employees values (1, 'Mark', 'Hastings', 'Male', 60000)
Insert into Employees values (2, 'Mary', 'Lambeth', 'Female', 30000)
Insert into Employees values (3, 'Ben', 'Hoskins', 'Male', 70000)
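Before deleting anything, it can help to confirm which rows are actually duplicated. The query below is a minimal sketch, assuming a duplicate means a row that matches another row in every column, and that the table holds repeated rows (for example, because the Insert statements above were run more than once). The Copies alias is illustrative and not part of the original script.

```sql
-- List each distinct row that occurs more than once, with its number of copies.
SELECT ID, FirstName, LastName, Gender, Salary, COUNT(*) AS Copies
FROM Employees
GROUP BY ID, FirstName, LastName, Gender, Salary
HAVING COUNT(*) > 1
```

If this returns no rows, the table has no fully duplicated rows and there is nothing to delete.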

The delete query should delete all duplicate rows except one. After it is executed, the table should contain exactly one row for each employee.

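Here is the SQL query that does the job. PARTITION BY divides the query result set into partitions, here one partition per ID. ROW_NUMBER() then numbers the rows within each partition starting at 1, so every row with RowNumber greater than 1 is an extra copy. Deleting those rows through the CTE removes them from the underlying Employees table.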

WITH EmployeesCTE AS
(
    SELECT *, ROW_NUMBER() OVER (PARTITION BY ID ORDER BY ID) AS RowNumber
    FROM Employees
)
DELETE FROM EmployeesCTE WHERE RowNumber > 1
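To see which rows the delete would remove, the same CTE can be run with a SELECT in place of the DELETE. This is only a preview sketch that reuses the names from the query above and does not change any data.

```sql
-- Preview: show every row with its position inside its ID partition.
-- Rows where RowNumber > 1 are the ones the DELETE would remove.
WITH EmployeesCTE AS
(
    SELECT *, ROW_NUMBER() OVER (PARTITION BY ID ORDER BY ID) AS RowNumber
    FROM Employees
)
SELECT * FROM EmployeesCTE
ORDER BY ID, RowNumber
```

Because the partition is on ID alone, this treats any two rows with the same ID as duplicates, even if their other columns differ.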


15 comments:

Anonymous, November 22, 2014 at 6:58 AM
SELECT * FROM Employees

-- Delete duplicate rows in sql

SELECT DISTINCT ID,FirstName,LastName,Gender,Salary
FROM
Employees
Anonymous, February 7, 2016 at 3:41 AM
This works too:

DELETE FROM Employees WHERE id NOT IN
(SELECT MIN(id) FROM Employees
GROUP BY FirstName,LastName,Gender,Salary)
Anonymous, February 7, 2016 at 3:57 AM
Please check this:

SELECT DISTINCT * INTO tblEmployee5
FROM tblEmployee
DROP TABLE tblEmployee
EXEC sp_rename 'tblEmployee5', 'tblEmployee'
Umesh, November 17, 2016 at 10:27 PM
Hi Venkat,
Can you please explain how to delete all rows except one based on only some column values?
E.g., an Employees table with FirstName, LastName, Salary and City columns.
I want to delete all rows except one where FirstName, LastName and City are duplicated but the salary is different.
Thanks in advance.
himanshu pareek, February 24, 2017 at 9:36 AM
Run this query; it clarifies a lot about RANK, DENSE_RANK and ROW_NUMBER:

select ID, ROW_NUMBER() OVER (partition by ID Order By Id desc) as RowNumberCol,
RANK() OVER(Order By Id desc) as RankCol,
DENSE_RANK() OVER(Order By Id desc) as DenseRankCol
from Employees
Anonymous, July 4, 2017 at 11:29 PM
I want to use this query in the front end, in C# or ASP.NET forms. Please help me with how to use it.
Mohamed Suleiman, July 19, 2017 at 4:31 AM
Excellent. Great. Thank you so much.
Manoj Kumar, September 19, 2018 at 6:58 AM
In MySQL I am getting this error:
Table 'test.employeescte' doesn't exist

Can anyone help me delete duplicate rows in MySQL?
Unknown, August 21, 2019 at 6:03 AM
That's because you don't have the table "test.employeescte". It says "doesn't exist". The table called "employeescte" belongs to the author's database (as an example). So you must replace the name in the code with the name of your table that has duplicated rows. Don't forget to replace "test" with the name of your database.
Yam Basnet, September 27, 2020 at 9:54 AM
Let's think about a change to your scenario.
1) If there are multiple duplicate rows in the table based on name, for example:
(1, 'Mark', 'Hastings', 'Male', 60000)
(2, 'Mark', 'Hastings', 'Male', 50000)
(3, 'Mark', 'Hastings', 'Male', 80000)
how can we keep only the row with the highest salary from these duplicates and delete the others?
2) If there are multiple duplicate rows in the table based on name, for example:
(1, 'Mark', 'Hastings', 'Male', 60000)
(2, 'Mark', 'Hastings', 'Male', 60000)
how can we keep only the row with the latest ID from these duplicates and delete the other rows?
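
One possible way to approach both scenarios above is to partition by the columns that define a duplicate and order each partition so that the row to keep comes first. The query below is a sketch, not taken from the original post. It assumes duplicates are defined by FirstName, LastName and Gender, and the CTE name RankedEmployeesCTE is illustrative.

```sql
-- Keep one row per (FirstName, LastName, Gender):
-- the highest Salary wins; if salaries tie, the highest ID wins.
WITH RankedEmployeesCTE AS
(
    SELECT *, ROW_NUMBER() OVER (PARTITION BY FirstName, LastName, Gender
                                 ORDER BY Salary DESC, ID DESC) AS RowNumber
    FROM Employees
)
DELETE FROM RankedEmployeesCTE WHERE RowNumber > 1
```

With the rows from scenario 1, this would keep ID 3 (salary 80000). With the rows from scenario 2, the salaries tie, so it would keep ID 2. Changing the PARTITION BY column list changes what counts as a duplicate, and changing the ORDER BY changes which row survives.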
