TutorialsArena

PostgreSQL: Retrieving Unique Records with DISTINCT

Eliminate duplicate rows from query results using PostgreSQL's DISTINCT clause. This guide explains the syntax and usage of DISTINCT.



Retrieving Unique Records in PostgreSQL with `DISTINCT`

Understanding the `DISTINCT` Clause

In PostgreSQL, the `DISTINCT` clause is used in `SELECT` statements to retrieve only unique rows from a table. It removes duplicate rows from the result set, ensuring that you get only unique combinations of values for the specified columns. This is helpful for tasks such as data analysis, reporting, or any scenario where you only need unique records.

`DISTINCT` Clause Syntax

The basic syntax for using `DISTINCT` is:

1. `DISTINCT` with a Single Column

Syntax

SELECT DISTINCT column_name
FROM table_name;

This selects only the unique values from the specified column.

2. `DISTINCT` with Multiple Columns

Syntax

SELECT DISTINCT column1, column2, ...
FROM table_name;

This selects only the unique combinations of values from the specified columns.

3. `DISTINCT ON`

Syntax

SELECT DISTINCT ON (column1) column_alias, column2
FROM table_name
ORDER BY column1, column2;

This returns the first row of each group of duplicates, based on the `ORDER BY` clause. The expression in the `ORDER BY` clause must start with the same expression that is in `DISTINCT ON`.

Examples: Using `DISTINCT`

These examples use the `DISTINCT` clause with a sample table named `demo_dist`. You would need to create this table and populate it with data to see the results. The `ORDER BY` clause is used to sort results for better readability.

1. Creating and Populating the Table

CREATE TABLE Statement

CREATE TABLE demo_dist (
  Serial_No SERIAL PRIMARY KEY,
  Summer_fruits VARCHAR,
  Winter_fruits VARCHAR
);
INSERT INTO Statement

INSERT INTO demo_dist (Summer_fruits, Winter_fruits) VALUES
('Mango', 'Grape'),
('Watermelon', 'Pears'),
('Apples', 'Apples'),
('Mango', NULL),
('Apples', 'Apples'),
('Guava', 'Oranges');

2. `DISTINCT` with One Column

SELECT DISTINCT (Single Column)

SELECT DISTINCT Summer_fruits
FROM demo_dist
ORDER BY Summer_fruits;

3. `DISTINCT` with Multiple Columns

SELECT DISTINCT (Multiple Columns)

SELECT DISTINCT Summer_fruits, Winter_fruits
FROM demo_dist
ORDER BY Summer_fruits, Winter_fruits;

4. `DISTINCT ON`

SELECT DISTINCT ON

SELECT DISTINCT ON (Summer_fruits) Summer_fruits, Winter_fruits
FROM demo_dist
ORDER BY Summer_fruits, Winter_fruits;

Conclusion

The `DISTINCT` clause in PostgreSQL is a valuable tool for retrieving unique rows from a table. Understanding its various forms allows you to efficiently manage and analyze data, focusing on unique values rather than duplicates.