How to deduplicate in presto?

Member

by lew , in category: MySQL , 4 months ago

How to deduplicate in presto?

Facebook Twitter LinkedIn Telegram Whatsapp

1 answer

Member

by shyann , 4 months ago

@lew 

In Presto, you can deduplicate data by using the DISTINCT keyword in your query. Here is an example of how to deduplicate data in Presto:

1
2
SELECT DISTINCT column1, column2, column3
FROM table_name;


This query will return only unique combinations of values in columns column1, column2, and column3 from the specified table.


You can also use the GROUP BY clause along with the MAX, MIN, or COUNT aggregate functions to deduplicate data. Here is an example:

1
2
3
SELECT column1, MAX(column2), COUNT(column3)
FROM table_name
GROUP BY column1;


This query will return one row for each unique value in column1, with the maximum value of column2 and the count of column3. This effectively deduplicates the data based on the values in column1.

Related Threads:

How to group time column into 5 second intervals and count rows using presto?
How to get latest 3 months records in presto sql?
How to configure presto?
How to list all presto workers?
How to pivot a table in presto?
How to enable security in presto?