MySQL GROUP BY clause

The GROUP BY clause in MySQL is used to group rows that have the same values in specified columns into summary rows. It is commonly used with aggregate functions such as COUNT(), SUM(), AVG(), MAX(), and MIN() to perform operations on each group of data. This allows for detailed aggregation and analysis based on categorical data.

Syntax

SELECT column1, column2, aggregate_function(column3)
FROM table_name
GROUP BY column1, column2;

column1, column2, ...: Columns by which to group the data. These columns define the groups for aggregation.
aggregate_function(column3): An aggregate function applied to another column, which provides summary data for each group.

Key Concepts

Grouping Data
The GROUP BY clause is used to aggregate data based on one or more columns. For instance, if you want to know how many employees work in each department, you would group by the department column:
```
SELECT department, COUNT(*) AS number_of_employees
FROM employees
GROUP BY department;
```
This query groups employees by department and counts the number of employees in each department.
Using Aggregate Functions
GROUP BY is often used with aggregate functions to summarize data within each group:
```
SELECT department, AVG(salary) AS average_salary
FROM employees
GROUP BY department;
```
This query calculates the average salary for each department.
Multiple Columns
You can group by multiple columns to create more detailed groups:
```
SELECT department, job_title, COUNT(*) AS number_of_employees
FROM employees
GROUP BY department, job_title;
```
This query groups employees by both department and job_title, and counts the number of employees in each combination of department and job title.
Filtering Groups
To filter groups based on aggregate values, use the HAVING clause. The HAVING clause is used to filter results after aggregation, whereas the WHERE clause is used to filter rows before aggregation:
```
SELECT department, COUNT(*) AS number_of_employees
FROM employees
GROUP BY department
HAVING COUNT(*) > 10;
```
This query retrieves departments with more than 10 employees.
Order of Execution
When processing a query with GROUP BY, MySQL performs the following steps:
1. FROM: The rows are selected from the table.
2. WHERE: Rows are filtered based on the WHERE clause.
3. GROUP BY: Rows are grouped based on the specified columns.
4. HAVING: Groups are filtered based on the HAVING clause.
5. SELECT: The result set is formed, including aggregate functions.
6. ORDER BY: The final result set is sorted if an ORDER BY clause is present.
Example with ORDER BY
You can sort the results after grouping them:
```
SELECT department, COUNT(*) AS number_of_employees
FROM employees
GROUP BY department
ORDER BY number_of_employees DESC;
```
This query groups employees by department and orders the result by the number of employees in descending order.

Best Practices

Indexing: Index columns used in GROUP BY to improve performance, especially with large datasets.
Combine with Aggregate Functions: Use GROUP BY with aggregate functions to summarize data effectively.
Filter Groups: Use HAVING to filter results based on aggregate calculations.
Performance Considerations: Be mindful of query performance with large datasets and complex groupings. Optimize queries and consider indexing.

Next Previous

MySQL Basic

MySQL Introduction

MySQL Workbench

MySQL Commands

MySQL SELECT

MySQL INSERT query

MySQL UPDATE query

MySQL DELETE query

MySQL CREATE DATABASE

MySQL DROP DATABASE

MySQL USE DATABASE

MySQL Creating a table

MySQL Viewing a Table

MySQL Modifying a Table

MySQL Deleting a Table

MySQL Renaming a Table

Truncating a Table

MySQL Copying a Table

MySQL Indexing Tables

MySQl Table constraints

MySQL WHERE

MySQL alias

MySQL ORDER BY

MySQL LIMIT OFFSET

MySQL COUNT() function

MySQL SUM function

MySQL AVG() function

MySQL MAX() function

MySQL MIN() function

MySQL GROUP BY clause

MySQL DISTINCT

MySQL ASC and DESC

MySQL AND, OR and NOT

MySQL IN and NOT IN

MySQL IS NULL and IS NOT NULL

MySQL BETWEEN

MySQL LIKE operator

MySQL Nested queries

MySQL UNION

MySQL joins

MySQL INNER JOIN

MySQL GROUP BY clause

Syntax

Key Concepts

Best Practices