MySQL Interview Questions

MySQL is one of the most popular relational database management systems in the world. Whether you are preparing for a junior or senior developer interview, these 20 commonly asked MySQL interview questions will help you solidify your understanding of core database concepts, SQL syntax, and performance optimization.

1. What is MySQL? What are its key features?

MySQL is an open-source relational database management system (RDBMS) that uses Structured Query Language (SQL) for managing data. It is developed and maintained by Oracle Corporation.

Open-source and free to use (Community Edition)
Cross-platform — runs on Linux, Windows, macOS
High performance with query caching and indexing
ACID compliant (with InnoDB engine)
Replication support — Master-Slave and Master-Master
Scalable — handles millions of rows efficiently
Strong community and extensive documentation

Check MySQL Version

-- Check MySQL version
SELECT VERSION();

-- Show all databases
SHOW DATABASES;

-- Create a new database
CREATE DATABASE my_app;

-- Use a database
USE my_app;

-- Show all tables in current database
SHOW TABLES;

2. What is the difference between MySQL and SQL?

SQL (Structured Query Language) is a language used to communicate with relational databases. MySQL is a database management system that uses SQL as its query language.

SQL	MySQL
A query language (standard)	A database software (RDBMS)
Defines how to interact with data	Implements SQL to store/retrieve data
Used by many RDBMS (PostgreSQL, SQLite, etc.)	One specific RDBMS that uses SQL

SQL is the language, MySQL is the tool

-- SQL is the language you write:
SELECT * FROM users WHERE age > 18;
INSERT INTO users (name, email) VALUES ('John', 'john@example.com');
UPDATE users SET name = 'Jane' WHERE id = 1;
DELETE FROM users WHERE id = 1;

-- MySQL is the software that executes these SQL statements
-- Other RDBMS like PostgreSQL, SQLite, SQL Server also use SQL
-- but with slight syntax differences

3. What are Storage Engines? InnoDB vs MyISAM?

Storage engines are the underlying components that MySQL uses to handle SQL operations for different table types. The two most common are InnoDB (default) and MyISAM.

Feature	InnoDB	MyISAM
Transactions	Yes (ACID)	No
Foreign Keys	Yes	No
Row-level Locking	Yes	No (table-level)
Full-text Search	Yes (5.6+)	Yes

Storage Engine Commands

-- Show available storage engines
SHOW ENGINES;

-- Create table with InnoDB (default, recommended)
CREATE TABLE orders (
    id INT PRIMARY KEY AUTO_INCREMENT,
    user_id INT,
    total DECIMAL(10,2),
    FOREIGN KEY (user_id) REFERENCES users(id)
) ENGINE=InnoDB;

-- Create table with MyISAM (legacy, no transactions)
CREATE TABLE logs (
    id INT PRIMARY KEY AUTO_INCREMENT,
    message TEXT,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
) ENGINE=MyISAM;

-- Check engine of a specific table
SHOW TABLE STATUS WHERE Name = 'orders';

4. What are PRIMARY KEY, FOREIGN KEY, and UNIQUE KEY?

Keys are constraints that enforce data integrity in relational databases.

PRIMARY KEY — Uniquely identifies each row. Cannot be NULL. Only one per table.
FOREIGN KEY — References a PRIMARY KEY in another table. Enforces referential integrity.
UNIQUE KEY — Ensures all values in a column are distinct. Allows one NULL. Multiple per table.

Key Constraints Example

-- PRIMARY KEY: Unique identifier for each row
CREATE TABLE users (
    id INT PRIMARY KEY AUTO_INCREMENT,
    email VARCHAR(255) NOT NULL,
    name VARCHAR(100) NOT NULL
);

-- UNIQUE KEY: Ensures no duplicate values
ALTER TABLE users ADD UNIQUE KEY unique_email (email);

-- FOREIGN KEY: Links to another table's primary key
CREATE TABLE orders (
    id INT PRIMARY KEY AUTO_INCREMENT,
    user_id INT NOT NULL,
    product VARCHAR(255),
    amount DECIMAL(10,2),
    FOREIGN KEY (user_id) REFERENCES users(id)
        ON DELETE CASCADE
        ON UPDATE CASCADE
);

-- Composite PRIMARY KEY
CREATE TABLE order_items (
    order_id INT,
    product_id INT,
    quantity INT,
    PRIMARY KEY (order_id, product_id)
);

5. What are Indexes? What types exist?

An index is a data structure that improves the speed of data retrieval on a table at the cost of additional storage and slower writes. Think of it like an index in a book.

PRIMARY Index — Automatically created on PRIMARY KEY
UNIQUE Index — Ensures uniqueness of values
Composite Index — Index on multiple columns
FULLTEXT Index — For full-text search on text columns
SPATIAL Index — For geographic data types

Index Examples

-- Create a single-column index
CREATE INDEX idx_email ON users(email);

-- Create a composite index (multi-column)
CREATE INDEX idx_name_city ON users(last_name, city);

-- Create a unique index
CREATE UNIQUE INDEX idx_unique_email ON users(email);

-- Create a FULLTEXT index for text search
CREATE FULLTEXT INDEX idx_content ON articles(title, body);

-- Use FULLTEXT search
SELECT * FROM articles
WHERE MATCH(title, body) AGAINST('MySQL tutorial' IN BOOLEAN MODE);

-- Show indexes on a table
SHOW INDEX FROM users;

-- Drop an index
DROP INDEX idx_email ON users;

6. What is the difference between WHERE and HAVING?

Both filter rows, but they work at different stages of query execution.

WHERE — Filters rows before grouping (cannot use aggregate functions)
HAVING — Filters groups after GROUP BY (can use aggregate functions)

WHERE vs HAVING

-- WHERE: filters rows BEFORE grouping
SELECT department, COUNT(*) AS emp_count
FROM employees
WHERE salary > 50000        -- filter individual rows first
GROUP BY department;

-- HAVING: filters groups AFTER grouping
SELECT department, COUNT(*) AS emp_count
FROM employees
GROUP BY department
HAVING COUNT(*) > 5;        -- filter groups with more than 5 employees

-- Combined: WHERE + HAVING
SELECT department, AVG(salary) AS avg_salary
FROM employees
WHERE hire_date > '2020-01-01'   -- only employees hired after 2020
GROUP BY department
HAVING AVG(salary) > 60000;     -- only departments with avg salary > 60k

7. What are JOINs? Explain each type.

JOINs combine rows from two or more tables based on a related column. MySQL supports several types of joins:

INNER JOIN — Returns only matching rows from both tables
LEFT JOIN — All rows from left table + matched rows from right (NULL if no match)
RIGHT JOIN — All rows from right table + matched rows from left (NULL if no match)
FULL OUTER JOIN — All rows from both tables (MySQL uses UNION to simulate)
CROSS JOIN — Cartesian product of both tables (every combination)

JOIN Examples

-- Sample tables
-- users: id, name, email
-- orders: id, user_id, total

-- INNER JOIN: only matching rows
SELECT u.name, o.total
FROM users u
INNER JOIN orders o ON u.id = o.user_id;

-- LEFT JOIN: all users, even without orders
SELECT u.name, o.total
FROM users u
LEFT JOIN orders o ON u.id = o.user_id;

-- RIGHT JOIN: all orders, even without matching users
SELECT u.name, o.total
FROM users u
RIGHT JOIN orders o ON u.id = o.user_id;

-- FULL OUTER JOIN (simulated with UNION in MySQL)
SELECT u.name, o.total FROM users u LEFT JOIN orders o ON u.id = o.user_id
UNION
SELECT u.name, o.total FROM users u RIGHT JOIN orders o ON u.id = o.user_id;

-- CROSS JOIN: every user paired with every order
SELECT u.name, o.total
FROM users u
CROSS JOIN orders o;

-- Self JOIN: employees with their managers
SELECT e.name AS employee, m.name AS manager
FROM employees e
LEFT JOIN employees m ON e.manager_id = m.id;

8. What is a Subquery? Correlated vs Non-Correlated?

A subquery is a query nested inside another query. It can appear in SELECT, FROM, WHERE, or HAVING clauses.

Non-correlated — The inner query runs independently. It executes once and passes the result to the outer query.
Correlated — The inner query depends on the outer query. It executes once for each row of the outer query.

Subquery Examples

-- Non-correlated subquery: inner query runs ONCE independently
SELECT name, salary
FROM employees
WHERE salary > (
    SELECT AVG(salary) FROM employees   -- runs once, returns one value
);

-- Subquery in FROM clause (derived table)
SELECT dept, avg_sal
FROM (
    SELECT department AS dept, AVG(salary) AS avg_sal
    FROM employees
    GROUP BY department
) AS dept_averages
WHERE avg_sal > 60000;

-- Correlated subquery: inner query runs for EACH outer row
SELECT e.name, e.salary, e.department
FROM employees e
WHERE e.salary > (
    SELECT AVG(salary)
    FROM employees
    WHERE department = e.department  -- references outer query
);

-- EXISTS with correlated subquery
SELECT u.name
FROM users u
WHERE EXISTS (
    SELECT 1 FROM orders o WHERE o.user_id = u.id
);

9. What is Normalization? Explain 1NF, 2NF, 3NF.

Normalization is the process of organizing data to reduce redundancy and improve data integrity. Each normal form builds on the previous one:

1NF (First Normal Form) — Each column contains atomic (indivisible) values. No repeating groups.
2NF (Second Normal Form) — Must be in 1NF + all non-key columns fully depend on the entire primary key (no partial dependency).
3NF (Third Normal Form) — Must be in 2NF + no transitive dependency (non-key columns must not depend on other non-key columns).

Normalization Example

-- UNNORMALIZED (bad): repeating groups, multiple values in one cell
-- | id | name | phones                |
-- | 1  | John | 012345678, 098765432  |

-- 1NF: Atomic values, no repeating groups
CREATE TABLE users (
    id INT PRIMARY KEY,
    name VARCHAR(100)
);
CREATE TABLE user_phones (
    id INT PRIMARY KEY AUTO_INCREMENT,
    user_id INT,
    phone VARCHAR(20),
    FOREIGN KEY (user_id) REFERENCES users(id)
);

-- 2NF: Remove partial dependencies
-- Bad: order_items(order_id, product_id, product_name, quantity)
-- product_name depends only on product_id (partial dependency)

-- Good: separate into two tables
CREATE TABLE products (
    id INT PRIMARY KEY,
    name VARCHAR(255)
);
CREATE TABLE order_items (
    order_id INT,
    product_id INT,
    quantity INT,
    PRIMARY KEY (order_id, product_id),
    FOREIGN KEY (product_id) REFERENCES products(id)
);

-- 3NF: Remove transitive dependencies
-- Bad: employees(id, name, dept_id, dept_name, dept_location)
-- dept_name and dept_location depend on dept_id, not on employee id

-- Good: separate department info
CREATE TABLE departments (
    id INT PRIMARY KEY,
    name VARCHAR(100),
    location VARCHAR(255)
);
CREATE TABLE employees (
    id INT PRIMARY KEY,
    name VARCHAR(100),
    dept_id INT,
    FOREIGN KEY (dept_id) REFERENCES departments(id)
);

10. What are Transactions? What are ACID properties?

A transaction is a sequence of SQL operations executed as a single unit of work. Either all operations succeed (COMMIT) or all fail (ROLLBACK).

Atomicity — All or nothing. If one part fails, the entire transaction is rolled back.
Consistency — Database moves from one valid state to another.
Isolation — Concurrent transactions do not interfere with each other.
Durability — Once committed, changes are permanent even after a crash.

Transaction Example

-- Start a transaction
START TRANSACTION;

-- Transfer $100 from account 1 to account 2
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;

-- If both succeed, make changes permanent
COMMIT;

-- If something goes wrong, undo everything
-- ROLLBACK;

-- Using SAVEPOINT for partial rollback
START TRANSACTION;

INSERT INTO orders (user_id, total) VALUES (1, 99.99);
SAVEPOINT order_created;

INSERT INTO order_items (order_id, product_id, quantity) VALUES (LAST_INSERT_ID(), 5, 2);
-- Oops, something wrong with this item
ROLLBACK TO order_created;

-- The order still exists, only the item was rolled back
INSERT INTO order_items (order_id, product_id, quantity) VALUES (LAST_INSERT_ID(), 3, 1);
COMMIT;

-- Check current isolation level
SELECT @@transaction_isolation;

-- Set isolation level
SET TRANSACTION ISOLATION LEVEL READ COMMITTED;

11. What is the difference between DELETE, TRUNCATE, and DROP?

All three remove data, but they work differently:

Feature	DELETE	TRUNCATE	DROP
Removes	Specific rows	All rows	Entire table
WHERE clause	Yes	No	No
Rollback	Yes	No	No
Speed	Slower	Faster	Fastest

DELETE vs TRUNCATE vs DROP

-- DELETE: removes specific rows (can use WHERE, supports rollback)
DELETE FROM users WHERE id = 5;
DELETE FROM orders WHERE created_at < '2024-01-01';

-- TRUNCATE: removes ALL rows instantly (resets AUTO_INCREMENT)
TRUNCATE TABLE logs;

-- DROP: removes the entire table (structure + data)
DROP TABLE IF EXISTS temp_data;

-- Key differences:
-- DELETE   -> DML, row-by-row, logged, can rollback, keeps AUTO_INCREMENT
-- TRUNCATE -> DDL, drops & recreates table, cannot rollback, resets AUTO_INCREMENT
-- DROP     -> DDL, removes table entirely, cannot rollback

12. What are Views?

A view is a virtual table based on the result of a SQL query. It does not store data itself — it dynamically pulls data from the underlying tables each time it is queried. Views simplify complex queries, restrict data access, and provide abstraction.

View Examples

-- Create a view
CREATE VIEW active_users AS
SELECT id, name, email, last_login
FROM users
WHERE status = 'active'
AND last_login > DATE_SUB(NOW(), INTERVAL 30 DAY);

-- Use the view like a regular table
SELECT * FROM active_users;
SELECT name, email FROM active_users WHERE name LIKE 'J%';

-- Create a view with JOINs
CREATE VIEW order_summary AS
SELECT
    u.name AS customer,
    COUNT(o.id) AS total_orders,
    SUM(o.total) AS total_spent
FROM users u
LEFT JOIN orders o ON u.id = o.user_id
GROUP BY u.id, u.name;

-- Query the view
SELECT * FROM order_summary WHERE total_spent > 1000;

-- Update a view
CREATE OR REPLACE VIEW active_users AS
SELECT id, name, email FROM users WHERE status = 'active';

-- Drop a view
DROP VIEW IF EXISTS active_users;

13. What are Stored Procedures and Functions?

Both are reusable blocks of SQL code stored in the database. The key difference:

Stored Procedure — Can return zero or multiple values. Called with CALL. Can have IN, OUT, INOUT parameters.
Function — Must return exactly one value. Can be used in SQL expressions (SELECT, WHERE, etc.).

Stored Procedure and Function

-- STORED PROCEDURE: reusable block of SQL
DELIMITER //

CREATE PROCEDURE GetUserOrders(IN userId INT)
BEGIN
    SELECT o.id, o.total, o.created_at
    FROM orders o
    WHERE o.user_id = userId
    ORDER BY o.created_at DESC;
END //

DELIMITER ;

-- Call the procedure
CALL GetUserOrders(1);

-- Procedure with OUT parameter
DELIMITER //

CREATE PROCEDURE GetOrderTotal(IN userId INT, OUT totalAmount DECIMAL(10,2))
BEGIN
    SELECT SUM(total) INTO totalAmount
    FROM orders
    WHERE user_id = userId;
END //

DELIMITER ;

CALL GetOrderTotal(1, @total);
SELECT @total;

-- FUNCTION: must return a single value
DELIMITER //

CREATE FUNCTION CalculateDiscount(price DECIMAL(10,2), discount_pct INT)
RETURNS DECIMAL(10,2)
DETERMINISTIC
BEGIN
    RETURN price - (price * discount_pct / 100);
END //

DELIMITER ;

-- Use function in a query
SELECT name, price, CalculateDiscount(price, 10) AS discounted_price
FROM products;

14. What are Triggers?

A trigger is a stored program that automatically executes in response to an event (INSERT, UPDATE, DELETE) on a table. Triggers are useful for auditing, validation, and maintaining derived data.

Trigger Example

-- Create a trigger that logs changes
DELIMITER //

CREATE TRIGGER before_user_update
BEFORE UPDATE ON users
FOR EACH ROW
BEGIN
    INSERT INTO user_audit_log (user_id, field_changed, old_value, new_value, changed_at)
    VALUES (OLD.id, 'email', OLD.email, NEW.email, NOW());
END //

DELIMITER ;

-- Trigger to auto-update stock after order
DELIMITER //

CREATE TRIGGER after_order_insert
AFTER INSERT ON order_items
FOR EACH ROW
BEGIN
    UPDATE products
    SET stock = stock - NEW.quantity
    WHERE id = NEW.product_id;
END //

DELIMITER ;

-- Trigger to prevent negative balance
DELIMITER //

CREATE TRIGGER before_withdrawal
BEFORE UPDATE ON accounts
FOR EACH ROW
BEGIN
    IF NEW.balance < 0 THEN
        SIGNAL SQLSTATE '45000'
        SET MESSAGE_TEXT = 'Insufficient balance';
    END IF;
END //

DELIMITER ;

-- Show all triggers
SHOW TRIGGERS;

-- Drop a trigger
DROP TRIGGER IF EXISTS before_user_update;

15. What is the difference between CHAR and VARCHAR?

Both store string data, but they handle storage differently:

Feature	CHAR	VARCHAR
Storage	Fixed-length (padded with spaces)	Variable-length
Max length	255 characters	65,535 characters
Performance	Faster for fixed-size data	Better for variable-size data
Use case	Country codes, status flags	Names, emails, descriptions

CHAR vs VARCHAR

-- CHAR: fixed-length, always uses specified bytes
CREATE TABLE countries (
    code CHAR(2) NOT NULL,      -- 'US', 'KH', 'JP' (always 2 chars)
    name VARCHAR(100)
);

-- VARCHAR: variable-length, uses only needed bytes + 1-2 bytes overhead
CREATE TABLE users (
    id INT PRIMARY KEY AUTO_INCREMENT,
    name VARCHAR(100),          -- 'John' uses 5 bytes, 'Alexander' uses 10 bytes
    email VARCHAR(255),
    status CHAR(1) DEFAULT 'A'  -- 'A' for active, 'I' for inactive
);

-- Storage comparison:
-- CHAR(10) storing 'Hello':  'Hello     ' (10 bytes, padded with spaces)
-- VARCHAR(10) storing 'Hello': 'Hello' (6 bytes = 5 chars + 1 length byte)

16. What is UNION vs UNION ALL?

Both combine results from multiple SELECT statements. The difference is how they handle duplicates:

UNION — Combines results and removes duplicates (slower, applies DISTINCT)
UNION ALL — Combines results and keeps all rows including duplicates (faster)

UNION vs UNION ALL

-- UNION: combines results, removes duplicates
SELECT name, email FROM customers
UNION
SELECT name, email FROM suppliers;
-- If 'John' appears in both tables, shown only once

-- UNION ALL: combines results, keeps duplicates (faster)
SELECT name, email FROM customers
UNION ALL
SELECT name, email FROM suppliers;
-- If 'John' appears in both tables, shown twice

-- Practical example: Get all transactions
SELECT id, amount, 'income' AS type FROM income
UNION ALL
SELECT id, amount, 'expense' AS type FROM expenses
ORDER BY amount DESC;

-- Rules:
-- 1. Same number of columns in each SELECT
-- 2. Columns must have compatible data types
-- 3. Column names come from the first SELECT

17. What is GROUP BY and aggregate functions?

GROUP BY groups rows that have the same values in specified columns into summary rows. It is used with aggregate functions:

COUNT() — Number of rows
SUM() — Total of a numeric column
AVG() — Average value
MAX() / MIN() — Highest / lowest value

GROUP BY and Aggregates

-- COUNT: number of rows
SELECT department, COUNT(*) AS employee_count
FROM employees
GROUP BY department;

-- SUM: total of a column
SELECT department, SUM(salary) AS total_salary
FROM employees
GROUP BY department;

-- AVG: average value
SELECT department, ROUND(AVG(salary), 2) AS avg_salary
FROM employees
GROUP BY department;

-- MAX / MIN: highest and lowest
SELECT department,
    MAX(salary) AS highest_salary,
    MIN(salary) AS lowest_salary
FROM employees
GROUP BY department;

-- Multiple aggregates combined
SELECT
    department,
    COUNT(*) AS headcount,
    SUM(salary) AS total_salary,
    ROUND(AVG(salary), 2) AS avg_salary,
    MAX(salary) AS max_salary,
    MIN(salary) AS min_salary
FROM employees
GROUP BY department
ORDER BY avg_salary DESC;

-- GROUP BY with multiple columns
SELECT department, job_title, COUNT(*) AS count
FROM employees
GROUP BY department, job_title;

18. What is a Deadlock? How to prevent it?

A deadlock occurs when two or more transactions are waiting for each other to release locks, creating a circular dependency where none can proceed. MySQL automatically detects deadlocks and rolls back one transaction.

Access tables in the same order across all transactions
Keep transactions short and minimize lock time
Use appropriate isolation levels
Add proper indexes to reduce lock scope

Deadlock Scenario and Prevention

-- Deadlock scenario:
-- Transaction 1:                    Transaction 2:
-- UPDATE accounts SET ... WHERE id=1  UPDATE accounts SET ... WHERE id=2
-- UPDATE accounts SET ... WHERE id=2  UPDATE accounts SET ... WHERE id=1
-- (waits for T2 to release id=2)     (waits for T1 to release id=1)
-- DEADLOCK!

-- Prevention strategy 1: Consistent lock ordering
-- Always lock rows in the same order (by id ascending)
START TRANSACTION;
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;
SELECT * FROM accounts WHERE id = 2 FOR UPDATE;
-- Now do your updates
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;
COMMIT;

-- Prevention strategy 2: Use lock timeout
SET innodb_lock_wait_timeout = 5;  -- wait max 5 seconds

-- Check for deadlocks in the log
SHOW ENGINE INNODB STATUS;

-- View current locks
SELECT * FROM information_schema.INNODB_TRX;

19. What is Replication? Master-Slave?

Replication is the process of copying data from one MySQL server (source/master) to one or more servers (replica/slave). This provides:

High Availability — If master fails, a replica can take over
Read Scaling — Distribute read queries across replicas
Backup — Replicas serve as live backups
Analytics — Run heavy reports on replicas without impacting the master

Replication Configuration

-- MASTER configuration (my.cnf)
-- [mysqld]
-- server-id = 1
-- log_bin = mysql-bin
-- binlog_do_db = my_app

-- On Master: create replication user
CREATE USER 'replica_user'@'%' IDENTIFIED BY 'secure_password';
GRANT REPLICATION SLAVE ON *.* TO 'replica_user'@'%';
FLUSH PRIVILEGES;

-- Check master status
SHOW MASTER STATUS;
-- +------------------+----------+
-- | File             | Position |
-- | mysql-bin.000001 | 154      |
-- +------------------+----------+

-- SLAVE configuration (my.cnf)
-- [mysqld]
-- server-id = 2
-- relay_log = mysql-relay-bin

-- On Slave: configure connection to master
CHANGE MASTER TO
    MASTER_HOST = '192.168.1.100',
    MASTER_USER = 'replica_user',
    MASTER_PASSWORD = 'secure_password',
    MASTER_LOG_FILE = 'mysql-bin.000001',
    MASTER_LOG_POS = 154;

-- Start replication
START SLAVE;

-- Check slave status
SHOW SLAVE STATUS\G

20. How to optimize MySQL queries?

Query optimization is crucial for application performance. Here are the most effective strategies:

Use EXPLAIN to analyze query execution plans
Add proper indexes on columns used in WHERE, JOIN, ORDER BY
Avoid SELECT * — only select needed columns
Use LIMIT for pagination
Optimize JOINs — ensure join columns are indexed
Avoid functions on indexed columns in WHERE clauses
Use query caching where appropriate
Normalize data but denormalize for read-heavy workloads

Query Optimization Techniques

-- 1. Use EXPLAIN to analyze queries
EXPLAIN SELECT * FROM users WHERE email = 'john@example.com';
EXPLAIN ANALYZE SELECT * FROM orders WHERE user_id = 5;

-- 2. Add indexes on frequently queried columns
CREATE INDEX idx_users_email ON users(email);
CREATE INDEX idx_orders_user_date ON orders(user_id, created_at);

-- 3. Avoid SELECT * — only select what you need
-- Bad:
SELECT * FROM users;
-- Good:
SELECT id, name, email FROM users;

-- 4. Use LIMIT for pagination
SELECT id, name FROM products ORDER BY created_at DESC LIMIT 20 OFFSET 40;

-- 5. Avoid functions on indexed columns in WHERE
-- Bad (cannot use index):
SELECT * FROM users WHERE YEAR(created_at) = 2025;
-- Good (can use index):
SELECT * FROM users WHERE created_at >= '2025-01-01' AND created_at < '2026-01-01';

-- 6. Use EXISTS instead of IN for large subqueries
-- Slower:
SELECT * FROM users WHERE id IN (SELECT user_id FROM orders);
-- Faster:
SELECT * FROM users u WHERE EXISTS (SELECT 1 FROM orders o WHERE o.user_id = u.id);

-- 7. Optimize JOINs — ensure join columns are indexed
SELECT u.name, COUNT(o.id) AS order_count
FROM users u
LEFT JOIN orders o ON u.id = o.user_id  -- u.id (PK, indexed), o.user_id (needs index)
GROUP BY u.id, u.name;

-- 8. Use query profiling
SET profiling = 1;
SELECT * FROM users WHERE status = 'active';
SHOW PROFILES;
SHOW PROFILE FOR QUERY 1;

Summary

✓MySQL Basics — Open-source RDBMS using SQL, with InnoDB as the default engine
✓Keys and Indexes — PRIMARY, FOREIGN, UNIQUE keys enforce integrity; indexes boost query speed
✓JOINs — INNER, LEFT, RIGHT, FULL, CROSS for combining data from multiple tables
✓Subqueries — Nested queries, correlated and non-correlated
✓Normalization — 1NF, 2NF, 3NF to reduce redundancy
✓Transactions — ACID properties guarantee data reliability
✓Views, Procedures, Triggers — Database objects for abstraction and automation
✓Performance — Use EXPLAIN, proper indexing, and query optimization techniques
✓Replication — Master-Slave setup for high availability and read scaling
✓Deadlocks — Prevent with consistent lock ordering and short transactions