Answering this question is not easy. Like always, the best response is “it depends” !
But let’s try to give you all the necessary info the provide the most accurate answer. Also, may be fixing one single query is not enough and looking for that specific statement will lead in finding multiple problematic statements.
The most consuming one
The first candidate to be fixed is the query that consumes most of the execution time (latency). To identify it, we will use the sys
schema and join it with events_statements_summary_by_digest
from performance_schema
to retrieve a real example of the query (see this post for more details).
Let’s take a look at what sys
schema has to offer us related to our mission:
> show tables like 'statements_with%';
+---------------------------------------------+
| Tables_in_sys (statements_with%) |
+---------------------------------------------+
| statements_with_errors_or_warnings |
| statements_with_full_table_scans |
| statements_with_runtimes_in_95th_percentile |
| statements_with_sorting |
| statements_with_temp_tables |
+---------------------------------------------+
We will then use the statements_with_runtimes_in_95th_percentile
to achieve our first task. However we will use the version of the view with raw data (not human readable formatted), to be able to sort the results as we want. The raw data version of sys
schema views start with x$
:
SELECT schema_name, format_time(total_latency) tot_lat,
exec_count, format_time(total_latency/exec_count) latency_per_call,
query_sample_text
FROM sys.x$statements_with_runtimes_in_95th_percentile AS t1
JOIN performance_schema.events_statements_summary_by_digest AS t2
ON t2.digest=t1.digest
WHERE schema_name NOT in ('performance_schema', 'sys')
ORDER BY (total_latency/exec_count) desc LIMIT 1\G
*************************** 1. row ***************************
schema_name: library
tot_lat: 857.29 ms
exec_count: 1
latency_per_call: 857.29 ms
query_sample_text: INSERT INTO `books` (`doc`) VALUES ('{\"_id\": \"00005d44289d000000000000007d\", \"title\": \"lucky luke, tome 27 : l alibi\", \"isbn10\": \"2884710086\", \"isbn13\": \"978-2884710084\", \"langue\": \"français\", \"relié\": \"48 pages\", \"authors\": [\"Guylouis (Auteur)\", \"Morris (Illustrations)\"], \"editeur\": \"lucky comics (21 décembre 1999)\", \"collection\": \"lucky luke\", \"couverture\": \" ...
1 row in set (0.2838 sec)
This statement is complicated to optimize as it’s a simple insert, and it was run only once. Insert can be slower because of disk response time (I run in full durability of course). Having too many indexes may also increase the response time, this is why I invite you to have a look at these two sys
schema tables:
schema_redundant_indexes
schema_unused_indexes
You will have to play with the limit of the query to find some valid candidates and then, thanks to the query_sample_text
we have the possibility to run an EXPLAIN
on the query without having to rewrite it !
Full table scans
Another query I would try to optimize is the one doing full table scans:
SELECT schema_name, sum_rows_examined, (sum_rows_examined/exec_count) avg_rows_call,
format_time(total_latency) tot_lat, exec_count,
format_time(total_latency/exec_count) AS latency_per_call,
query_sample_text
FROM sys.x$statements_with_full_table_scans AS t1
JOIN performance_schema.events_statements_summary_by_digest AS t2
ON t2.digest=t1.digest
WHERE schema_name NOT in ('performance_schema', 'sys')
ORDER BY (total_latency/exec_count) desc LIMIT 1\G
*************************** 1. row ***************************
schema_name: wp_lefred
sum_rows_examined: 268075
avg_rows_call: 3277.0419
tot_lat: 31.31 s
exec_count: 124
latency_per_call: 252.47 ms
query_sample_text: SELECT count(*) as mytotal
FROM wp_posts
WHERE (post_content LIKE '%youtube.com/%'
OR post_content LIKE '%youtu.be/%')
AND post_status = 'publish'
1 row in set (0.0264 sec)
We can then see that this query was executed 124 times for a total execution time of 31.31 seconds which makes 252.47 milliseconds per call. We can also see that this query examined more than 268k rows which means that on average those full table scans are examining 3277 records per query.
This is a very good one for optimization.
Temp tables
Creating temporary tables is also sub optimal for your workload, if you have some slow ones you should have identified them already with the previous queries. But if you want to hunt those specifically, once again, sys
schema helps you to catch them:
SELECT schema_name, format_time(total_latency) tot_lat, exec_count,
format_time(total_latency/exec_count) latency_per_call, query_sample_text
FROM sys.x$statements_with_temp_tables AS t1
JOIN performance_schema.events_statements_summary_by_digest AS t2
ON t2.digest=t1.digest
WHERE schema_name NOT in ('performance_schema', 'sys') AND disk_tmp_tables=1
ORDER BY 2 desc,(total_latency/exec_count) desc LIMIT 1\G
Fortunately, I had none on my system.
Query optimization is not the most exciting part of the DBA job… but it has to be done ;-). You have now an easy method to find where to start, good luck ! And don’t forget that if you need any help, you can always joins the MySQL Community Slack channel !
Query optimization is not the most exiting part of the DBA job…
“exiting”
Thank you for pointing this mistake, I updated the post.
Hi Lefred,
How can we use above SQL queries in 5.7?
Regards!
Mohsin
Hi Mohsin, if you check the first article linked, you will see that QUERY_SAMPLE_TEXT is an addition in MySQL 8.0 only. You can use the SYS schema in 5.7 without any join.
Thanks
This is great. I’m just upgrading a DB from 5.6 to 5.7 and I’m researching how I can best leverage the new sys schema. This has been very helpful. Also, one of the best things that I learned from this post is that the `x$` tables have the _full SQL statement_, whereas the “user friendly” tables have really brief digests. I thought I was going to have to pick those apart – but, being able to `JOIN` to the non “user friendly” table to get the full SQL statement is such a huge win! Thanks.
you will see that QUERY_SAMPLE_TEXT
[…] already wrote an article on how to use Performance_Schema and Sys, let’s point out the query consuming most of the execution time (latency) with this new […]