You are viewing a plain text version of this content. The canonical link for it is here.
Posted to derby-dev@db.apache.org by "Tony Brusseau (JIRA)" <ji...@apache.org> on 2013/04/01 21:21:15 UTC
[jira] [Resolved] (DERBY-6093) dramatically worse query plan for
prepared stamement vs calling directly
[ https://issues.apache.org/jira/browse/DERBY-6093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tony Brusseau resolved DERBY-6093.
----------------------------------
Resolution: Invalid
Marking this as invalid. I don't have time to investigate this issue more at the current time. If I do get time and investigate it, I'll reopen with extra details.
> dramatically worse query plan for prepared stamement vs calling directly
> ------------------------------------------------------------------------
>
> Key: DERBY-6093
> URL: https://issues.apache.org/jira/browse/DERBY-6093
> Project: Derby
> Issue Type: Bug
> Components: Store
> Affects Versions: 10.9.1.0
> Environment: Linux
> Reporter: Tony Brusseau
>
> If I run a count query given all the parameters directly, then the query executes in .05s, however, if I send it across as a prepared statement, then the query takes 40s to run (because is does a complete table scan on a table with 15m rows).
> The query looks like:
> SELECT COUNT(*)
> FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2
> WHERE
> (fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0)
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0)
>
> or the prepared version like:
> SELECT COUNT(*)
> FROM KB.FORMULA_ENTRIES fe1
> , KB.FORMULA_ENTRIES fe2
> WHERE
> (fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?)
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?)
> The table is defined as:
> DROP TABLE KB.FORMULA_ENTRIES;
> CREATE TABLE KB.FORMULA_ENTRIES
> (
> formula_entries_id BIGINT NOT NULL,)
> formula_id BIGINT NOT NULL,
> arg_0_term BIGINT NOT NULL,
> arg_term BIGINT NOT NULL,
> arg_num INTEGER NOT NULL,
> formula_type SMALLINT NOT NULL
> );
> ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_formula_entries_pk PRIMARY KEY (formula_entries_id);
> ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_fomula_entries_formula_id_fk
> FOREIGN KEY (formula_id) REFERENCES kb.formula_term (term_id) ON DELETE CASCADE;
> CREATE INDEX kb_formula_entries_formula_term_type ON kb.formula_entries (arg_term, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_arg0_type ON kb.formula_entries (arg_term, arg_0_term, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_num_type ON kb.formula_entries (arg_term, arg_num, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_arg0_num_type ON kb.formula_entries (arg_term, arg_0_term, arg_num, formula_type);
> *************************************************************************
> The good query plan (when *not* sent as a prepared statements) looks like:
> Tue Feb 26 16:15:21 CST 2013 Thread[DRDAConnThread_5,5,main] (XID = 40971291), (SESSIONID = 7), SELECT COUNT(*)
> FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2
> WHERE
> (fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0)
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0) ******* Project-Restrict ResultSet (10):
> Number of opens = 1
> Rows seen = 1
> Rows filtered = 0
> restriction = false
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 1.00
> optimizer estimated cost: 7184.99
> Source result set:
> Scalar Aggregate ResultSet:
> Number of opens = 1
> Rows input = 3863
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 7184.99
> Index Key Optimization = false
> Source result set:
> Project-Restrict ResultSet (9):
> Number of opens = 1
> Rows seen = 3863
> Rows filtered = 0
> restriction = false
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 7184.99
> Source result set:
> Nested Loop Join ResultSet:
> Number of opens = 1
> Rows seen from the left = 3863
> Rows seen from the right = 3863
> Rows filtered = 0
> Rows returned = 3863
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 7184.99
> Left result set:
> Project-Restrict ResultSet (5):
> Number of opens = 1
> Rows seen = 3865
> Rows filtered = 2
> restriction = true
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 35.89
> optimizer estimated cost: 6785.98
> Source result set:
> Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> Number of opens = 1
> Rows seen = 3865
> Columns accessed from heap = {1, 2, 3, 4, 5}
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 35.89
> optimizer estimated cost: 6785.98
> Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> Number of opens = 1
> Rows seen = 3865
> Rows filtered = 0
> Fetch Size = 16
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> next time in milliseconds/row = 0
> scan information:
> Bit set of columns fetched=All
> Number of columns fetched=3
> Number of deleted rows visited=0
> Number of pages visited=7
> Number of rows qualified=3865
> Number of rows visited=3866
> Scan type=btree
> Tree height=3
> start position:
> >= on first 2 column(s).
> Ordered null semantics on the following columns:
> 0 1
> stop position:
> > on first 2 column(s).
> Ordered null semantics on the following columns:
> 0 1
> qualifiers:
> None
> optimizer estimated row count: 35.89
> optimizer estimated cost: 6785.98
> Right result set:
> Project-Restrict ResultSet (8):
> Number of opens = 3863
> Rows seen = 15452
> Rows filtered = 11589
> restriction = true
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 399.01
> Source result set:
> Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> Number of opens = 3863
> Rows seen = 15452
> Columns accessed from heap = {1, 2, 3, 4, 5}
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 399.01
> Index Scan ResultSet for FORMULA_ENTRIES using constraint KB_FOMULA_ENTRIES_FORMULA_ID_FK at read committed isolation level using instantaneous share row locking chosen by the optimizer
> Number of opens = 3863
> Rows seen = 15452
> Rows filtered = 0
> Fetch Size = 16
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> next time in milliseconds/row = 0
> scan information:
> Bit set of columns fetched=All
> Number of columns fetched=2
> Number of deleted rows visited=0
> Number of pages visited=11606
> Number of rows qualified=15452
> Number of rows visited=19315
> Scan type=btree
> Tree height=3
> start position:
> >= on first 1 column(s).
> Ordered null semantics on the following columns:
> 0
> stop position:
> > on first 1 column(s).
> Ordered null semantics on the following columns:
> 0
> qualifiers:
> None
> optimizer estimated row count: 0.00
> optimizer estimated cost: 399.01
> *************************************************************************
> The bad query plan (when sent as a prepared statements) looks like:
> Tue Feb 26 16:09:35 CST 2013 Thread[DRDAConnThread_4,5,main] (XID = 40971265), (SESSIONID = 5), SELECT COUNT(*)
> FROM KB.FORMULA_ENTRIES fe1
> , KB.FORMULA_ENTRIES fe2
> WHERE
> (fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?)
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?) ******* Project-Restrict ResultSet (9):
> Number of opens = 1
> Rows seen = 1
> Rows filtered = 0
> restriction = false
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 1.00
> optimizer estimated cost: 43.36
> Source result set:
> Scalar Aggregate ResultSet:
> Number of opens = 1
> Rows input = 3863
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 43.36
> Index Key Optimization = false
> Source result set:
> Project-Restrict ResultSet (8):
> Number of opens = 1
> Rows seen = 3863
> Rows filtered = 0
> restriction = false
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 43.36
> Source result set:
> Nested Loop Join ResultSet:
> Number of opens = 1
> Rows seen from the left = 3863
> Rows seen from the right = 3863
> Rows filtered = 0
> Rows returned = 3863
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 43.36
> Left result set:
> Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> Number of opens = 1
> Rows seen = 3863
> Columns accessed from heap = {1, 2, 3, 4, 5}
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 2.98
> optimizer estimated cost: 10.89
> Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> Number of opens = 1
> Rows seen = 3863
> Rows filtered = 0
> Fetch Size = 16
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> next time in milliseconds/row = 0
> scan information:
> Bit set of columns fetched=All
> Number of columns fetched=5
> Number of deleted rows visited=0
> Number of pages visited=8
> Number of rows qualified=3863
> Number of rows visited=3864
> Scan type=btree
> Tree height=3
> start position:
> >= on first 4 column(s).
> Ordered null semantics on the following columns:
> stop position:
> > on first 4 column(s).
> Ordered null semantics on the following columns:
> qualifiers:
> None
> optimizer estimated row count: 2.98
> optimizer estimated cost: 10.89
> Right result set:
> Project-Restrict ResultSet (7):
> Number of opens = 3863
> Rows seen = 14922769
> Rows filtered = 14918906
> restriction = true
> projection = true
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> restriction time (milliseconds) = 0
> projection time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 32.47
> Source result set:
> Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> Number of opens = 3863
> Rows seen = 14922769
> Columns accessed from heap = {1, 2, 3, 4, 5}
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> optimizer estimated row count: 0.00
> optimizer estimated cost: 32.47
> Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> Number of opens = 3863
> Rows seen = 14922769
> Rows filtered = 0
> Fetch Size = 16
> constructor time (milliseconds) = 0
> open time (milliseconds) = 0
> next time (milliseconds) = 0
> close time (milliseconds) = 0
> next time in milliseconds/row = 0
> scan information:
> Bit set of columns fetched=All
> Number of columns fetched=5
> Number of deleted rows visited=0
> Number of pages visited=34767
> Number of rows qualified=14922769
> Number of rows visited=14926632
> Scan type=btree
> Tree height=3
> start position:
> >= on first 4 column(s).
> Ordered null semantics on the following columns:
> stop position:
> > on first 4 column(s).
> Ordered null semantics on the following columns:
> qualifiers:
> None
> optimizer estimated row count: 0.00
> optimizer estimated cost: 32.47
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira