Posted to derby-dev@db.apache.org by "Tony Brusseau (JIRA)" <ji...@apache.org> on 2013/04/01 21:21:15 UTC

[jira] [Resolved] (DERBY-6093) dramatically worse query plan for prepared statement vs calling directly

     [ https://issues.apache.org/jira/browse/DERBY-6093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tony Brusseau resolved DERBY-6093.
----------------------------------

    Resolution: Invalid

Marking this as invalid. I don't have time to investigate this issue further right now. If I do find time to investigate, I'll reopen it with more details.
                
> dramatically worse query plan for prepared statement vs calling directly
> ------------------------------------------------------------------------
>
>                 Key: DERBY-6093
>                 URL: https://issues.apache.org/jira/browse/DERBY-6093
>             Project: Derby
>          Issue Type: Bug
>          Components: Store
>    Affects Versions: 10.9.1.0
>         Environment: Linux
>            Reporter: Tony Brusseau
>
> If I run a count query with all the parameters given directly, the query executes in 0.05s; however, if I send it as a prepared statement, the query takes 40s to run (because it does a complete table scan on a table with 15m rows).
> The query looks like:
> SELECT COUNT(*) 
> FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2  
> WHERE 
> (fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0) 
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0)
>  
> or the prepared version like:
>  SELECT COUNT(*) 
> FROM KB.FORMULA_ENTRIES fe1 
> , KB.FORMULA_ENTRIES fe2  
> WHERE 
> (fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?) 
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?)
> The table is defined as:
> DROP TABLE KB.FORMULA_ENTRIES;
> CREATE TABLE KB.FORMULA_ENTRIES
> (
>         formula_entries_id           BIGINT NOT NULL,
>         formula_id                   BIGINT NOT NULL,
>         arg_0_term                   BIGINT NOT NULL,
>         arg_term                     BIGINT NOT NULL,
>         arg_num                      INTEGER NOT NULL,
>         formula_type                 SMALLINT NOT NULL
> );
> ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_formula_entries_pk PRIMARY KEY (formula_entries_id);
> ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_fomula_entries_formula_id_fk 
>         FOREIGN KEY (formula_id) REFERENCES kb.formula_term (term_id) ON DELETE CASCADE;
> CREATE INDEX kb_formula_entries_formula_term_type ON kb.formula_entries (arg_term, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_arg0_type ON kb.formula_entries (arg_term, arg_0_term, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_num_type ON kb.formula_entries (arg_term, arg_num, formula_type);
> CREATE INDEX kb_formula_entries_formula_term_arg0_num_type ON kb.formula_entries (arg_term, arg_0_term, arg_num, formula_type);
> *************************************************************************
> The good query plan (when *not* sent as a prepared statement) looks like:
> Tue Feb 26 16:15:21 CST 2013 Thread[DRDAConnThread_5,5,main] (XID = 40971291), (SESSIONID = 7), SELECT COUNT(*) 
> FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2  
> WHERE 
> (fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0) 
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0) ******* Project-Restrict ResultSet (10):
> Number of opens = 1
> Rows seen = 1
> Rows filtered = 0
> restriction = false
> projection = true
> 	constructor time (milliseconds) = 0
> 	open time (milliseconds) = 0
> 	next time (milliseconds) = 0
> 	close time (milliseconds) = 0
> 	restriction time (milliseconds) = 0
> 	projection time (milliseconds) = 0
> 	optimizer estimated row count: 1.00
> 	optimizer estimated cost: 7184.99
> Source result set:
> 	Scalar Aggregate ResultSet:
> 	Number of opens = 1
> 	Rows input = 3863
> 		constructor time (milliseconds) = 0
> 		open time (milliseconds) = 0
> 		next time (milliseconds) = 0
> 		close time (milliseconds) = 0
> 		optimizer estimated row count: 0.00
> 		optimizer estimated cost: 7184.99
> 	Index Key Optimization = false
> 	Source result set:
> 		Project-Restrict ResultSet (9):
> 		Number of opens = 1
> 		Rows seen = 3863
> 		Rows filtered = 0
> 		restriction = false
> 		projection = true
> 			constructor time (milliseconds) = 0
> 			open time (milliseconds) = 0
> 			next time (milliseconds) = 0
> 			close time (milliseconds) = 0
> 			restriction time (milliseconds) = 0
> 			projection time (milliseconds) = 0
> 			optimizer estimated row count: 0.00
> 			optimizer estimated cost: 7184.99
> 		Source result set:
> 			Nested Loop Join ResultSet:
> 			Number of opens = 1
> 			Rows seen from the left = 3863
> 			Rows seen from the right = 3863
> 			Rows filtered = 0
> 			Rows returned = 3863
> 				constructor time (milliseconds) = 0
> 				open time (milliseconds) = 0
> 				next time (milliseconds) = 0
> 				close time (milliseconds) = 0
> 				optimizer estimated row count: 0.00
> 				optimizer estimated cost: 7184.99
> 			Left result set:
> 				Project-Restrict ResultSet (5):
> 				Number of opens = 1
> 				Rows seen = 3865
> 				Rows filtered = 2
> 				restriction = true
> 				projection = true
> 					constructor time (milliseconds) = 0
> 					open time (milliseconds) = 0
> 					next time (milliseconds) = 0
> 					close time (milliseconds) = 0
> 					restriction time (milliseconds) = 0
> 					projection time (milliseconds) = 0
> 					optimizer estimated row count: 35.89
> 					optimizer estimated cost: 6785.98
> 				Source result set:
> 					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> 					Number of opens = 1
> 					Rows seen = 3865
> 					Columns accessed from heap = {1, 2, 3, 4, 5}
> 						constructor time (milliseconds) = 0
> 						open time (milliseconds) = 0
> 						next time (milliseconds) = 0
> 						close time (milliseconds) = 0
> 						optimizer estimated row count: 35.89
> 						optimizer estimated cost: 6785.98
> 						Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> 						Number of opens = 1
> 						Rows seen = 3865
> 						Rows filtered = 0
> 						Fetch Size = 16
> 							constructor time (milliseconds) = 0
> 							open time (milliseconds) = 0
> 							next time (milliseconds) = 0
> 							close time (milliseconds) = 0
> 							next time in milliseconds/row = 0
> 						scan information:
> 							Bit set of columns fetched=All
> 							Number of columns fetched=3
> 							Number of deleted rows visited=0
> 							Number of pages visited=7
> 							Number of rows qualified=3865
> 							Number of rows visited=3866
> 							Scan type=btree
> 							Tree height=3
> 							start position:
> 								>= on first 2 column(s).
> 								Ordered null semantics on the following columns: 
> 								0 1 
> 							stop position:
> 								> on first 2 column(s).
> 								Ordered null semantics on the following columns: 
> 								0 1 
> 							qualifiers:
> 								None
> 							optimizer estimated row count: 35.89
> 							optimizer estimated cost: 6785.98
> 			Right result set:
> 				Project-Restrict ResultSet (8):
> 				Number of opens = 3863
> 				Rows seen = 15452
> 				Rows filtered = 11589
> 				restriction = true
> 				projection = true
> 					constructor time (milliseconds) = 0
> 					open time (milliseconds) = 0
> 					next time (milliseconds) = 0
> 					close time (milliseconds) = 0
> 					restriction time (milliseconds) = 0
> 					projection time (milliseconds) = 0
> 					optimizer estimated row count: 0.00
> 					optimizer estimated cost: 399.01
> 				Source result set:
> 					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> 					Number of opens = 3863
> 					Rows seen = 15452
> 					Columns accessed from heap = {1, 2, 3, 4, 5}
> 						constructor time (milliseconds) = 0
> 						open time (milliseconds) = 0
> 						next time (milliseconds) = 0
> 						close time (milliseconds) = 0
> 						optimizer estimated row count: 0.00
> 						optimizer estimated cost: 399.01
> 						Index Scan ResultSet for FORMULA_ENTRIES using constraint KB_FOMULA_ENTRIES_FORMULA_ID_FK at read committed isolation level using instantaneous share row locking chosen by the optimizer
> 						Number of opens = 3863
> 						Rows seen = 15452
> 						Rows filtered = 0
> 						Fetch Size = 16
> 							constructor time (milliseconds) = 0
> 							open time (milliseconds) = 0
> 							next time (milliseconds) = 0
> 							close time (milliseconds) = 0
> 							next time in milliseconds/row = 0
> 						scan information:
> 							Bit set of columns fetched=All
> 							Number of columns fetched=2
> 							Number of deleted rows visited=0
> 							Number of pages visited=11606
> 							Number of rows qualified=15452
> 							Number of rows visited=19315
> 							Scan type=btree
> 							Tree height=3
> 							start position:
> 								>= on first 1 column(s).
> 								Ordered null semantics on the following columns: 
> 								0 
> 							stop position:
> 								> on first 1 column(s).
> 								Ordered null semantics on the following columns: 
> 								0 
> 							qualifiers:
> 								None
> 							optimizer estimated row count: 0.00
> 							optimizer estimated cost: 399.01
> *************************************************************************
> The bad query plan (when sent as a prepared statement) looks like:
> Tue Feb 26 16:09:35 CST 2013 Thread[DRDAConnThread_4,5,main] (XID = 40971265), (SESSIONID = 5), SELECT COUNT(*) 
> FROM KB.FORMULA_ENTRIES fe1 
> , KB.FORMULA_ENTRIES fe2  
> WHERE 
> (fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?) 
> AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?) ******* Project-Restrict ResultSet (9):
> Number of opens = 1
> Rows seen = 1
> Rows filtered = 0
> restriction = false
> projection = true
> 	constructor time (milliseconds) = 0
> 	open time (milliseconds) = 0
> 	next time (milliseconds) = 0
> 	close time (milliseconds) = 0
> 	restriction time (milliseconds) = 0
> 	projection time (milliseconds) = 0
> 	optimizer estimated row count: 1.00
> 	optimizer estimated cost: 43.36
> Source result set:
> 	Scalar Aggregate ResultSet:
> 	Number of opens = 1
> 	Rows input = 3863
> 		constructor time (milliseconds) = 0
> 		open time (milliseconds) = 0
> 		next time (milliseconds) = 0
> 		close time (milliseconds) = 0
> 		optimizer estimated row count: 0.00
> 		optimizer estimated cost: 43.36
> 	Index Key Optimization = false
> 	Source result set:
> 		Project-Restrict ResultSet (8):
> 		Number of opens = 1
> 		Rows seen = 3863
> 		Rows filtered = 0
> 		restriction = false
> 		projection = true
> 			constructor time (milliseconds) = 0
> 			open time (milliseconds) = 0
> 			next time (milliseconds) = 0
> 			close time (milliseconds) = 0
> 			restriction time (milliseconds) = 0
> 			projection time (milliseconds) = 0
> 			optimizer estimated row count: 0.00
> 			optimizer estimated cost: 43.36
> 		Source result set:
> 			Nested Loop Join ResultSet:
> 			Number of opens = 1
> 			Rows seen from the left = 3863
> 			Rows seen from the right = 3863
> 			Rows filtered = 0
> 			Rows returned = 3863
> 				constructor time (milliseconds) = 0
> 				open time (milliseconds) = 0
> 				next time (milliseconds) = 0
> 				close time (milliseconds) = 0
> 				optimizer estimated row count: 0.00
> 				optimizer estimated cost: 43.36
> 			Left result set:
> 				Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> 				Number of opens = 1
> 				Rows seen = 3863
> 				Columns accessed from heap = {1, 2, 3, 4, 5}
> 					constructor time (milliseconds) = 0
> 					open time (milliseconds) = 0
> 					next time (milliseconds) = 0
> 					close time (milliseconds) = 0
> 					optimizer estimated row count: 2.98
> 					optimizer estimated cost: 10.89
> 					Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> 					Number of opens = 1
> 					Rows seen = 3863
> 					Rows filtered = 0
> 					Fetch Size = 16
> 						constructor time (milliseconds) = 0
> 						open time (milliseconds) = 0
> 						next time (milliseconds) = 0
> 						close time (milliseconds) = 0
> 						next time in milliseconds/row = 0
> 					scan information:
> 						Bit set of columns fetched=All
> 						Number of columns fetched=5
> 						Number of deleted rows visited=0
> 						Number of pages visited=8
> 						Number of rows qualified=3863
> 						Number of rows visited=3864
> 						Scan type=btree
> 						Tree height=3
> 						start position:
> 							>= on first 4 column(s).
> 							Ordered null semantics on the following columns: 
> 						stop position:
> 							> on first 4 column(s).
> 							Ordered null semantics on the following columns: 
> 						qualifiers:
> 							None
> 						optimizer estimated row count: 2.98
> 						optimizer estimated cost: 10.89
> 			Right result set:
> 				Project-Restrict ResultSet (7):
> 				Number of opens = 3863
> 				Rows seen = 14922769
> 				Rows filtered = 14918906
> 				restriction = true
> 				projection = true
> 					constructor time (milliseconds) = 0
> 					open time (milliseconds) = 0
> 					next time (milliseconds) = 0
> 					close time (milliseconds) = 0
> 					restriction time (milliseconds) = 0
> 					projection time (milliseconds) = 0
> 					optimizer estimated row count: 0.00
> 					optimizer estimated cost: 32.47
> 				Source result set:
> 					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
> 					Number of opens = 3863
> 					Rows seen = 14922769
> 					Columns accessed from heap = {1, 2, 3, 4, 5}
> 						constructor time (milliseconds) = 0
> 						open time (milliseconds) = 0
> 						next time (milliseconds) = 0
> 						close time (milliseconds) = 0
> 						optimizer estimated row count: 0.00
> 						optimizer estimated cost: 32.47
> 						Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
> 						Number of opens = 3863
> 						Rows seen = 14922769
> 						Rows filtered = 0
> 						Fetch Size = 16
> 							constructor time (milliseconds) = 0
> 							open time (milliseconds) = 0
> 							next time (milliseconds) = 0
> 							close time (milliseconds) = 0
> 							next time in milliseconds/row = 0
> 						scan information:
> 							Bit set of columns fetched=All
> 							Number of columns fetched=5
> 							Number of deleted rows visited=0
> 							Number of pages visited=34767
> 							Number of rows qualified=14922769
> 							Number of rows visited=14926632
> 							Scan type=btree
> 							Tree height=3
> 							start position:
> 								>= on first 4 column(s).
> 								Ordered null semantics on the following columns: 
> 							stop position:
> 								> on first 4 column(s).
> 								Ordered null semantics on the following columns: 
> 							qualifiers:
> 								None
> 							optimizer estimated row count: 0.00
> 							optimizer estimated cost: 32.47
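
A rough way to compare the two plans quoted above is to divide the inner (right-hand) index scan's "Rows seen" by its "Number of opens". The sketch below does only that arithmetic, using counters copied from the plans. The variable and function names are made up for illustration, and the interpretation given afterwards is a reading of the counters, not a confirmed diagnosis.

```python
# Counters copied from the "Right result set" index scans in the two plans.
# The dictionary keys and names below are illustrative, not Derby identifiers.
good = {"opens": 3863, "rows_seen": 15452}      # scan on KB_FOMULA_ENTRIES_FORMULA_ID_FK
bad = {"opens": 3863, "rows_seen": 14922769}    # scan on ..._FORMULA_TERM_ARG0_NUM_TYPE


def rows_per_open(scan):
    """Average number of index rows read each time the inner scan is opened."""
    return scan["rows_seen"] / scan["opens"]


good_per_open = rows_per_open(good)
bad_per_open = rows_per_open(bad)

# How many times more rows the inner scan reads in the bad plan overall.
ratio = bad["rows_seen"] / good["rows_seen"]

print(f"good plan: {good_per_open:.0f} rows per open")
print(f"bad plan:  {bad_per_open:.0f} rows per open")
print(f"bad plan reads about {ratio:.0f}x as many inner rows")
```

On these numbers the good plan reads 4 rows per probe of the inner table, while the bad plan reads 3863 rows per probe, 3863 times over. That suggests the prepared-statement plan probes the inner table through the four-column index using only the fe2 parameter values as start and stop keys, and applies the join condition on formula_id afterwards as a restriction (Rows filtered = 14918906), rather than probing by formula_id as the good plan does.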
