You are viewing a plain text version of this content. The canonical link for it is here.
Posted to derby-dev@db.apache.org by "Tony Brusseau (JIRA)" <ji...@apache.org> on 2013/02/26 23:40:12 UTC

[jira] [Created] (DERBY-6093) dramatically worse query plan for prepared stamement vs calling directly

Tony Brusseau created DERBY-6093:
------------------------------------

             Summary: dramatically worse query plan for prepared stamement vs calling directly
                 Key: DERBY-6093
                 URL: https://issues.apache.org/jira/browse/DERBY-6093
             Project: Derby
          Issue Type: Bug
          Components: Store
    Affects Versions: 10.9.1.0
         Environment: Linux
            Reporter: Tony Brusseau


If I run a count query given all the parameters directly, then the query executes in .05s, however, if I send it across as a prepared statement, then the query takes 40s to run (because is does a complete table scan on a table with 15m rows).

The query looks like:

SELECT COUNT(*) 
FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2  
WHERE 
(fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0) 
AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0)
 
or the prepared version like:

 SELECT COUNT(*) 
FROM KB.FORMULA_ENTRIES fe1 
, KB.FORMULA_ENTRIES fe2  
WHERE 
(fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?) 
AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?)

The table is defined as:

DROP TABLE KB.FORMULA_ENTRIES;
CREATE TABLE KB.FORMULA_ENTRIES
(
        formula_entries_id           BIGINT NOT NULL,)
        formula_id                   BIGINT NOT NULL,
        arg_0_term                   BIGINT NOT NULL,
        arg_term                     BIGINT NOT NULL,
        arg_num                      INTEGER NOT NULL,
        formula_type                 SMALLINT NOT NULL
);
ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_formula_entries_pk PRIMARY KEY (formula_entries_id);
ALTER TABLE kb.formula_entries ADD CONSTRAINT kb_fomula_entries_formula_id_fk 
        FOREIGN KEY (formula_id) REFERENCES kb.formula_term (term_id) ON DELETE CASCADE;

CREATE INDEX kb_formula_entries_formula_term_type ON kb.formula_entries (arg_term, formula_type);
CREATE INDEX kb_formula_entries_formula_term_arg0_type ON kb.formula_entries (arg_term, arg_0_term, formula_type);
CREATE INDEX kb_formula_entries_formula_term_num_type ON kb.formula_entries (arg_term, arg_num, formula_type);
CREATE INDEX kb_formula_entries_formula_term_arg0_num_type ON kb.formula_entries (arg_term, arg_0_term, arg_num, formula_type);

The good query plan looks (when *not* sent as a prepared statements) looks like:

Tue Feb 26 16:15:21 CST 2013 Thread[DRDAConnThread_5,5,main] (XID = 40971291), (SESSIONID = 7), SELECT COUNT(*) 
FROM KB.FORMULA_ENTRIES fe1, KB.FORMULA_ENTRIES fe2  
WHERE 
(fe1.arg_0_term = 1407374883620920) AND (fe1.arg_num = 1) AND (fe1.arg_term = 1407374883574780) AND (fe1.formula_type = 0) 
AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = 1407374883620920) AND (fe2.arg_num = 2) AND (fe2.arg_term = 1407374883663337) AND (fe2.formula_type = 0) ******* Project-Restrict ResultSet (10):
Number of opens = 1
Rows seen = 1
Rows filtered = 0
restriction = false
projection = true
	constructor time (milliseconds) = 0
	open time (milliseconds) = 0
	next time (milliseconds) = 0
	close time (milliseconds) = 0
	restriction time (milliseconds) = 0
	projection time (milliseconds) = 0
	optimizer estimated row count: 1.00
	optimizer estimated cost: 7184.99
Source result set:
	Scalar Aggregate ResultSet:
	Number of opens = 1
	Rows input = 3863
		constructor time (milliseconds) = 0
		open time (milliseconds) = 0
		next time (milliseconds) = 0
		close time (milliseconds) = 0
		optimizer estimated row count: 0.00
		optimizer estimated cost: 7184.99
	Index Key Optimization = false
	Source result set:
		Project-Restrict ResultSet (9):
		Number of opens = 1
		Rows seen = 3863
		Rows filtered = 0
		restriction = false
		projection = true
			constructor time (milliseconds) = 0
			open time (milliseconds) = 0
			next time (milliseconds) = 0
			close time (milliseconds) = 0
			restriction time (milliseconds) = 0
			projection time (milliseconds) = 0
			optimizer estimated row count: 0.00
			optimizer estimated cost: 7184.99
		Source result set:
			Nested Loop Join ResultSet:
			Number of opens = 1
			Rows seen from the left = 3863
			Rows seen from the right = 3863
			Rows filtered = 0
			Rows returned = 3863
				constructor time (milliseconds) = 0
				open time (milliseconds) = 0
				next time (milliseconds) = 0
				close time (milliseconds) = 0
				optimizer estimated row count: 0.00
				optimizer estimated cost: 7184.99
			Left result set:
				Project-Restrict ResultSet (5):
				Number of opens = 1
				Rows seen = 3865
				Rows filtered = 2
				restriction = true
				projection = true
					constructor time (milliseconds) = 0
					open time (milliseconds) = 0
					next time (milliseconds) = 0
					close time (milliseconds) = 0
					restriction time (milliseconds) = 0
					projection time (milliseconds) = 0
					optimizer estimated row count: 35.89
					optimizer estimated cost: 6785.98
				Source result set:
					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
					Number of opens = 1
					Rows seen = 3865
					Columns accessed from heap = {1, 2, 3, 4, 5}
						constructor time (milliseconds) = 0
						open time (milliseconds) = 0
						next time (milliseconds) = 0
						close time (milliseconds) = 0
						optimizer estimated row count: 35.89
						optimizer estimated cost: 6785.98
						Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
						Number of opens = 1
						Rows seen = 3865
						Rows filtered = 0
						Fetch Size = 16
							constructor time (milliseconds) = 0
							open time (milliseconds) = 0
							next time (milliseconds) = 0
							close time (milliseconds) = 0
							next time in milliseconds/row = 0

						scan information:
							Bit set of columns fetched=All
							Number of columns fetched=3
							Number of deleted rows visited=0
							Number of pages visited=7
							Number of rows qualified=3865
							Number of rows visited=3866
							Scan type=btree
							Tree height=3
							start position:
								>= on first 2 column(s).
								Ordered null semantics on the following columns: 
								0 1 
							stop position:
								> on first 2 column(s).
								Ordered null semantics on the following columns: 
								0 1 
							qualifiers:
								None
							optimizer estimated row count: 35.89
							optimizer estimated cost: 6785.98

			Right result set:
				Project-Restrict ResultSet (8):
				Number of opens = 3863
				Rows seen = 15452
				Rows filtered = 11589
				restriction = true
				projection = true
					constructor time (milliseconds) = 0
					open time (milliseconds) = 0
					next time (milliseconds) = 0
					close time (milliseconds) = 0
					restriction time (milliseconds) = 0
					projection time (milliseconds) = 0
					optimizer estimated row count: 0.00
					optimizer estimated cost: 399.01
				Source result set:
					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
					Number of opens = 3863
					Rows seen = 15452
					Columns accessed from heap = {1, 2, 3, 4, 5}
						constructor time (milliseconds) = 0
						open time (milliseconds) = 0
						next time (milliseconds) = 0
						close time (milliseconds) = 0
						optimizer estimated row count: 0.00
						optimizer estimated cost: 399.01
						Index Scan ResultSet for FORMULA_ENTRIES using constraint KB_FOMULA_ENTRIES_FORMULA_ID_FK at read committed isolation level using instantaneous share row locking chosen by the optimizer
						Number of opens = 3863
						Rows seen = 15452
						Rows filtered = 0
						Fetch Size = 16
							constructor time (milliseconds) = 0
							open time (milliseconds) = 0
							next time (milliseconds) = 0
							close time (milliseconds) = 0
							next time in milliseconds/row = 0

						scan information:
							Bit set of columns fetched=All
							Number of columns fetched=2
							Number of deleted rows visited=0
							Number of pages visited=11606
							Number of rows qualified=15452
							Number of rows visited=19315
							Scan type=btree
							Tree height=3
							start position:
								>= on first 1 column(s).
								Ordered null semantics on the following columns: 
								0 
							stop position:
								> on first 1 column(s).
								Ordered null semantics on the following columns: 
								0 
							qualifiers:
								None
							optimizer estimated row count: 0.00
							optimizer estimated cost: 399.01

The bad query plan looks (when sent as a prepared statements) looks like:

Tue Feb 26 16:09:35 CST 2013 Thread[DRDAConnThread_4,5,main] (XID = 40971265), (SESSIONID = 5), SELECT COUNT(*) 
FROM KB.FORMULA_ENTRIES fe1 
, KB.FORMULA_ENTRIES fe2  
WHERE 
(fe1.arg_0_term = ?) AND (fe1.arg_num = ?) AND (fe1.arg_term = ?) AND (fe1.formula_type = ?) 
AND (fe1.formula_id = fe2.formula_id) AND (fe2.arg_0_term = ?) AND (fe2.arg_num = ?) AND (fe2.arg_term = ?) AND (fe2.formula_type = ?) ******* Project-Restrict ResultSet (9):
Number of opens = 1
Rows seen = 1
Rows filtered = 0
restriction = false
projection = true
	constructor time (milliseconds) = 0
	open time (milliseconds) = 0
	next time (milliseconds) = 0
	close time (milliseconds) = 0
	restriction time (milliseconds) = 0
	projection time (milliseconds) = 0
	optimizer estimated row count: 1.00
	optimizer estimated cost: 43.36
Source result set:
	Scalar Aggregate ResultSet:
	Number of opens = 1
	Rows input = 3863
		constructor time (milliseconds) = 0
		open time (milliseconds) = 0
		next time (milliseconds) = 0
		close time (milliseconds) = 0
		optimizer estimated row count: 0.00
		optimizer estimated cost: 43.36
	Index Key Optimization = false
	Source result set:
		Project-Restrict ResultSet (8):
		Number of opens = 1
		Rows seen = 3863
		Rows filtered = 0
		restriction = false
		projection = true
			constructor time (milliseconds) = 0
			open time (milliseconds) = 0
			next time (milliseconds) = 0
			close time (milliseconds) = 0
			restriction time (milliseconds) = 0
			projection time (milliseconds) = 0
			optimizer estimated row count: 0.00
			optimizer estimated cost: 43.36
		Source result set:
			Nested Loop Join ResultSet:
			Number of opens = 1
			Rows seen from the left = 3863
			Rows seen from the right = 3863
			Rows filtered = 0
			Rows returned = 3863
				constructor time (milliseconds) = 0
				open time (milliseconds) = 0
				next time (milliseconds) = 0
				close time (milliseconds) = 0
				optimizer estimated row count: 0.00
				optimizer estimated cost: 43.36
			Left result set:
				Index Row to Base Row ResultSet for FORMULA_ENTRIES:
				Number of opens = 1
				Rows seen = 3863
				Columns accessed from heap = {1, 2, 3, 4, 5}
					constructor time (milliseconds) = 0
					open time (milliseconds) = 0
					next time (milliseconds) = 0
					close time (milliseconds) = 0
					optimizer estimated row count: 2.98
					optimizer estimated cost: 10.89
					Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
					Number of opens = 1
					Rows seen = 3863
					Rows filtered = 0
					Fetch Size = 16
						constructor time (milliseconds) = 0
						open time (milliseconds) = 0
						next time (milliseconds) = 0
						close time (milliseconds) = 0
						next time in milliseconds/row = 0

					scan information:
						Bit set of columns fetched=All
						Number of columns fetched=5
						Number of deleted rows visited=0
						Number of pages visited=8
						Number of rows qualified=3863
						Number of rows visited=3864
						Scan type=btree
						Tree height=3
						start position:
							>= on first 4 column(s).
							Ordered null semantics on the following columns: 
						stop position:
							> on first 4 column(s).
							Ordered null semantics on the following columns: 
						qualifiers:
							None
						optimizer estimated row count: 2.98
						optimizer estimated cost: 10.89

			Right result set:
				Project-Restrict ResultSet (7):
				Number of opens = 3863
				Rows seen = 14922769
				Rows filtered = 14918906
				restriction = true
				projection = true
					constructor time (milliseconds) = 0
					open time (milliseconds) = 0
					next time (milliseconds) = 0
					close time (milliseconds) = 0
					restriction time (milliseconds) = 0
					projection time (milliseconds) = 0
					optimizer estimated row count: 0.00
					optimizer estimated cost: 32.47
				Source result set:
					Index Row to Base Row ResultSet for FORMULA_ENTRIES:
					Number of opens = 3863
					Rows seen = 14922769
					Columns accessed from heap = {1, 2, 3, 4, 5}
						constructor time (milliseconds) = 0
						open time (milliseconds) = 0
						next time (milliseconds) = 0
						close time (milliseconds) = 0
						optimizer estimated row count: 0.00
						optimizer estimated cost: 32.47
						Index Scan ResultSet for FORMULA_ENTRIES using index KB_FORMULA_ENTRIES_FORMULA_TERM_ARG0_NUM_TYPE at read committed isolation level using instantaneous share row locking chosen by the optimizer
						Number of opens = 3863
						Rows seen = 14922769
						Rows filtered = 0
						Fetch Size = 16
							constructor time (milliseconds) = 0
							open time (milliseconds) = 0
							next time (milliseconds) = 0
							close time (milliseconds) = 0
							next time in milliseconds/row = 0

						scan information:
							Bit set of columns fetched=All
							Number of columns fetched=5
							Number of deleted rows visited=0
							Number of pages visited=34767
							Number of rows qualified=14922769
							Number of rows visited=14926632
							Scan type=btree
							Tree height=3
							start position:
								>= on first 4 column(s).
								Ordered null semantics on the following columns: 
							stop position:
								> on first 4 column(s).
								Ordered null semantics on the following columns: 
							qualifiers:
								None
							optimizer estimated row count: 0.00
							optimizer estimated cost: 32.47


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira