About tuning store procedure

Question

Post reply

About tuning store procedure

Derek-412704

SSChasing Mays

Points: 609
More actions
September 7, 2007 at 12:00 am

#95393

i am trying to optimize this store procedure and in some part i have created temporary table, and i know store procedure with temp table cannot be pre compiled. so how do i use table variable instead, please help me out, or if anyone has better idea appreciate it if you could share.
here is portion of my store procedure.
if not exists (select * from sysobjects where id = object_id('#ScorecardTemp') )
BEGIN
CREATE TABLE #ScorecardTemp (
businessDate datetime null,
Cycle char (5) NULL ,
daypart smallint NULL ,
EarnedHours money NULL ,
ActualHours money NULL ,
ForecastHours money NULL ,
ScheduleHours money NULL ,
ActualDayParts smallint NULL ,
ScheduleDayparts smallint NULL ,
ActualRedDayparts smallint NULL ,
ScheduleRedDayparts smallint NULL
)
END
Insert into #ScorecardTemp
select
c.enddate BusinessDate,
eventType Cycle,
daypart,
sum(EarnedHours) EarnedHours,
sum(ActualHours) ActualHours,
sum(ForecastHours) ForecastHours,
sum(ScheduleHours) ScheduleHours,
count(ActualRed) ActualDayParts,
count(ScheduleRed) ScheduleDayparts,
sum(ActualRed) ActualRedDayparts,
sum(ScheduleRed) ScheduleRedDayparts
from fSystemCalendarRollups(@closedate, @closeDate,'DWP') c
left OUter join fLaborRedDayparts(@startPeriod, @closeDate,0)
on businessDate between c.startDate and c.endDate
--where (c.eventtype <> 'D' or c.enddate = @closeDate)
group by c.enddate, eventtype, daypart
-- Now load the data from the temp table
insert into ScorecardLaborDayparts (BusinessDate, value, cycle,daypart,hours)
select businessDate, 'Earned', cycle, daypart, EarnedHours hours
from #ScorecardTemp

Viewing 14 posts - 1 through 13 (of 13 total)

You must be logged in to reply to this topic. Login to reply

Andras Belokosztolszki SSC-Insane Points: 21971 More actions · Answer 1

You could do:

DECLARE @ScorecardTemp table (

businessDate datetime null,

Cycle char (5) NULL ,

daypart smallint NULL ,

EarnedHours money NULL ,

ActualHours money NULL ,

ForecastHours money NULL ,

ScheduleHours money NULL ,

ActualDayParts smallint NULL ,

ScheduleDayparts smallint NULL ,

ActualRedDayparts smallint NULL ,

ScheduleRedDayparts smallint NULL

)

Insert into @ScorecardTemp

select

c.enddate BusinessDate,

eventType Cycle,

daypart,

sum(EarnedHours) EarnedHours,

sum(ActualHours) ActualHours,

sum(ForecastHours) ForecastHours,

sum(ScheduleHours) ScheduleHours,

count(ActualRed) ActualDayParts,

count(ScheduleRed) ScheduleDayparts,

sum(ActualRed) ActualRedDayparts,

sum(ScheduleRed) ScheduleRedDayparts

from fSystemCalendarRollups(@closedate, @closeDate,'DWP') c

left OUter join fLaborRedDayparts(@startPeriod, @closeDate,0)

on businessDate between c.startDate and c.endDate

--where (c.eventtype 'D' or c.enddate = @closeDate)

group by c.enddate, eventtype, daypart

-- Now load the data from the temp table

insert into ScorecardLaborDayparts (BusinessDate, value, cycle,daypart,hours)

select businessDate, 'Earned', cycle, daypart, EarnedHours hours

from @ScorecardTemp

Procs with table variables can be pre-compiled. They are also not logged in the transaction log

Regards,

Andras

Andras Belokosztolszki, MCPD, PhD
GoldenGate Software

Andras Belokosztolszki SSC-Insane Points: 21971 More actions · Answer 2

I mentioned that table variables are not logged in the transaction log, and that unlike temp tables, procs that contain them can be precompiled. However, this does not necessarily improve performance. I've just found http://www.sql-server-performance.com/articles/per/temp_tables_vs_variables_p1.aspx which suggests that while on 2000 it is a good idea to use table variables, on 2k5 it may be a bad idea (depends on many things though, so it is important to check this with your own db/data)

Regards,

Andras

Andras Belokosztolszki, MCPD, PhD
GoldenGate Software

Jeff Moden SSC Guru Points: 1000539 More actions · Answer 3

Best if you keep the content of table variables pretty short... they do not use nor can they be made to use Statistics... there are other disadvantages to table variables, as well. Pay particular attention to Q3/A3 and Q4/A4... and note that both table variables and temp tables are built in the same place

http://support.microsoft.com/default.aspx?scid=kb;en-us;305977&Product=sql2k

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.
"Change is inevitable... change for the better is not".

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)
Intro to Tally Tables and Functions

JP-317675 SSC Eights! Points: 985 More actions · Answer 4

Is there a case for not using a temp table or table variable at all? If you wrap your temp table select statement in the from clause of your insert statement you achieve the same result. I am unsure what locking issues may arise if you select statement is a long running query.

-- Now load the data from the temp table

insert into ScorecardLaborDayparts

(

BusinessDate,

value,

cycle,

daypart,

hours

)

select businessDate,

'Earned',

cycle,

daypart,

EarnedHours hours

from (

select c.enddate BusinessDate,

eventType Cycle,

daypart,

sum(EarnedHours) EarnedHours,

sum(ActualHours) ActualHours,

sum(ForecastHours) ForecastHours,

sum(ScheduleHours) ScheduleHours,

count(ActualRed) ActualDayParts,

count(ScheduleRed) ScheduleDayparts,

sum(ActualRed) ActualRedDayparts,

sum(ScheduleRed) ScheduleRedDayparts

from fSystemCalendarRollups(@closedate, @closeDate, 'DWP') c

left OUter join fLaborRedDayparts(@startPeriod, @closeDate, 0)

on businessDate between c.startDate and c.endDate

--where (c.eventtype 'D' or c.enddate = @closeDate)

group by c.enddate,

eventtype,

daypart

) AS ToInsert

-- JP

Phil Factor SSC-Insane Points: 20224 More actions · Answer 5

I realise I'm drifting off-topic here, but I'm struggling to understand the SQL. In what circumstances would one adopt this particular LEFT JOIN construction, with an ambiguous ON join-criterion, particularly without checking on NULLs in the WHERE clause.

Best wishes,
Phil Factor

JP-317675 SSC Eights! Points: 985 More actions · Answer 6

The outer join will ensure that all records from fSystemCalendarRollups are grouped and returned in the result set.

Where the outer join was unsuccessful the values in the fLabourRedDayParts will be returned as null for that row. Perhaps it is important for every record from the left table to be returned regardless of the joins success or failure.

In the query the JOIN statement is effectively doing what a WHERE clause cannot which is preserving rows and appending them to the result set. If the between statement was used in the WHERE clause then there would be no way to return all rows from the fSystemCalendarRollups table.

This example below shows the difference.

CREATE TABLE #Tester1 (low int, high int)

CREATE TABLE #Tester2 (value int)

INSERT INTO #Tester1 (low, high) VALUES (1, 10)

INSERT INTO #Tester1 (low, high) VALUES (2, 10)

INSERT INTO #Tester1 (low, high) VALUES (3, 10)

INSERT INTO #Tester1 (low, high) VALUES (4, 10)

INSERT INTO #Tester1 (low, high) VALUES (5, 10)

INSERT INTO #Tester1 (low, high) VALUES (6, 10)

INSERT INTO #Tester1 (low, high) VALUES (7, 10)

INSERT INTO #Tester1 (low, high) VALUES (8, 10)

INSERT INTO #Tester1 (low, high) VALUES (9, 10)

INSERT INTO #Tester1 (low, high) VALUES (10, 10)

INSERT INTO #Tester2 (value) VALUES (5)

-- Shows where join was successful

SELECT *

FROM #Tester1 AS t1

LEFT OUTER JOIN #Tester2 AS t2

ON value between t1.low and t1.high

-- This shows how the group by will return null

SELECT value, sum(low)

FROM #Tester1 AS t1

LEFT OUTER JOIN #Tester2 AS t2

ON value between t1.low and t1.high

GROUP BY value

-- A cartesian join using a where clause

SELECT value, sum(low)

FROM #Tester1 AS t1, #Tester2 AS t2

WHERE t2.value between t1.low and t1.high

GROUP BY value

-- Group by using where clause

SELECT *

FROM #Tester1 AS t1, #Tester2 AS t2

WHERE t2.value between t1.low and t1.high

DROP TABLE #Tester1

DROP TABLE #Tester2

-- JP

Phil Factor SSC-Insane Points: 20224 More actions · Answer 7

Thanks Jonathan.

As far as I can see, this is an aggregation of data into arbitrary date/time slices (defined by a start and end date) in such a way that each time-slice is represented by a row in the result: in other words, no time slice is left without aggregated data, even if it is a null. It would rely on its accuracy on there being no overlap in the calendar table containing the start and end dates.

I was just wondering if there was any, more immediate, performance gains we could suggest for this stored procedure than changing from a temp table to a table variable. In this case, the join is unindexed as it is between two table functions. This wouldn't matter if there wasn't much data but...

Best wishes,
Phil Factor

Bob Fazio SSChampion Points: 10794 More actions · Answer 8

The following does a much better job explaining when/why to use table variables.

http://blogs.msdn.com/sqlprogrammability/archive/2007/01/18/11-0-temporary-tables-table-variables-and-recompiles.aspx

Also, be careful not to get into optimization overload. I still suggest that you spend time optimizing the procedure from the beginning, but make sure that the time and effort is worth it.

FYI< I am making these numbers up!!!>

For example: Each recomple that the procedure experiences causes say .01 seconds delay... You have 10 recompiles so to total .1 seconds in a .15 second procedure.

Well, each re-compile will reduce the procedure run time by a noticable amount. If you can cut them out by 1/2, you can improve the procedure by 33%.

If the procedure is run 5 Milliion times per day that would result in roughly 7 hours of CPU TIME!... That would be significant. However, if that procedure is only run 5x per day, then we are talking about less than a second a day Spend your time where you get the the most results.

Another example is say for a procedure that runs for 2 min. Removing the 5 recompiles, really doesn't do much for you.

TheSQLGuru SSC Guru Points: 134017 More actions · Answer 9

I agree with the post that states that if there isn't a huge concern for locking the best thing is to simply remove the temp table altogether and simply do a straight insert/select.

Best,
Kevin G. Boles
SQL Server Consultant
SQL MVP 2007-2012
TheSQLGuru on googles mail service

Jeff Moden SSC Guru Points: 1000539 More actions · Answer 10

Bob Fazio (9/12/2007)
The following does a much better job explaining when/why to use table variables.
Also, be careful not to get into optimization overload. I still suggest that you spend time optimizing the procedure from the beginning, but make sure that the time and effort is worth it.
FYI< I am making these numbers up!!!>
For example: Each recomple that the procedure experiences causes say .01 seconds delay... You have 10 recompiles so to total .1 seconds in a .15 second procedure.
Well, each re-compile will reduce the procedure run time by a noticable amount. If you can cut them out by 1/2, you can improve the procedure by 33%.
If the procedure is run 5 Milliion times per day that would result in roughly7 hours of CPU TIME!... That would be significant. Spend your time where you get the the most results.
Another example is say for a procedure that runs for 2 min. Removing the 5 recompiles, really doesn't do much for you.

5 million times a day? Sure, if it's RBAR for a GUI... the "other" world is batch and 1 recompile isn't going to amount to a hill of beans.

The other thing is, if you put the DDL for the temp table at the beginning of the proc (or whatever) and don't mix it in with the DML, you won't cause a recompile... even for GUI code.

--Jeff Moden

RBAR is pronounced "ree-bar" and is a "Modenism" for Row-By-Agonizing-Row.
First step towards the paradigm shift of writing Set Based code:
________Stop thinking about what you want to do to a ROW... think, instead, of what you want to do to a COLUMN.
"Change is inevitable... change for the better is not".

Helpful Links:
How to post code problems
How to Post Performance Problems
Create a Tally Function (fnTally)
Intro to Tally Tables and Functions

Scott Coleman One Orange Chip Points: 27451 More actions · Answer 11

I don't have anythng to add to the variable vs. temp table discussion, but I have a correction for the original script.

The line "if not exists (select * from sysobjects where id = object_id('#ScorecardTemp') )" only works if tempdb is the current database. I always use "if object_id('tempdb..#ScorecardTemp') is null".

Derek-412704 SSChasing Mays Points: 609 More actions · Answer 12

Thanks scott,

i'll surely look on to that and at the moment i am also doing some readings on optimizing queries.

appreciate it.

Derek

mengus Ten Centuries Points: 1187 More actions · Answer 13

Derek,

From looking at your stored procedure, I would have to agree with Jonathon Prosper. I do not think you would really need a temp table or a table variable; this would induce more READ costs. The data that you want is already in the SELECT statement, for which you are inserting into the temp table.

I'd recommend checking the indexes for each table associated with the columns in your WHERE clause. I would also double check the function that you are calling within your stored procedure to ensure the proper performance threshold.

Regards,

Wameng Vang

MCTS