SQL Server pick random (or first) value with aggregation

Question

AVADHESH PATEL · Answer

Hi Ezra!There is an undocumented aggregate called ANY which is not valid syntax but is possible to get to appear in your execution plans. This does not provide any performance advantage however.Assuming the following table and index structureCREATE TABLE T(id int identity primary key,[group] char(1)and )CREATE NONCLUSTERED INDEX ix ON T([group])INSERT INTO TSELECT TOP 1000000 CHAR( 65 + ROW_NUMBER() OVER (ORDER BY @@SPID) % 3)FROM sys.all_objects o1, sys.all_objects o2, sys.all_objects o3I have also populated with sample data such that there are many rows per group.Your original querySELECT MAX(id),and  and  and  and [group]FROM and  TGROUP and BY [group] and Gives Table 'T'. Scan count 1, logical reads 1367 and the planand  |--Stream Aggregate(GROUP BY:([[T].[group]) DEFINE:([Expr1003]=MAX([[T].[id])))and  and  and  and |--Index Scan(OBJECT:([[T].[ix]), ORDERED FORWARD)Rewritten to get the ANY aggregate...;WITH cte AS(SELECT *,and  and  and  and  ROW_NUMBER() OVER (PARTITION BY [group] ORDER BY [group] ) AS RNFROM T)SELECT id,and  and  and  and [group]FROM and  and cte and  and and WHERE RN=1Gives Table 'T'. Scan count 1, logical reads 1367 and the planand  |--Stream Aggregate(GROUP BY:([[T].[group]) DEFINE:([[T].[id]=ANY([[T].[id])))and  and  and  and |--Index Scan(OBJECT:([[T].[ix]), ORDERED FORWARD)Even though potentially SQL Server could stop processing the group as soon as the first value is found and skip to the next one it doesn't. It still processes all rows and the logical reads are the same.For this particular example with many rows in the group a more efficient version would be a recursive CTE.WITH and  and RecursiveCTEAS and  and  and (and  and  and  and  SELECT TOP 1 id, [group]and  and  and  and  FROM Tand  and  and  and  ORDER BY [group]and  and  and  and  UNION and  ALLand  and  and  and  SELECT and R.id, R.[group]and  and  and  and  FROM and  and (and  and  and  and  and  and  and  and  SELECT and T.*,and  and  and  and  and  and  and  and  and  and  and  and  rn = ROW_NUMBER() OVER (ORDER BY (SELECT 0))and  and  and  and  and  and  and  and  FROM and  and Tand  and  and  and  and  and  and  and  JOIN and  and RecursiveCTE Rand  and  and  and  and  and  and  and  and  and  and  and  ON and R.[group] andlt; T.[group]and  and  and  and  and  and  and  and  ) Rand  and  and  and  WHERE and  R.rn = 1and  and  and  and  )SELECT and *FROM and  and RecursiveCTEOPTION and (MAXRECURSION 0);Which givesTable 'Worktable'. Scan count 2, logical reads 19Table 'T'. Scan count 4, logical reads 12The logical reads are much less as it retrieves the first row per group then seeks into the next group rather than reading a load of records that don't contribute to the final result.

forum

SQL Server pick random (or first) value with aggregation

Anonymous User

Can you answer this question?

1 Answers

Liked By