Chapter 13 – OLAP Functions

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 13 – OLAP Functions

“Don’t count the days, make the days count.”

- Mohammed Ali

CSUM

This ANSI version of CSUM is SUM() Over. Right now, the syntax wants to see the sum of the Daily_Sales after it is first sorted by Sale_Date. Rows Unbounded Preceding makes this a CSUM. The ANSI Syntax seems difficult, but only at first.

CSUM – The Sort Explained

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Sale_Date

ROWS UNBOUNDED PRECEDING) AS SUMOVER

FROM Sales_Table WHERE Product_ID BETWEEN 1000 and 2000 ;

The first thing the above query does before calculating is SORT all the rows by Sale_Date. The Sort is located right after the ORDER BY.

CSUM – Rows Unbounded Preceding Explained

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Sale_Date

ROWS UNBOUNDED PRECEDING) AS SUMOVER

FROM Sales_Table WHERE Product_ID BETWEEN 1000 and 2000 ;

The keywords Rows Unbounded Preceding determines that this is a CSUM. There are only a few different statements and Rows Unbounded Preceding is the main one. It means start calculating at the beginning row, and continue calculating until the last row.

CSUM – Making Sense of the Data

SELECTProduct_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Sale_Date

ROWS UNBOUNDED PRECEDING) AS SUMOVER

FROM Sales_Table WHERE Product_ID BETWEEN 1000 and 2000 ;

The second “SUMOVER” row is 90739.28. That is derived by the first row’s Daily_Sales (41888.88) added to the SECOND row’s Daily_Sales (48850.40).

CSUM – Making Even More Sense of the Data

SELECTProduct_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Sale_Date

ROWS UNBOUNDED PRECEDING) AS SUMOVER

FROM Sales_Table WHERE Product_ID BETWEEN 1000 and 2000 ;

The third “SUMOVER” row is 138739.28. That is derived by taking the first row’s Daily_Sales (41888.88) and adding it to the SECOND row’s Daily_Sales (48850.40). Then, you add that total to the THIRD row’s Daily_Sales (48000.00).

CSUM – The Major and Minor Sort Key(s)

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS SumOVER

FROM Sales_Table ;

You can have more than one SORT KEY. In the top query, Product_ID is the MAJOR Sort, and Sale_Date is the MINOR Sort.

Reset with a PARTITION BY Statement

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING)

AS SumANSI

FROM Sales_Table ;

The PARTITION Statement is how you reset in ANSI. This will cause the SUMANSI to start over (reset) on its calculating for each NEW Product_ID.

PARTITION BY only Resets a Single OLAP not ALL of them

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS Subtotal,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS GRANDTotal

FROM Sales_Table ;

Above are two OLAP statements. Only one has PARTITION BY, so only it resets.

ANSI Moving Window is Current Row and Preceding n Rows

The SUM () Over allows you to get the moving SUM of a certain column.

How ANSI Moving SUM Handles the Sort

The SUM OVER places the sort after the ORDER BY.

Quiz – How is that Total Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS Sum3_ANSI

FROM Sales_Table ;

With a Moving Window of 3, how is the 139350.69 amount derived in the Sum3_ANSI column in the third row?

Answer to Quiz – How is that Total Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS Sum3_ANSI

FROM Sales_Table ;

With a Moving Window of 3, how is the 139350.69 amount derived in the Sum3_ANSI column in the third row? It is the sum of 48850.40, 54500.22 and 36000.07. The current row of Daily_Sales plus the previous two rows of Daily_Sales.

Moving SUM every 3-rows Vs a Continuous Average

SELECT Product_ID , Sale_Date, Daily_Sales,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS SUM3,

SUM(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED Preceding) AS Continuous

FROM Sales_Table;

The ROWS 2 Preceding gives the MSUM for every 3 rows. The ROWS UNBOUNDED Preceding gives the continuous MSUM.

Partition by Resets an ANSI OLAP

Use a PARTITION BY Statement to Reset the ANSI OLAP. Notice it only resets the OLAP command containing the Partition By statement, but not the other OLAPs.

Moving Average

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG_3

FROM Sales_Table ;

Notice the Moving Window of 3 in the syntax and that it is a 2 in the ANSI version. That is because in ANSI, it is considered the Current Row and 2 preceding.

The Moving Window is Current Row and Preceding

The AVG () Over allows you to get the moving average of a certain column. The Rows 2 Preceding is a moving window of 3 in ANSI.

How Moving Average Handles the Sort

Much like the SUM OVER Command, the Average OVER places the sort keys via the ORDER BY keywords.

Quiz – How is that Total Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG_3_ANSI

FROM Sales_Table ;

With a Moving Window of 3, how is the 46450.23 amount derived in the AVG_3_ANSI column in the third row?

Answer to Quiz – How is that Total Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG_3_ANSI

FROM Sales_Table ;

AVG of 48850.40, 54500.22, and 36000.07

Quiz – How is that 4^th Row Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG_3_ANSI

FROM Sales_Table ;

With a Moving Window of 3, how is the 43566.91 amount derived in the AVG_3_ANSI column in the fourth row?

Answer to Quiz – How is that 4^th Row Calculated?

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG_3_ANSI

FROM Sales_Table ;

AVG of 54500.22, 36000.07, and 40200.43

With a Moving Window of 3, how is the 43566.91 amount derived in the AVG_3_ANSI column in the fourth row?

Moving Average every 3-rows Vs a Continuous Average

SELECT Product_ID , Sale_Date, Daily_Sales,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS 2 Preceding) AS AVG3,

AVG(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED Preceding) AS Continuous

FROM Sales_Table;

The ROWS 2 Preceding gives the MAVG for every 3 rows. The ROWS UNBOUNDED Preceding gives the continuous MAVG.

Partition By Resets an ANSI OLAP

Use a PARTITION BY Statement to Reset the ANSI OLAP. The Partition By statement only resets the column using the statement. Notice that only Continuous resets.

RANK Defaults to Ascending Order

SELECT Product_ID ,Sale_Date , Daily_Sales,

RANK() OVER (ORDER BY Daily_Sales) AS Rank1

FROM Sales_Table

WHERE Product_ID IN (1000, 2000) ;

This is the RANK() OVER. It provides a rank for your queries. Notice how you do not place anything within the () after the word RANK. Default Sort is ASC.

Getting RANK to Sort in DESC Order

SELECT Product_ID ,Sale_Date , Daily_Sales,

RANK() OVER (ORDER BY Daily_Sales DESC)

AS Rank1

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

Is the query above in ASC mode or DESC mode for sorting?

RANK() OVER and PARTITION BY

SELECT Product_ID ,Sale_Date , Daily_Sales,

RANK() OVER (PARTITION BY Product_ID

ORDER BY Daily_Sales DESC)

AS Rank1

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

What does the PARTITION Statement in the RANK() OVER do? It resets the rank.

RANK() OVER And LIMIT

SELECT Product_ID ,Sale_Date , Daily_Sales,

RANK() OVER (ORDER BY Daily_Sales DESC)

AS Rank1

FROM Sales_Table

WHERE Product_ID IN (1000, 2000)

Limit 6 ;

The Limit statement limits rows once the Rank's been calculated.

PERCENT_RANK() OVER

SELECT Product_ID ,Sale_Date , Daily_Sales,

PERCENT_RANK() OVER (PARTITION BY PRODUCT_ID

ORDER BY Daily_Sales DESC) AS PercentRank1

FROM Sales_Table WHERE Product_ID in (1000, 2000) ;

We now have added a Partition statement which resets on Product_ID so this produces 7 rows for each of our Product_IDs.

PERCENT_RANK() OVER with 14 rows in Calculation

SELECT Product_ID ,Sale_Date , Daily_Sales,

PERCENT_RANK()

OVER ( ORDER BY Daily_Sales DESC) AS PercentRank1

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

Percentage_Rank is just like RANK, however, it gives you the Rank as a percent, but only a percent of all the other rows up to 100%.

PERCENT_RANK() OVER with 21 rows in Calculation

SELECT Product_ID ,Sale_Date , Daily_Sales,

PERCENT_RANK() OVER ( ORDER BY Daily_Sales DESC)

AS PercentRank1

FROM Sales_Table ;

Percentage_Rank is just like RANK, however, it gives you the Rank as a percent but only a percent of all the other rows up to 100%.

Quiz – What Causes the Product_ID to Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

PERCENT_RANK() OVER (PARTITION BY PRODUCT_ID

ORDER BY Daily_Sales DESC) AS PercentRank1

FROM Sales_Table WHERE Product_ID in (1000, 2000) ;

What caused the Product_IDs to be sorted?

Answer to Quiz – What Cause the Product_ID to Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

PERCENT_RANK() OVER (PARTITION BY PRODUCT_ID

ORDER BY Daily_Sales DESC) AS PercentRank1

FROM Sales_Table WHERE Product_ID in (1000, 2000) ;

What caused the Product_IDs to be sorted? It was the PARTITION BY statement.

COUNT OVER for a Sequential Number

SELECT Product_ID ,Sale_Date , Daily_Sales,

COUNT(*) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS Seq_Number

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

This is the COUNT OVER. It will provide a sequential number starting at 1. The Keyword(s) ROWS UNBOUNDED PRECEDING causes Seq_Number to start at the beginning and increase sequentially to the end.

Quiz – What caused the COUNT OVER to Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

COUNT(*) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS StartOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

What Keyword(s) caused StartOver to reset?

Answer to Quiz – What caused the COUNT OVER to Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

COUNT(*) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS StartOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

What Keyword(s) caused StartOver to reset? It is the PARTITION BY statement.

The MAX OVER Command

SELECT Product_ID ,Sale_Date , Daily_Sales,

MAX(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS MaxOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

After the sort, the Max() Over shows the Max Value up to that point.

MAX OVER with PARTITION BY Reset

SELECT Product_ID ,Sale_Date , Daily_Sales,

MAX(Daily_Sales) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS MaxOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

The largest value is 64300.00 in the column MaxOver. Once it was evaluated, it did not continue until the end because of the PARTITION BY reset.

The MIN OVER Command

SELECT Product_ID, Sale_Date ,Daily_Sales

,MIN(Daily_Sales) OVER (ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS MinOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

After the sort, the MIN () Over shows the Max Value up to that point.

Quiz – Fill in the Blank

SELECT Product_ID ,Sale_Date , Daily_Sales,

MIN(Daily_Sales) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS MinOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

The last two answers (MinOver) are blank, so you can fill in the blank.

Answer – Fill in the Blank

SELECT Product_ID ,Sale_Date , Daily_Sales,

MIN(Daily_Sales) OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date

ROWS UNBOUNDED PRECEDING) AS MinOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

The last two answers (MinOver) are filled in.

The Row_Number Command

SELECT Product_ID ,Sale_Date , Daily_Sales,

ROW_NUMBER() OVER

(ORDER BY Product_ID, Sale_Date) AS Seq_Number

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

The ROW_NUMBER() Keyword(s) caused Seq_Number to increase sequentially. Notice that this does NOT have a Rows Unbounded Preceding, and it still works!

Quiz – How did the Row_Number Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

ROW_NUMBER() OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date ) AS StartOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

What Keyword(s) caused StartOver to reset?

Quiz – How did the Row_Number Reset?

SELECT Product_ID ,Sale_Date , Daily_Sales,

ROW_NUMBER() OVER (PARTITION BY Product_ID

ORDER BY Product_ID, Sale_Date ) AS StartOver

FROM Sales_Table WHERE Product_ID IN (1000, 2000) ;

What Keyword(s) caused StartOver to reset? It is the PARTITION BY statement.

Standard Deviation Functions Using STDDEV / OVER

There are actually three different versions of the standard deviation:

STDDEV - returns the standard deviation of the
expression, which is the square root of VARIANCE.
When VARIANCE returns null, this function returns null.

STDDEV_POP - computes the population standard
deviation. This function is the same as the square root of
the VAR_POP function. When VAR_POP returns null,
this function returns null.

STDDEV_SAMP - computes the sample standard
deviation, which is the square root of VAR_ SAMP.
When VAR_SAMP returns null, this function returns null.

The above information is an introduction to Standard Deviation in an OLAP statement.

Standard Deviation Functions and STDDEV / OVER Syntax

The following ANSI syntax is used with the
three standard deviation functions:

{ STDDEV | STDDEV_POP | STDDEV_SAMP (<column-name>) OVER

([ PARTITION BY <column-list> ]

ORDER BY <column-list>

[ ROWS [BETWEEN] { UNBOUNDED | <number>} PRECEDING

[ AND UNBOUNDED | <number> } FOLLOWING ] ]

[EXCLUDE CURRENT ROW | EXCLUDE GROUP | EXCLUDE TIES |

EXCLUDE NO OTHERS] ) ;

The above information is the syntax for standard deviation using the OVER clause. In order to provide the moving functionality, it is necessary to have a method that designates the number of rows to include in the STDDEV. It allows the function to calculate values from columns contained in rows that are before the current row and also rows that are after the current row.

STDDEV / OVER Example

Wow! What an amazing example.

VARIANCE / OVER Syntax

The following ANSI syntax is used with
the three standard deviation functions:

{ VARIANCE | VAR_SAMP (<column-name>) OVER

([ PARTITION BY<column-list> ]

ORDER BY <column-list>

[ ROWS [BETWEEN] { UNBOUNDED | <number>} PRECEDING

[ AND UNBOUNDED | <number> }FOLLOWING ] ]

[EXCLUDE CURRENT ROW | EXCLUDE GROUP | EXCLUDE TIES |

EXCLUDE NO OTHERS] ) ;

The above information is the syntax for VARIANCE and VAR_SAMP using the OVER clause. In order to provide the moving functionality, it is necessary to have a method that designates the number of rows to include in the VARIANCE. It allows the function to calculate values from columns contained in rows that are before the current row and also rows that are after the current row.

Variance Functions Using VARIANCE / OVER

VARIANCE Function

Returns the variance of the expression. If you apply this function to an empty set, it returns null. Matrix calculates the expression as follows:

• 0 if the number of rows in expression = 1

• VAR_SAMP if the number of rows in expression > 1

VARIANCE = (SUM(expr2) - ((SUM(expr))2 / COUNT(expr))) / (COUNT(expr) - 1)

VAR_POP Function

Returns the population variance of a set of numbers after discarding the nulls in this set. If you apply this function to an empty set, it returns null.

VAR_POP = (SUM(expr2) - ((SUM(expr))2 / COUNT(expr))) / COUNT(expr)

VAR_ SAMP Function

Returns the sample variance of a set of numbers after discarding the nulls in this set. If you apply this function to an empty set, it returns null.

The above information is an introduction to the Variance functions used in conjunction with an OVER statement.

Using VARIANCE with PARTITION BY Example

SELECTProduct_ID AS "Prod", Sale_Date, Daily_Sales AS "Sales"

,VARIANCE(Daily_Sales) OVER ( ORDER BY sale_date

ROWS unboundedPRECEDING) AS VAR_S

,VARIANCE(Daily_Sales) OVER ( PARTITION BY sale_date

ORDER BY sale_dateROWS unbounded PRECEDING ) VAR_P

,VAR_POP(Daily_Sales) OVER ( ORDER BY sale_date

ROWS unboundedPRECEDING) AS VAR_POP

,VAR_SAMP(Daily_Sales) OVER ( ORDER BY sale_date

ROWS unboundedPRECEDING) AS VAR_SMP

FROMsql_class..sales_table

WHERE EXTRACT(MONTH FROM Sale_Date) = 9 ;

Wow! Another amazing example. The above example uses all three standard deviation functions to produce output sorting on the sales date for the dates in September.

Using FIRST_VALUE and LAST_VALUE

The FIRST_VALUE and LAST_VALUE functions allow you to specify sorted
aggregate groups and return the first or last value of each group. The function
needs to know the length of the data at run time and does not allow a decimal value.

Syntax for FIRST_VALUE and LAST_VALUE:

{FIRST_VALUE | LAST_VALUE} ({ <column reference> | <value expression> | *})

OVER

([PARTITION BY <column reference>[,...]]

ORDER BY {<column reference> [ASC | DESC] } [,...]]

[ ROWS | RANGE {{ CURRENT ROW | UNBOUNDED

| <literal value> PRECEDING} | BETWEEN {CURRENT ROW |

UNBOUNDED PRECEDING | <literal value> PRECEDING}

AND {CURRENT ROW

| UNBOUNDED FOLLOWING | <literal value> FOLLOWING}}]

[ EXCLUDE CURRENT ROW | EXCLUDE GROUP | EXCLUDE TIES

| EXCLUDE NO OTHERS ] ) ;

The above information provides information and the syntax for FIRST_VALUE and LAST_Value.

Using FIRST_VALUE

SELECTLast_name, first_name, dept_no

‚FIRST_VALUE(first_name)

OVER (ORDER BY dept_no, last_name desc

rows unbounded preceding) AS "First All"

‚FIRST_VALUE(first_name)

OVER (PARTITION BY dept_no

ORDER BY dept_no, last_name desc

rows unbounded preceding) AS "First Partition"

FROMSQL_Class..Employee_Table;

The above example uses FIRST_VALUE to show you the very first first_name returned. It also uses the keyword Partition to show you the very first first_name returned in each department.

Using LAST_VALUE

SELECTLast_name, first_name, dept_no

‚LAST_VALUE(first_name)

OVER (ORDER BY dept_no, last_name desc

rows unbounded preceding)AS "Last All"

‚LAST_VALUE(first_name)

OVER (PARTITION BY dept_no

ORDER BY dept_no, last_name desc

rows unbounded preceding) AS "Last Partition"

FROMsql_class.Employee_Table;

The FIRST_VALUE and LAST_VALUE are good to use anytime you need to propagate a value from one row to all or multiple rows based on a sorted sequence. However, the output from the LAST_VALUE function appears to be incorrect or incomplete until you understand a few concepts. The SQL request specifies "rows unbounded preceding" and LAST_VALUE looks at the last row. The current row is always the last row, and therefore, it appears in the output.

Using LAG and LEAD

The LAG and LEAD functions allow you to compare different
rows of a table by specifying an offset from the current row.
You can use these functions to analyze change and variation.

Syntax for LAG and LEAD:

{LAG | LEAD} (<value expression>, [<offset> [, <default>]]) OVER

([PARTITION BY <column reference>[,...]]

ORDER BY <column reference> [ASC | DESC] [,...] );

“Only he who attempts the ridiculous may achieve the impossible.”

– Don Quixote

The above provides information and the syntax for LAG and LEAD.

Using LEAD

SELECT last_name, dept_no

,lead(dept_no)
over (order by dept_no, last_name) as "Lead All"
,lead(dept_no) over (Partition by dept_no
order by dept_no, last_name) as "Lead Partition"

FROM employee_table;

As you can see, the first LEAD brings back the value from the next row except for the last which has no row following it. The offset value was not specified in this example, so it defaulted to a value of 1 row.

Using LEAD With and Offset of 2

SELECT last_name, dept_no

,lead(dept_no,2)
over (order by dept_no, last_name) as "Lead All"
,lead(dept_no,2) over (Partition by dept_no
order by dept_no, last_name) as "Lead Partition"

FROM employee_table;

Above, each value in the first LEAD is 2 rows away, and the partitioning only shows when values are contained in each value group with 1 more than offset value.

Using LAG

SELECT last_name, dept_no

,lag(dept_no)

over (order by dept_no, last_name) as "Lag All"

,lag(dept_no)

over (Partition by dept_no

order by dept_no, last_name) as "Lag Partition"

FROM employee_table;

From the example above, you see that LAG uses the value from a previous row and makes it available in the next row. For LAG, the first row(s) will contain a null based on the value in the offset. Above is defaulted to 1. The first null comes from the function, whereas the second row gets the null from the first row.

Using LAG with an Offset of 2

SELECT last_name, dept_no

,lag(dept_no,2)

over (order by dept_no, last_name) as "Lag All"

,lag(dept_no,2)

over (Partition by dept_no

order by dept_no, last_name) as "Lag Partition"

FROM employee_table;

For this example, the first two rows have a null because there is not a row two rows before these. The number of nulls will always be the same as the offset value. There is a third null because Jones Dept_No is null.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 13 – OLAP Functions

Create new playlist

Sign In

Sign Up

Chapter 13 – OLAP Functions

CSUM

CSUM – The Sort Explained

CSUM – Rows Unbounded Preceding Explained

CSUM – Making Sense of the Data

CSUM – Making Even More Sense of the Data

CSUM – The Major and Minor Sort Key(s)

Reset with a PARTITION BY Statement

PARTITION BY only Resets a Single OLAP not ALL of them

ANSI Moving Window is Current Row and Preceding n Rows

How ANSI Moving SUM Handles the Sort

Quiz – How is that Total Calculated?

Answer to Quiz – How is that Total Calculated?

Moving SUM every 3-rows Vs a Continuous Average

Partition by Resets an ANSI OLAP

Moving Average

The Moving Window is Current Row and Preceding

How Moving Average Handles the Sort

Quiz – How is that Total Calculated?

Answer to Quiz – How is that Total Calculated?

Quiz – How is that 4th Row Calculated?

Answer to Quiz – How is that 4th Row Calculated?

Moving Average every 3-rows Vs a Continuous Average

Partition By Resets an ANSI OLAP

RANK Defaults to Ascending Order

Getting RANK to Sort in DESC Order

RANK() OVER and PARTITION BY

RANK() OVER And LIMIT

PERCENT_RANK() OVER

PERCENT_RANK() OVER with 14 rows in Calculation

PERCENT_RANK() OVER with 21 rows in Calculation

Quiz – What Causes the Product_ID to Reset?

Answer to Quiz – What Cause the Product_ID to Reset?

COUNT OVER for a Sequential Number

Quiz – What caused the COUNT OVER to Reset?

Answer to Quiz – What caused the COUNT OVER to Reset?

The MAX OVER Command

MAX OVER with PARTITION BY Reset

The MIN OVER Command

Quiz – Fill in the Blank

Answer – Fill in the Blank

The Row_Number Command

Quiz – How did the Row_Number Reset?

Quiz – How did the Row_Number Reset?

Standard Deviation Functions Using STDDEV / OVER

Standard Deviation Functions and STDDEV / OVER Syntax

STDDEV / OVER Example

VARIANCE / OVER Syntax

Variance Functions Using VARIANCE / OVER

Using VARIANCE with PARTITION BY Example

Using FIRST_VALUE and LAST_VALUE

Using FIRST_VALUE

Using LAST_VALUE

Using LAG and LEAD

Using LEAD

Using LEAD With and Offset of 2

Using LAG

Using LAG with an Offset of 2

Table of Contents for
Chapter 13 – OLAP Functions

Quiz – How is that 4^th Row Calculated?

Answer to Quiz – How is that 4^th Row Calculated?