EXPAND ON Clause

Purpose

Expands a column having a PERIOD data type, creating a regular time series of rows based on the period value in the input row.

The expansion of a PERIOD column produces one value‑equivalent row for each of the time granules in the epoch or timestamp representation of the specified time granule.

You can perform time series expansions only on PERIOD expressions.

Syntax

where:

Syntax Element …

Specifies …

EXPAND ON expand_expression

the time series expansion information in a query expression.

The EXPAND ON clause generates a regular time series as a sequence of values at each of the granules or at each point of a predefined interval in the specified period from an input row.

The expand_expression variable specifies a PERIOD column name or PERIOD expression on which the selected rows are to be expanded. This is referred to as the input row for the EXPAND ON operation.

If the expansion period is null, then the expanded row has a null expanded value.

The specified column or expression must have a PERIOD data type.

expand_column_alias

AS expand_column_alias

the aliased name of the PERIOD expression to be expanded.

The aliased column or expression must have a PERIOD data type.

You can reference expand_column_alias in the select list, including inside an expression that does not reference any other columns and refers to the expanded value for an expanded row.

You cannot specify expand_column_alias in other clauses within the same query block except for an ORDER BY clause.

You cannot specify expand_column_alias in any clauses in a subquery or correlated subquery, nor can you specify it in any aggregate or statistical function in the select list.

BY expand_interval

the interval literal (interval expression) or anchored literal by which expand_column_name is to be expanded, where expand_interval is one of the valid expand_interval options. See “Expand Interval Literal Expressions” on page 198 for a comprehensive list of the valid expand interval literals.

This value specifies the granularity of the value‑equivalent rows produced by the EXPAND ON clause.

If you do not specify the BY expand_interval option, then the expansion interval defaults to the granularity of the element type of the PERIOD value for the column.

The expansion commences from the beginning value of the expansion period and terminates at the ending value of the expansion period, incrementing by interval_literal for each expanded row.

In this process, the last expanded row might not cover the expansion interval duration, producing only a partial period. By definition, a partial period is a period with a duration that is less than the expanded interval.

ANCHOR anchor_name

ANCHOR PERIOD anchor_name

the expansion is an anchor PERIOD expansion.

Specify PERIOD only for anchor PERIOD expansions.

If you specify PERIOD for noninterval data, then the expansion is an Anchor PERIOD expansion.

If you do not specify PERIOD for noninterval data, then the expansion is an Anchor Point expansion.

The variable anchor_name specifies an anchor name literal. See “Anchor Name Literal Expressions” on page 199 for a comprehensive list of the valid anchor name literals.

AT time_literal]

an optional Time literal (see SQL Data Types and Literals for a comprehensive list of valid Time literals).

If you do not specify a time literal value, the value defaults to ‘00:00:00.000000 00:00’ for an ANCHOR_MILLISECOND, ANCHOR_SECOND, ANCHOR_MINUTE, ANCHOR_HOUR, WEEK_BEGIN, MONTH_BEGIN, QUARTER_BEGIN, or YEAR_BEGIN anchor and to ‘23:59:59.999999 00:00’ for any other anchors.

FOR period_expression

an optional PERIOD expression. For a comprehensive list of valid PERIOD expressions, see SQL Data Types and Literals.

The purpose of specifying a PERIOD expression is to limit the number of rows to be expanded to period_expression, where period_expression represents the period of interest.

The expansion period is the overlapping period of the qualified row and the PERIOD constant you specify as period_expression.

Otherwise, the expanding period is the PERIOD value of the selected row.

The data type of period_expression must be comparable with the PERIOD data type of the expanded column.

If the specified PERIOD expression is either null or does not overlap with the row, then Teradata Database does not expand the row because it does not qualify for expansion.

ANSI Compliance

EXPAND ON is a Teradata extension to the ANSI SQL:2011 standard.

Because of this, Teradata Database reports a warning message if you use an EXPAND ON clause in a session in which the SQL Flagger is set. For information about the SQL Flagger, see SQL Fundamentals.

Note that the Teradata SQL EXPAND ON clause does not share any functionality with the EXPANDING clause of the ANSI SQL:2011 standard.

Time Series, Dense Representations, and Sparse Representations of Temporal Data

A time series is an ordered sequence of measurements of a variable that are arranged according to the time of their occurrence. Time series are typically measured at some constant frequency and their data points are generally, but not necessarily, spaced at uniform time intervals.

The characteristic properties of a time series include the following.

The data points are not independent of one another.

The dispersion of data points varies as a function of time.

The data frequently indicates trends.

The data tends to be cyclic.

Typical business applications for time series analysis include the following.

Budgetary analysis

Economic forecasting

Inventory analysis

Process control

Quality control

Sales forecasting

Stock market analysis

Workload projections

Yield projections

The EXPAND ON clause enables several different forms of time series expansion on a PERIOD column value of an input row by producing a set of value‑equivalent rows, one for each granule in the specified time period. The number of granules is defined by the anchor name you specify for the clause.

You can expand sparse PERIOD representations of relational data into a dense representation of the same data. Data converted to a dense form can be more easily manipulated by complex analyses such as moving average calculations without having to write complex SQL requests to respond to business questions made against sparse relational data.

The available forms of time series expansion for the EXPAND ON clause are the following.

Interval expansion, where rows are expanded by user‑specified intervals.

Anchor point expansion, where rows are expanded by user‑specified anchored points.

Anchor PERIOD expansion, where rows are expanded by user‑specified anchored periods.

Rules and Restrictions for the EXPAND ON Clause

The rules and restrictions are:

There are three different ways to expand rows:

By user‑specified intervals such as 1 DAY, 3 MONTH, and so on.

You must specify an interval literal in the EXPAND ON clause to execute this form of expansion. This type of expansion is called an interval expansion.

An interval expansion might be useful for answering queries such as “compute the moving window average of inventory cost by week during the year 2010."

By user‑specified anchored points in a time line.

You must specify an anchored interval without specifying a PERIOD keyword to execute this form of expansion. Such expansion is called an anchor point expansion.

An anchor point expansion produces a specific date or time point in the expanded rows, where the date or time points are the anchor points present in the input period being expanded.

For example, BY ANCHOR MONTH_BEGIN returns multiple expanded rows, one for each beginning value of a month present in the input period. This form of expansion might be useful for answering queries such as “get the month end average inventory cost during the last quarter of the year 2010.”

By user‑specified anchored time durations in a time line.

You must specify an anchored interval with the PERIOD keyword to execute this form of expansion.

Such an expansion is called as anchor period expansion.

An anchor PERIOD expansion produces rows whose period beginning bounds are always aligned to specific DateTime values derived from the anchor names.

For example, BY ANCHOR PERIOD MONTH_BEGIN, BY ANCHOR PERIOD MONDAY, and so on.

The PERIOD value of the row along with fixed durations enables weighted computations such as “compute the weekly weighted average of inventory cost.”

You can use the EXPAND ON clause in query expressions, except for:

Subqueries in search conditions. See “Specifying Subqueries in Search Conditions” on page 129.

For example, the EXPAND ON clause in the following statement (which assumes the existence of a column named pd1 in employee) is not valid:

   SELECT *

   FROM employee

   WHERE salary IN (SELECT salary

                    FROM salary_table

                    WHERE salary > 10000

                    EXPAND ON pd1);

A WITH clause is specified in the query expression. See “WITH Clause” on page 237.

An updatable cursor SELECT statement. See SQL Stored Procedures and Embedded SQL.

A CREATE JOIN INDEX statement. See “CREATE JOIN INDEX” in SQL Data Definition Language.

The query expression is a SELECT AND CONSUME statement. See “SELECT AND CONSUME” on page 55.

The expand expression can be a column or column alias from the select list or it can be a number that specifies the positional sequence of the column or expression in the select list that is to be expanded.

If the specified column sequential position is not valid, for example, if it is greater than the projected number of columns, Teradata Database returns an error.

The specified column must have a data type of PERIOD.

You must specify a table reference in a SELECT statement with an EXPAND ON clause.

You can specify an EXPAND ON clause in any of the following places:

Within a derived table.

This includes cases where the derived table is contained within a subquery.

See “Example 3: Expansion Over a UNION Operator” on page 202.

As part of the individual query expression of a SELECT statement that specifies set operators such as UNION, INTERSECT, or MINUS/EXCEPT (see SQL Functions, Operators, Expressions, and Predicates).

See “Example 3: Expansion Over a UNION Operator” on page 202.

As part of the SELECT specification in the following SQL DDL statements (see SQL Data Definition Language):

CREATE RECURSIVE VIEW/REPLACE RECURSIVE VIEW in the seed and the recursive statements of the view definition

CREATE TABLE … AS

CREATE VIEW/REPLACE VIEW

Note that a view that contains an EXPAND ON clause in its definition is not updatable.

As part of the SELECT specification in an INSERT … SELECT statement (see “INSERT/INSERT … SELECT” on page 370).

You cannot specify an EXPAND ON clause is any of the following places:

Anywhere in a SELECT AND CONSUME statement (see “SELECT AND CONSUME” on page 55).

Anywhere in an updatable cursor (see SQL Stored Procedures and Embedded SQL).

Anywhere in a subquery used as a search condition.

Anywhere in a SELECT statement that specifies a TOP n operator (see “TOP n Operator” on page 91).

You can work around this restriction by specifying the TOP n operator within a derived table and then specifying the EXPAND ON clause in the outer query.

Anywhere in a SELECT statement that specifies a SAMPLE clause (see “SAMPLE Clause” on page 175).

Anywhere in a SELECT statement that specifies a WITH clause (see “WITH Clause” on page 237).

Anywhere in a table function.

As part of the SELECT specification in a CREATE JOIN INDEX statement (see “CREATE JOIN INDEX” in SQL Data Definition Language).

All of the operations in a query expression except ORDER BY are performed before any rows are expanded.

The ORDER BY operation is then performed on the expanded rows if you specify an expanded column in the ORDER BY clause.

If you do not specify a BY expansion_interval clause, the expanding interval defaults to the granularity of the element type of the PERIOD value for the column.

The expansion defaults to the rules for interval literals described in “Rules and Restrictions for Interval Expansion” on page 194.

To limit the number of rows being expanded, specify a PERIOD expression in the FOR clause, where the PERIOD expression represents the period of interest.

The data type of the PERIOD expression must be compatible with the PERIOD data type of expanded_column_name.

If the specified period is not null and does not overlap with the row, then the row is not eligible for expansion.

Teradata Database expands rows based on the interval you specify.

The interval can either be an interval constant, or an anchored interval derived from the anchor name such as any day of a week, the MONTH_BEGIN keyword, or the MONTH_END keyword.

An EXPAND ON clause specification produces one value‑equivalent row for each time granule derived from the specified expansion_interval. Rows are said to be value‑equivalent if the values in all columns except for their expanded columns are identical.

If you specify a FOR clause, the expansion period is the overlapping period of the qualified row and the PERIOD constant you specify in the FOR clause. Otherwise, the expanding period is the PERIOD value of the selected row.

If the expanding PERIOD expression specifies a column from another table that is not specified in the FROM clause, then Teradata Database joins the tables before the expansion.

You must ensure that the appropriate join condition is specified in such cases. If no other join conditions are specified, then the system performs a Product Join on the referenced tables.

See “Example 15: Join Before Expansion” on page 217.

When you do not specify an expanded column in the select list, but do specify the DISTINCT operator, the EXPAND operation is nullified.

See “Example 16: Nullified EXPAND Operation” on page 218.

If the expanding PERIOD is null, then the expanded row has a null expansion.

See “Example 17: Null Expansion Period Producing a Null Expanded Value” on page 220.

Rules and Restrictions for Interval Expansion

The rules and restrictions for the use of Interval Expansions are.

An expansion commences from the beginning value of the expansion period and ends at its ending value with an incremental value set to the specified interval literal for each row to be expanded.

In this process, the last expanded row might not always cover the expansion interval duration. If this occurs, the result is a partial period, which is a period having a duration that is less than the expansion interval.

If one or more rows in the expanded result has an expansion period duration that is less than the specified interval, and the interval being expanded also has a higher granularity than the element type of the period being expanded, Teradata Database returns a warning to the requestor. See “Example 16: Creating a Time Series Using Expansion By an Interval Constant Value” on page 46.

Unlike anchor period expansions, Teradata Database does not use a system‑defined business calendar set for an interval literal in an EXPAND ON clause.

Anchor Period and Anchor Point Expansion

The rules and restriction for the use of Anchor PERIOD and Anchor Point Expansions are:

If the data type of the expansion expression is PERIOD(DATE), and you specify a time literal in the anchored interval, the request returns an error to the requestor because you cannot specify a time option when the data type of the expansion expression is PERIOD(DATE).

You cannot specify an anchored interval if the data type of the expansion expression is either PERIOD(TIME) or PERIOD(TIME WITH TIME ZONE); otherwise, the request aborts and Teradata Database returns an error to the requestor because the PERIOD(TIME) and PERIOD(TIME WITH TIME ZONE) data types are not valid for an anchored EXPAND ON clause.

If the element type of the expansion expression is a Timestamp data type, Teradata Database uses the time literal value that you specify to define the timestamp value of the anchor during expansion.

If you do not specify a time literal, the time literal value defaults to '00:00:00.000000+00:00' for an anchor name of ANCHOR_MILLISECOND, ANCHOR_SECOND, ANCHOR_MINUTE, ANCHOR_HOUR, WEEK_BEGIN, MONTH_BEGIN, QUARTER_BEGIN, or YEAR_BEGIN and to '23:59:59.999999+00:00' for any other anchor name.

The precision of the default value is set to the precision of the expansion expression. The default value includes the time zone value of +00:00 if the expansion expression specifies a time zone; otherwise, the value is the session time zone and does not specify a time zone.

The anchor, for example MONTH_BEGIN, is computed based on the session time zone. Thus, for two sessions that are at different time zones, the output will probably be different. For an example of this, see “Example 7: Same Expansion in Two Different Sessions in Different Time Zones” on page 208.

If you specify the anchor as a DAY, such as anchor_name is DAY, the expansion interval is INTERVAL '1' DAY for each expanded row.

If a table is to be expanded by the first calendar day of every month, you must specify MONTH_BEGIN in the anchored interval clause.

Each expanded value for the row in the result has the first day of the corresponding month as the BEGIN bound and the expansion interval defaults to INTERVAL '1' MONTH. The BEGIN bound of the expanded value for each result row is 'YYYY-MM-01'.

See the first SELECT request in “Example 4: EXPAND ON MONTH_BEGIN and MONTH_END” on page 204.

If a table is to be expanded by the last calendar day of every month, you must specify MONTH_END in the anchored interval clause.

Each expanded value for the result row has the last day of the corresponding month as its BEGIN bound, and the expansion interval defaults to INTERVAL '1' MONTH.

The BEGIN bound of the expanded value for each result row is 'YYYY-MM-DD' where DD is one of 28, 29, 30, or 31, depending on the month and the year.

See the second SELECT request in “Example 4: EXPAND ON MONTH_BEGIN and MONTH_END” on page 204.

If a table is to be expanded by a particular day of every week, you must specify the desired week day in the anchored interval clause. The begin bound of each expanded value for the result row corresponds to that day of the week, and the expanding interval defaults to INTERVAL '7' DAY.

See “Example 1: Expansion on an Anchor Point Using WEEK_DAY” on page 200.

Rules and Restrictions for Anchor Point Expansion of the EXPAND ON Clause

The rules and restrictions for the use of Anchor Point Expansions of EXPAND ON are:

The following cases are possible with Anchor Point Expansion when the expansion interval is longer than the granularity of the element type of the expand expression:

There is no single anchor point in the expansion period.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2007-08-14', DATE '2007-08-17'), then the expansion period starts on Tuesday and ends on Friday in the same week.

In this case, the input row does not produce any expanded rows in the expanded result.

The beginning bound of the expansion period is not aligned to an anchor point and its duration is longer than the expansion interval.

In this case, the expanded row set does not have a row corresponding to the beginning bound of the expansion period.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2007-08-15', DATE '2007-08-25'), the expanded result contains only one point, which is DATE '2007-08-20'.

The beginning bound of the expansion period is aligned with an anchor point.

In such a case, the expanded rows contain a row with the beginning bound of the expansion period.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2007-08-20', DATE '2007-08-25'), then the last expanded row contains DATE '2007-08-20'.

The expanded rows correspond to all anchor points (the BEGIN point of an anchor period) that are within the expansion period.

This also means that each row expands to as many anchor points that exist in the expansion period.

Anchor Period Expansion of the EXPAND ON Clause

The rules and restrictions for the use of Anchor Period Expansion of EXPAND ON are:

The expanded rows correspond to all anchor periods that overlap the expansion period. The key difference between anchor period expansion and anchor point expansion is that for anchor point expansion, the anchor point, which is the beginning bound of an anchor period, must occur within the expansion period, while for anchor period expansions, the anchor period must overlap the expansion period.

The following three cases are the only valid applications of anchor period expansions.

The expansion period has a duration that is less than that of the expansion interval.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2011-08-14', DATE '2011-08-17'), then the expansion period starts on Tuesday, ends on Friday of the same week, and the expanded row is a singleton with a resulting anchor period of PERIOD(DATE '2011-08-13', DATE'2011-08-20').

In this case, there is only one expanded row for the input row.

The expansion period is not aligned with an anchor period, but the duration of the expansion period is greater than the expansion interval.

Such a row produces more than one expanded row in its result.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2011-08-15', DATE '2011-08-25').

In this case, the expanded result has two anchor periods, PERIOD(DATE '2011‑08-13', DATE '2011-08-20') and PERIOD(DATE '2011-08-20', DATE '2011‑08‑27').

The expanding period is aligned with an anchor period and its duration is an exact multiple of the expansion interval.

For example, if the expansion is by MONDAY and the expansion period is PERIOD(DATE '2011-08-20', DATE '2011-08-27'), then the expanded row contains PERIOD(DATE '2011-08-20', DATE '2011-08-27').

Teradata Database uses a system-defined business calendar set in the session for the anchored interval in an EXPAND ON clause.

In an anchor expansion, Teradata Database derives the BEGIN bound of the first expanded row from the business calendar set in the current session. The BEGIN bound of the first expanded row is derived from the YearBeginDate, WeekStart, and CalendarPeriod columns from the BusinessCalendarPattern table.

For example, a row has period from 1-Jan-2008 to 31-May-2008, YearBeginDate is 15-Mar-2008, and CalendarPeriod is from 1-Jan-08 to 30-Oct-08. Expanding by anchor point MONTH_BEGIN results in 4 rows, where the first row is from 15-Jan-08 to 15‑Feb-08 and second row is from 15-Feb-08 to 15-Mar-08 and 3rd row is from 15‑Mar-08 to 15-Apr08 and 4th rows is from 15-Apr-08 to 15-May-08. The MONTH_BEGIN value is derived from the YearBeginDate value, which is January 15 and, because the Calendar begins in January, the first month starts on January 15.

This table lists the EXPAND ON clause anchor names.

Anchor Name	Expanding Interval	Default Time Literal	EXPAND Result
ANCHOR_MILLISECOND	INTERVAL ‘1’ MILLISECOND	00:00:00.000000+00:00	Each expanded value for the row in the result adjusts the anchor to the nearest millisecond boundary.
ANCHOR_SECOND	INTERVAL ‘1’ SECOND	00:00:00.000000+00:00	Each expanded value for the row in the result adjusts the anchor to the nearest second boundary.
ANCHOR_MINUTE	INTERVAL ‘1’ MINUTE	00:00:00.000000+00:00	Each expanded value for the row in the result adjusts the anchor to the nearest minute boundary.
ANCHOR_HOUR	INTERVAL ‘1’ HOUR	00:00:00.000000+00:00	Each expanded value for the row in the result adjusts the anchor to the nearest hour boundary.
WEEK_BEGIN	INTERVAL ‘7’ DAY	00:00:00.000000+00:00	Each expanded value for the row in the result has the first day of the corresponding week as its BEGIN bound.
WEEK_END	INTERVAL ‘7’ DAY	23:59:59.999999+00:00	Each expanded value for the row in the result has the last day of the corresponding week as its BEGIN bound.
MONTH_BEGIN	INTERVAL ‘1’ MONTH	00:00:00.000000+00:00	Each expanded value for the row in the result has the first day of the corresponding month as its BEGIN bound.
MONTH_END	INTERVAL ‘1’ MONTH	23:59:59.999999+00:00	Each expanded value for the row in the result has the last day of the corresponding month as its BEGIN bound.
QUARTER_BEGIN	INTERVAL ‘3’ MONTH	00:00:00.000000+00:00	Each expanded value for the row in the result has the first day of the corresponding quarter as its BEGIN bound.
QUARTER_END	INTERVAL ‘3’ MONTH	23:59:59.999999+00:00	Each expanded value for the row in the result has the last day of the corresponding quarter as its BEGIN bound.
YEAR_BEGIN	INTERVAL ‘1’ YEAR	00:00:00.000000+00:00	Each expanded value for the row in the result has the first day of the corresponding year as its BEGIN bound.
YEAR_END	INTERVAL ‘1’ YEAR	23:59:59.999999+00:00	Each expanded value for the row in the result has the last day of the corresponding year as its BEGIN bound.

The WEEK_BEGIN anchor name corresponds to an anchor expansion by WeekStart.

For example, if the business calendar for the session is ISO and you specify expansion by WEEK_BEGIN, Teradata Database translates the WEEK_BEGIN expansion to EXPAND ON ….BY ANCHOR MONDAY because the week begins on Monday.

The following anchors may give different results for the ISO calendar than they give for the Teradata or COMPATIBLE calendars.

MONTH_BEGIN

MONTH_END

QUARTER_BEGIN

QUARTER_END

YEAR_BEGIN

YEAR_END

For information on how the ISO calendar computes results, see the section named “About ISO Computation” in SQL Functions, Operators, Expressions, and Predicates.

Expand does not consider exceptions even for a first date.

For example, if you want to execute an expansion by business days, you must perform a normal expansion and then apply business functions on all of the expanded rows using a derived table to get the desired expansion by business days.

Expand Interval Literal Expressions

The following table explains the meanings of the expand interval syntax variables for the EXPAND ON clause.

Expand Interval	Description
interval_literal	To perform an interval expansion, specify any valid Interval literal value (see SQL Data Types and Literals for a comprehensive list of valid Interval literals).
ANCHOR anchor_name	To perform an anchor point expansion, specify an anchor name, but do not specify PERIOD.
ANCHOR PERIOD anchor_name	To perform an anchor period expansion, you must specify both PERIOD and an anchor name.

Anchor Name Literal Expressions

The following table lists the ANCHOR anchor_name variables for the EXPAND ON clause.

Anchor Name	Description
ANCHOR_MILLISECOND	Produces multiple expanded rows, one for each millisecond in the input period.
ANCHOR_SECOND	Produces multiple expanded rows, one for each second in the input period.
ANCHOR_MINUTE	Produces multiple expanded rows, one for each minute in the input period.
ANCHOR_HOUR	Produces multiple expanded rows, one for each hour in the input period.
DAY	The expansion interval is INTERVAL '1' DAY for each expanded row.
WEEK_BEGIN	Produces multiple expanded rows, one for each beginning value of a week present in the input period.
WEEK_END	Produces multiple expanded rows, one for each ending value of a week present in the input period.
MONTH_BEGIN	Produces multiple expanded rows, one for each beginning value of a month present in the input period. An example query that uses such expansion is “Get the month end average inventory cost during the last quarter of the year 2010."
MONTH_END	Produces multiple expanded rows, one for each ending value of a month present in the input period.
QUARTER_BEGIN	Produces multiple expanded rows, one for each beginning value of a quarter present in the input period.
QUARTER_END	Produces multiple expanded rows, one for each ending value of a quarter present in the input period.
YEAR_BEGIN	Produces multiple expanded rows, one for each beginning value of a year present in the input period.
YEAR_END	Produces multiple expanded rows, one for each ending value of a year present in the input period.
MONDAY	An anchor period expansion produces rows whose period beginning bounds are always aligned to specific DateTime values derived from the anchor names. For this specification, the period beginning bound is the first Monday in the specified period.
TUESDAY	The period beginning bound is the first Tuesday in the specified period.
WEDNESDAY	The period beginning bound is the first Wednesday in the specified period.
THURSDAY	The period beginning bound is the first Thursday in the specified period.
FRIDAY	The period beginning bound is the first Friday in the specified period.
SATURDAY	The period beginning bound is the first Saturday in the specified period.
SUNDAY	The period beginning bound is the first Sunday in the specified period.

Example : Expansion on an Anchor Point Using WEEK_DAY

Create a table named tdate with the following definition.

     CREATE SET TABLE tdate (

       id       INTEGER,

       quantity INTEGER,

       pd       PERIOD(DATE))

     PRIMARY INDEX (id);

You insert two rows into tdate so its contents are as follows.

id

--

quantity

--------

pd

--

2005-02-03, 2005-06-20

2004-01-03, 2004-05-20

Submit a SELECT statement against tdate that specifies an EXPAND ON clause anchored by a day of the week, Monday, so that each expanded row begins from a Monday, as specified in the statement, and the duration of each expanded period is seven days.

     SELECT id, BEGIN(bg)

     FROM tdate

     EXPAND ON pd AS bg BY ANCHOR MONDAY;

Teradata Database returns tdate details for each week of a given period, beginning on the first Monday from the eligible data, as you specified in the BY ANCHOR clause of the statement.

Because the first row in tdate starts on a Thursday, not a Monday, the expanded row starts on the next sequential Monday date, which is February 7, and then continues in weekly granular increments.

	`id` `--`	`begin(bg)` `---------`
	`11`	`2005-02-07`
	`11`	`2005-02-14`
	`11`	`2005-02-21`
	`11`	`2005-02-28`
	`11`	`2005-03-07`
	`11`	`2005-03-14`
	`11`	`2005-03-21`
	`11`	`2005-03-28`
	`11`	`2005-04-04`
	`11`	`2005-04-11`
	`11`	`2005-04-18`
	`11`	`2005-04-2`5
	`11`	`2005-05-02`
	`11`	`2005-05-09`
	`11`	`2005-05-16`
	`11`	`2005-05-23`
	`11`	`2005-05-30`
	`11`	`2005-06-06`
	`11`	`2005-06-13`
	`10`	`2004-01-05`
	10	`2004-01-12`
	10	`2004-01-12`
	10	`2004-01-19`
	10	`2004-01-26`
	10	`2004-02-02`
	10	`2004-02-09`
	10	`2004-02-16`
	10	`2004-02-23`
	10	`2004-03-01`
	10	`2004-03-08`
	10	`2004-03-15`
	10	`2004-03-22`
	10	`2004-03-29`
	10	`2004-04-05`
	10	`2004-04-12`
	10	`2004-04-19`
	10	`2004-04-26`
	10	`2004-05-03`
	10	`2004-05-10`
	10	`2004-05-17`

Example : Expansion on an Anchor Point Using ANCHOR_SECOND

Create a table named t2 with the following definition.

     CREATE TABLE t2 (

       id       INTEGER

       quantity INTEGER

       pd       PERIOD(DATE))

       PRIMARY INDEX (id);

The input row for the example statement is of type TIMESTAMP and contains the following timestamp period data. (2011-01-01 10:15:20.000001, 2011-01-01 10:15:25.000009).

Submit the following SELECT statement on table t2.

     SELECT BEGIN(expd)

     FROM t2

     EXPAND ON pd AS expd BY ANCHOR ANCHOR_SECOND;

The result set contains 5 rows because the input row has a period of 5 seconds, beginning with 10:15:20.000001 and ending with 10:15:25.000009.

begin expd

----------

2011-01-01 10:15:21.000000

2011-01-01 10:15:22.000000

2011-01-01 10:15:23.000000

2011-01-01 10:15:24.000000

2011-01-01 10:15:25.000000

Example : Expansion Over a UNION Operator

Suppose you create a table named tdate1 with the following definition.

     CREATE SET TABLE tdate1 (

       id       INTEGER,

       quantity INTEGER,

       pd       PERIOD(DATE))

     PRIMARY INDEX (id);

Table tdate1 contains the following rows.

id

--

quantity

--------

pd

--

2006-02-03, 2006-06-20

2005-01-03, 2005-05-20

You now submit the following unioned SELECT statement against tdate, as defined in “Example 1: Expansion on an Anchor Point Using WEEK_DAY” on page 200, and tdate1 that specifies an EXPAND ON clause for both statements, each having a one month interval granularity.

     SELECT id, quantity, expd

     FROM tdate

     EXPAND ON pd AS expd BY INTERVAL '1' MONTH

     UNION

     SELECT id, quantity, expd

     FROM tdate1

     EXPAND ON pd AS expd BY INTERVAL '1' MONTH;

In this example, Teradata Database first expands the rows in tables tdate and tdate1 and then unions the resulting rows from the queries on the expanded results.

Also note that Teradata Database returns a warning to the requestor that some rows in the expanded result might have an expanded period duration that is less than the duration of the specified interval.

Teradata Database returns tdate1 details for each month of a given period for these two queries and then unions the result rows as follows.

`id` `--`	`quantity` `--------`	`expd` `----`
`11`	`110`	`2005-02-03, 2005-03-03`
`11`	`110`	`2005-03-03, 2005-04-03`
`11`	`110`	`2005-04-03, 2005-05-03`
`11`	`110`	`2005-05-03, 2005-06-03`
`11`	`110`	`2005-06-03, 2005-06-20`
`10`	`100`	`2004-01-03, 2004-02-03`
10	`100`	`2004-02-03, 2004-03-03`
10	`100`	`2004-03-03, 2004-04-03`
10	`100`	`2004-04-03, 2004-05-03`
10	`100`	`2004-05-03, 2004-05-20`
12	`120`	`2006-02-03, 2006-03-03`
12	`120`	`2006-03-03, 2006-04-03`
12	`120`	`2006-04-03, 2006-05-03`
12	`120`	`2006-05-03, 2006-06-03`
12	`120`	`2006-06-03, 2006-06-20`
13	`130`	`2005-01-03, 2005-02-03`
13	`130`	`2005-02-03, 2005-03-03`
13	`130`	`2005-03-03, 2005-04-03`
13	`130`	`2005-04-03, 2005-05-03`
13	`130`	`2005-05-03, 2005-05-20`

Example : EXPAND ON MONTH_BEGIN and MONTH_END

This example shows how to use MONTH_BEGIN and MONTH_END in an EXPAND ON clause.

First create the ttimestamp table as follows.

     CREATE SET TABLE ttimestamp (

       id       INTEGER,

       quantity INTEGER,

       pd       PERIOD(TIMESTAMP(0)))

     PRIMARY INDEX (id);

Table ttimestamp contains the following rows.

id

--

quantity

--------

pd

--

2005-02-01 01:10:40, 2005-06-20 05:20:50

2004-01-03 01:10:40, 2004-05-31 20:20:50

When you specify an EXPAND ON clause by MONTH_BEGIN or MONTH_END, every expanded row starts from either the MONTH_BEGIN value or from the MONTH_END value for that month, and the granularity of each expanded period is one month. In this example, the table data is shown with the default session time zone set to INTERVAL ‘00:00’ HOUR TO MINUTE.

     SET TIME ZONE INTERVAL ‘00:00’ HOUR TO MINUTE;

The following SELECT statement specifies an EXPAND … BY MONTH_BEGIN.

     SELECT id, quantity, BEGIN(bg)

     FROM ttimestamp

     EXPAND ON pd AS bg BY ANCHOR MONTH_BEGIN;

Each row is expanded at the default time literal value 00:00:00+00:00 for each MONTH_BEGIN value.

id --	qty ---	bg --
11	110	2005-03-01 00:00:00
11	110	2005-04-01 00:00:00
11	110	2005-05-01 00:00:00
11	110	2005-06-01 00:00:00
10	100	2004-02-01 00:00:00
10	100	2004-03-01 00:00:00
10	100	2004-04-01 00:00:00
10	100	2004-05-01 00:00:00

The following SELECT statement specifies an EXPAND … BY ANCHOR MONTH_END, but is otherwise identical to the previous statement.

     SELECT id, quantity, BEGIN(bg)

     FROM ttimestamp

     EXPAND ON pd AS bg BY ANCHOR MONTH_END;

Each row is expanded at the default time literal value 23:59:59+00:00 at each month end.

id --	quantity --------	bg --
11	110	2005-02-28 23:59:59
11	110	2005-03-31 23:59:59
11	110	2005-04-30 23:59:59
11	110	2005-05-31 23:59:59
10	100	2004-01-31 23:59:59
10	100	2004-02-29 23:59:59
10	100	2004-03-31 23:59:59
10	100	2004-04-30 23:59:59

Example : EXPAND ON and DISTINCT

This example shows how Teradata Database performs the DISTINCT operation after expansion occurs.

     CREATE SET TABLE products (

       product_id       INTEGER,

       product_price    DECIMAL(5,2),

       product_duration PERIOD(DATE))

     PRIMARY INDEX (product_id);

Assume that you have the following rows in products.

`product_id` `----------`	`product_price` -------------	`product_duration` ----------------
1000	100.00	2007-02-15, 2007-08-11
1001	99.99	2007-03-04, 2007-05-01
1001	101.10	2008-05-10, 2009-05-10
1001	1-4.10	2007-07-16, 2008-10-09

When you specify a DISTINCT operator in the query expression and the expanded column in the select list of your query, Teradata Database performs the DISTINCT operation after expansion and removes the duplicate rows from the expanded result (see the example below).

     SELECT DISTINCT product_id, pd

     FROM products

     EXPAND ON product_duration AS pd BY ANCHOR PERIOD MONTH_BEGIN;

This SELECT statement returns the following response set.

	product_id	pd
	1000	2007-02-01, 2007-03-01
	1000	2007-03-01, 2007-04-01
	1000	2007-04-01, 2007-05-01
	1000	2007-05-01, 2007-06-01
	1000	2007-06-01, 2007-07-01
	1000	2007-07-01, 2007-08-01
	1000	2007-08-01, 2007-09-01
	1001	2007-03-01, 2007-04-01
	1001	2007-04-01, 2007-05-01
	1001	2008-05-01, 2008-06-01
	1001	2008-06-01, 2008-07-01
	1001	2008-07-01, 2008-08-01
	1001	2008-08-01, 2008-09-01
	1001	2008-09-01, 2008-10-01
	1001	2008-10-01, 2008-11-01
	1001	2008-11-01, 2008-12-01
	1001	2008-12-01, 2009-01-01
	1001	2009-01-01, 2008-02-01
	1001	2009-02-01, 2009-03-01
	1001	2009-03-01, 2009-04-01
	1001	2009-04-01, 2009-05-01
	1001	2009-05-01, 2009-06-01
	1001	2007-07-01, 2007-08-01
	1001	2007-08-01, 2007-09-01
	1001	2007-09-01, 2007-10-01
	1001	2007-10-01, 2007-11-01
	1001	2007-11-01, 2007-12-01
	1001	2007-12-01, 2008-01-01
	1001	2008-01-01, 2008-02-01
	1001	2008-02-01, 2008-03-01
	1001	2008-03-01, 2008-04-01
	1001	2008-04-01, 2008-05-01

Example : Same Expansion in Two Different Sessions Using Different Time Zone Intervals

The beginning bound of the expanding period value is adjusted to start at the time specified by time_literal before it expands the rows. The anchor point, for example MONTH_BEGIN, is computed based on the session time zone. As a result, the output can be different for two sessions that are at two different time zones.

In this example the time zone for the first session is set to INTERVAL -’01:00’ HOUR TO MINUTE, and the time zone for the second session is set to INTERVAL ‘02:00’ HOUR TO MINUTE.

The output of the identical SELECT statement submitted in the two time zones differs, with the statement submitted in the second session returning one additional row (shaded in red).

You set the time zone for the first session as follows and then submit the indicated SELECT statement anchored on MONTH_BEGIN:

     SET TIME ZONE INTERVAL -'01:00' HOUR TO MINUTE;

     SELECT id, quantity, BEGIN(pd) AS bg

     FROM ttimestamp

     EXPAND ON PERIOD bg BY ANCHOR MONTH_BEGIN;

id --	quantity --------	bg --
11	110	2005-03-01 00:00:00
11	110	2005-04-01 00:00:00
11	110	2005-05-01 00:00:00
11	110	2005-06-01 00:00:00
10	100	2004-02-01 00:00:00
10	100	2004-03-01 00:00:00
10	100	2004-04-01 00:00:00
10	100	2004-05-01 00:00:00

You set the time zone for the second session as follows and then submit the indicated SELECT statement anchored on MONTH_END:

     SET TIME ZONE INTERVAL '02:00' HOUR TO MINUTE;

     SELECT id, quantity, BEGIN(pd) AS bg

     FROM ttimestamp

     EXPAND ON PERIOD bg BY ANCHOR MONTH_BEGIN;

id --	quantity --------	bg --
11	110	2005-02-01 00:00:00
11	110	2005-03-01 00:00:00
11	110	2005-04-01 00:00:00
11	110	2005-05-01 00:00:00
11	110	2005-06-01 00:00:00
10	100	2004-02-01 00:00:00
10	100	2004-03-01 00:00:00
10	100	2004-04-01 00:00:00
10	100	2004-05-01 00:00:00E

Example : Same Expansion in Two Different Sessions in Different Time Zones

The beginning bound of the expanding period value is adjusted to start at the time specified by time_literal before it expands the rows. The anchor point, for example MONTH_BEGIN, is computed based on the session time zone. As a result, the output can be different for two sessions that are at two different time zones.

In the following example, the time literal defaults to 00:00:00 at the session time zone because the EXPAND ON clause input row does not specify a time zone (because the value of duration has a PERIOD(TIMESTAMP) data type). After the time literal is converted to UTC, it is the previous day; therefore, the previous day-to-month begin is checked with the row, and when it is returned, Teradata Database adds the session time.

First you create the following table.

     CREATE SET TABLE test (

       testid   INTEGER,

       duration PERIOD(TIMESTAMP))

     PRIMARY INDEX (testid);

Table test contains the following row.

testid

------

duration (at UTC)

-----------------

ABC

2002-01-31 15:30:00, 2002-05-31 15:00:00

You then perform the following anchor point expansion by MONTH_BEGIN with a default time zone literal.

     SET TIME ZONE INTERVAL '09:00' HOUR TO MINUTE;

     SELECT BEGIN(xyz)

     FROM test

     EXPAND ON duration AS xyz BY ANCHOR MONTH_BEGIN;

This statement returns the following rows:

	BEGIN(xyz) ----------
	2002-03-01 00:00:00
	2002-04-01 00:00:00
	2002-05-01 00:00:00

In the following example, the time literal is 20:00:00 at session time zone because the input row does not specify a time zone (because the value of duration has a PERIOD(TIMESTAMP) data type). After the time literal is converted to UTC, it is the next day; therefore, Teradata Database checks the next day to month end with the row, and when it is returns the row, it adds the session time.

You have the following table.

     CREATE SET TABLE test1 (

       testid   INTEGER,

       duration PERIOD(TIMESTAMP))

     PRIMARY INDEX (testid);

Table test1 contains the following row.

testid

------

duration (at UTC)

-----------------

ABC

2005-12-03 04:30:00, 2006-04-01

You perform the following anchor period expansion by MONTH_END with a time literal and with a default session time zone.

     SET TIME ZONE INTERVAL -'07:00' HOUR TO MINUTE;

     SELECT BEGIN(xyz)

     FROM test1

     EXPAND ON duration AS xyz BY ANCHOR MONTH_END AT TIME ‘20:00:00’;

	BEGIN(XYZ) ----------
	2005-12-31 20:00:00
	2006-01-31 20:00:00
	2006-02-28 20:00:00

In the next example, the time literal value is 07:00:00 at time zone +10:00. After the time literal is converted to UTC, the time is 21:00 on the previous day. Therefore, Teradata Database checks the previous day to month end value with the row and, when the time series value is returned by the statement, Teradata Database adds the session time, which is 00:00.

You have the following table.

     CREATE SET TABLE test2 (

       testid   INTEGER,

       duration PERIOD(TIMESTAMP))

     PRIMARY INDEX (testid);

Table test2 contains the following row.

testid

------

duration (at UTC)

-----------------

timeseries

2005-12-30 22:30:00, 2006-04-29 18:00:00

You perform the following anchor period expansion by MONTH_END, specifying both a time literal and a time zone.

     SET TIME ZONE INTERVAL '00:00' HOUR TO MINUTE;

     SELECT timeseries

     FROM test2

     EXPAND ON duration AS timeseries BY ANCHOR PERIOD MONTH_END

                                         AT TIME '07:00:00+10:00';

This statement returns the following row set.

	timeseries ----------
	2005-12-30 21:00:00, 2006-01-30 21:00:00
	2006-01-30 21:00:00, 2006-02-27 21:00:00
	2006-02-27 21:00:00, 2006-03-30 21:00:00
	2006-03-30 21:00:00, 2006-04-29 21:00:00

Example : EXPAND ON and OLAP Functions

This example shows the use of the EXPAND ON clause in a SELECT statement that also specifies an OLAP function. In this case, the OLAP function specified is RANK. For details about the RANK function, see SQL Functions, Operators, Expressions, and Predicates.

First create the player_history table as follows.

     CREATE SET TABLE player_history (

       player_id INTEGER,

       duration  PERIOD(DATE),

       grade     CHARACTER(1))

     PRIMARY INDEX (player_id);

The player_history table contains the following rows.

`player_id` `---------`	`duration` `--------`	`grade` `-----`
1000	2007-06-03, 2007-07-03	A
1000	2007-07-03, 2007-08-03	B
1000	2007-08-03, 2007-09-03	C
1001	2007-07-03, 2007-08-03	A
1001	2007-08-03, 2007-09-03	D
1001	2007-09-03, 2007-10-03	E

The following SELECT statement specifies the RANK OLAP function in the select list.

     SELECT playerid, BEGIN(expd), RANK(grade ASC) AS a, grade

     FROM player_history

     WHERE player_id = 1000 QUALIFY a < 3

     EXPAND ON duration AS expd BY ANCHOR MONDAY;

The query returns the following response set.

`player_id` `---------`	`BEGIN(expd)` -----------	`a` -	`grade` -----
1000	2007-06-04	1	A
1000	2007-06-11	1	A
1000	2007-06-18	1	A
1000	2007-06-25	1	A
1000	2007-07-02	1	A
1000	2007-07-09	2	B
1000	2007-07-16	2	B
1000	2007-07-23	2	B
1000	2007-07-30	2	B

Example : EXPAND ON With a Join

This example shows using a join with the EXPAND ON clause.

First create the student table.

     CREATE SET TABLE student (

       id                 INTEGER,

       name               CHARACTER(10) CHARACTER SET LATIN

                          NOT CASESPECIFIC,

       duration_begin_end PERIOD(DATE))

     PRIMARY INDEX (id);

The student table contains the following row.

id

--

name

----

duration_begin_end

------------------

ABC

2004-01-01, 2004-12-31

Now create the course table.

     CREATE SET TABLE course (

       name             CHARACTER(10) CHARACTER SET LATIN

                        NOT CASESPECIFIC,

       student_id       INTEGER,

       course_begin_end PERIOD(DATE))

     PRIMARY INDEX (name);

The course table contains the following rows.

`name` `----`	`student_id` `----------`	`course_begin_end` `----------------`
CPP	101	2004-08-01,2004-08-30
Java	101	2004-07-01,2004-07-30
C	101	2004-04-01,2004-06-30

Submit a SELECT statement that returns the month a student was enrolled in a particular course.

This statement joins the student table (expanded on an interval literal and aliased as dt) with the course table using a mix of equality and inequality predicates in its WHERE clause.

     SELECT course.name, EXTRACT(MONTH FROM BEGIN(expd)

     FROM (SELECT student_id, expd

           FROM student

           EXPAND ON duration_begin_end AS expd BY INTERVAL '1' MONTH)

                     AS dt, course AS c

     WHERE c.student_id = dt.id

     AND   (BEGIN(c.course_begin_end) < END(expd)

     AND    BEGIN(expd) < END(c.course_begin_end)

     AND    dt.id = 101;

This statement returns the following five rows.

	course-name -----------	extract(month from expd) ------------------------
	C	4
	C	5
	C	6
	CPP	8
	Java	7

Teradata Database also returns a 9308 warning message for this statement.

Example : EXPAND ON For an Anchored Interval

This example shows the use of an anchored interval for doing anchor period and anchor point expansions. For an anchor period expansion, the expanded period value must overlap the expanding period, while for an anchor point expansion, the begin value of the expanded period value must be contained in the expanding period, which is a more restrictive condition.

First create the sold_products table

     CREATE SET TABLE sold_products, NO FALLBACK  (

       product_id       INTEGER,

       product_price    DECIMAL(10,2),

       product_duration PERIOD(DATE))

     PRIMARY INDEX (product_id);

The sold_products table contains the following rows.

product_id

----------

product_price

-------------

product_duration

----------------

100.00

2007-02-15, 2007-08-11

 99.99

2007-03-04, 2007-05-01

The following SELECT statement specifies an anchor period of MONTH_BEGIN. This is an anchor period expansion.

     SELECT product_id, product_price, product_duration, expd

     FROM sold_products

     EXPAND ON product_duration AS expd BY ANCHOR PERIOD MONTH_BEGIN;

The statement returns the following nine rows.

product_id ----------	product_price -------------	product_duration ----------------	expd ----
1000	100.00	2007-02-15,2007-08-11	2007-02-01,2007-03-01
1000	100.00	2007-02-15,2007-08-11	2007-03-01,2007-04-01
1000	100.00	2007-02-15,2007-08-11	2007-04-01,2007-05-01
1000	100.00	2007-02-15,2007-08-11	2007-05-01,2007-06-01
1000	100.00	2007-02-15,2007-08-11	2007-06-01,2007-07-01
1000	100.00	2007-02-15,2007-08-11	2007-07-01,2007-08-01
1000	100.00	2007-02-15,2007-08-11	2007-08-01,2007-09-01
1001	99.99	2007-03-04,2007-05-01	2007-03-01,2007-04-01
1001	99.99	2007-03-04,2007-05-01	2007-04-01,2007-05-01

For an anchor point expansion done on the same data, the shaded rows would not appear, as the following example shows.

Note the following distinction between anchor point and anchor period expansions.

IF the expansion is an …	THEN the …
anchor period expansion	expanded period value must overlap the expanding period.
anchor point expansion	begin value of the expanded period value must be contained within the expanding period.

Submit the following SELECT statement, which differs from the previous statement only in specifying the BEGIN bound function on product_duration instead of simply specifying the column name. This is an anchor point expansion done on the same data as the previous anchor period expansion.

     SELECT product_id, product_price, product_duration, BEGIN(expd)

     FROM sold_products

     EXPAND ON product_duration AS expd BY ANCHOR MONTH_BEGIN;

This statement returns seven rows, rather than nine, with the rows shaded in red from the previous example not appearing in the result set.

product_id ----------	product_price -------------	product_duration ----------------	begin(expd) -----------
1000	100.00	2007-02-15,2007-08-11	2007-03-01
1000	100.00	2007-02-15,2007-08-11	2007-04-01
1000	100.00	2007-02-15,2007-08-11	2007-05-01
1000	100.00	2007-02-15,2007-08-11	2007-06-01
1000	100.00	2007-02-15,2007-08-11	2007-07-01
1000	100.00	2007-02-15,2007-08-11	2007-08-01
1001	99.99	2007-03-04,2007-05-01	2007-04-01

Example : EXPAND ON and Span Grouping

This example shows the use of the EXPAND ON clause with grouping on a span of entries from the select list.

First create the stock table.

     CREATE SET TABLE stock (

       stock_id        INTEGER,

       stock_quantity  INTEGER,

       begin_end_date  PERIOD(DATE))

     PRIMARY INDEX (stockid);

The stock table contains the following rows.

stock_id

--------

stock_quantity

--------------

begin_end_date

--------------

2005-10-10,2005-11-15

2005-06-01,2005-08-31

This example shows how you can compute a weighted average for the stock_quantity column on a monthly basis.

Assume that udf_agspan is an aggregate UDF that adds the stock quantity for given month of a year and then divides the sum by the number of days in that month. This gives a different result when compared to the AVG function when the row is not spanning the whole month.

     SELECT udf_agspan(stock_quantity,

            EXTRACT(YEAR FROM BEGIN(expdcol)),

            EXTRACT(MONTH FROM BEGIN(expdcol)))

            (FORMAT ‘-----9.999’) AS wavg,

            EXTRACT(YEAR FROM BEGIN(expdcol) AS yr,

            EXTRACT(MONTH FROM BEGIN(expdcol) AS mn,

            stock_id

     FROM (SELECT stock.*, expdcol

           FROM stock

           EXPAND ON begin_end_date AS expdcol BY INTERVAL '1'DAY) AS dt

     GROUP BY 2,3,4;

This statement returns the following rows.

wavg --------	yr ----	mn --	stock_id --------
141.935	2005	10	100
93.333	2005	11	100
20.000	2005	06	101
20.000	2005	07	101
19.355	2005	08	101

Example : EXPAND ON for a Moving Average

This example shows how to create a moving average, which is a common method of smoothing time series data, on the data in the price column.

First create the stk table.

     CREATE SET TABLE stk (

       stock_id  INTEGER,

       price     FLOAT,

       validity  PERIOD(TIMESTAMP))

     PRIMARY INDEX (stock_id);

The stk table contains the following set of 30 rows.

`stock_id` `--------`	`price` -----	`validity` --------
1000	10.00	2006-01-01 09:00:00 , 2006-01-01 12:00:00
1000	11.00	2006-01-01 12:00:00 , 2006-01-01 15:00:00
1000	9.00	2006-01-01 15:00:00 , 2006-01-01 18:00:00
1000	12.00	2006-01-02 09:00:00 , 2006-01-02 12:00:00
1000	13.00	2006-01-02 12:00:00 , 2006-01-02 15:00:00
1000	14.00	2006-01-02 15:00:00 , 2006-01-02 18:00:00
1000	8.00	2006-01-03 09:00:00 , 2006-01-03 12:00:00
1000	5.00	2006-01-03 12:00:00 , 2006-01-03 15:00:00
1000	15.00	2006-01-03 15:00:00 , 2006-01-03 18:00:00
1000	19.00	2006-01-04 09:00:00 , 2006-01-04 12:00:00
1000	16.00	2006-01-04 12:00:00 , 2006-01-04 15:00:00
1000	16.00	2006-01-04 15:00:00 , 2006-01-04 18:00:00
1000	16.00	2006-01-05 09:00:00 , 2006-01-05 12:00:00
1000	16.00	2006-01-05 12:00:00 , 2006-01-01 15:00:00
1000	16.00	2006-01-05 15:00:00 , 2006-01-05 18:00:00
1001	20.00	2006-01-01 09:00:00 , 2006-01-01 12:00:00
1001	21.00	2006-01-01 12:00:00 , 2006-01-01 15:00:00
1001	19.00	2006-01-01 15:00:00 , 2006-01-01 18:00:00
1001	22.00	2006-01-02 09:00:00 , 2006-01-02 12:00:00
1001	23.00	2006-01-02 12:00:00 , 2006-01-02 15:00:00
1001	24.00	2006-01-02 15:00:00 , 2006-01-02 18:00:00
1001	18.00	2006-01-03 09:00:00 , 2006-01-03 12:00:00
1001	15.00	2006-01-03 12:00:00 , 2006-01-03 15:00:00
1001	25.00	2006-01-03 15:00:00 , 2006-01-03 18:00:00
1001	29.00	2006-01-04 09:00:00 , 2006-01-04 12:00:00
1001	26.00	2006-01-04 12:00:00 , 2006-01-04 15:00:00
1001	26.00	2006-01-04 15:00:00 , 2006-01-04 18:00:00
1001	26.00	2006-01-05 09:00:00 , 2006-01-05 12:00:00
1001	24.00	2006-01-05 12:00:00 , 2006-01-01 15:00:00
1001	24.00	2006-01-05 15:00:00 , 2006-01-05 18:00:00

This example returns a moving average of stock over a three day period.

     SELECT stock_id, CAST (p AS DATE), AVG(price)

            OVER (PARTITION BY stock_id

                  ORDER BY p ROWS

                  2 PRECEDING)

     FROM (SELECT stock_id, price, BEGIN(p)

           FROM stk

           EXPAND ON validity AS p

                     BY ANCHOR DAY AT TIME ‘17:59:59’

                     FOR PERIOD(TIMESTAMP '2006-01-01 17:59:59',

                                TIMESTAMP '2006-01-05 18:00:00')) AS dt;

This statement returns the following 10 rows with a moving average over price.

stock_id --------	CAST(p AS DATE) ---------------	AVG(price) --------------------
1000	2006-01-01	9.00000000000000E 000
1000	2006-01-02	1.15000000000000E 001
1000	2006-01-03	1.26666666666667E 001
1000	2006-01-04	1.50000000000000E 001
1000	2006-01-05	1.56666666666667E 001
1001	2006-01-01	1.90000000000000E 001
1001	2006-01-02	2.15000000000000E 001
1001	2006-01-03	2.26666666666667E 001
1001	2006-01-04	2.50000000000000E 001
1001	2006-01-05	2.50000000000000E 001

You can produce the same result without using an EXPAND ON clause by joining with a calendar table, as in the following SELECT statement.

     SELECT stock_id, CAST(CAST(p AS TIMESTAMP) AS DATE),

            AVG(price) OVER (PARTITION BY stock_id

                             ORDER BY p ROWS

                             2 PRECEDING)

     FROM stk, (SELECT (calendar_date (FORMAT 'yyyy-mm-dd')) || ' ' ||

                FROM sys_callendar.calendar

                WHERE calendar_date BETWEEN DATE '2006-01-01'

                                    AND     DATE '2006-01-06') AS dt(p)

     WHERE BEGIN(validity) <= p

     AND   p < END(validity)) AS expnd;

Example : EXPAND ON for a WEEK_BEGIN Anchor Point

This example expands employee using a WEEK_BEGIN anchor point.

Assume the following table definition.

     CREATE SET TABLE employee, NO FALLBACK (

       eid       INTEGER,

       ename     CHARACTER(20) CHARACTER SET LATIN NOT CASESPECIFIC,

       jobperiod PERIOD(DATE))

     PRIMARY INDEX (eid);

Table employee contains the following single row.

employee

eid

---

ename

-----

jobperiod

---------

Xavier

2008-06-02,2008-06-24

Expand employee by WEEK_BEGIN.

     SELECT eid, ename, BEGIN(expd) AS tsp

     FROM employee

     EXPAND ON jobperiod expd BY ANCHOR WEEK_BEGIN;

In this example, each expanded row value starts on a Monday because the week starts on a Monday.

employee
eid ---	ename -----	tsp ---
1001	Xavier	2008-06-09
1001	Xavier	2008-06-16
1001	Xavier	2008-06-23

Example : EXPAND ON for a QUARTER_BEGIN Anchor Period

This example expands employee using a QUARTER_BEGIN anchor period.

     SELECT eid, ename, BEGIN(expd) AS tsp

     FROM employee

     EXPAND ON jobdperiod expd BY ANCHOR PERIOD QUARTER_BEGIN;

eid

---

ename

-----

tsp

---

Xavier

2008-04-01

Example : Join Before Expansion

This example shows how Teradata Database joins the tables specified in an EXPAND ON clause when the specified period expression specifies a column from a table that is not specified in the FROM clause (see step 5 of the EXPLAIN output, highlighted in boldface type).

     CREATE SET TABLE DR.t3, NO FALLBACK , NO BEFORE JOURNAL,

                             NO AFTER JOURNAL, CHECKSUM = DEFAULT (

       a  INTEGER,

       b  INTEGER,

       pd PERIOD(TIMESTAMP(6)))

     PRIMARY INDEX (a);

     CREATE SET TABLE DR.t4, NO FALLBACK, NO BEFORE JOURNAL,

                             NO AFTER JOURNAL, CHECKSUM = DEFAULT (

       x  INTEGER NOT NULL,

       y  INTEGER NOT NULL,

       pd PERIOD(DATE))

     PRIMARY INDEX (x);

     EXPLAIN SELECT expd

             FROM t4

             EXPAND ON t3.pd AS expd;

 *** Help information returned. 27 rows.

 *** Total elapsed time was 1 second.

Explanation

------------------------------------------------------------------------

  1) First, we lock a distinct DF2."pseudo table" for read on a

     RowHash to prevent global deadlock for DF2.t4.

  2) Next, we lock a distinct DF2."pseudo table" for read on a

     RowHash to prevent global deadlock for DF2.t3.

  3) We lock DF2.t4 for read, and we lock DF2.t3 for read.

  4) We do an all-AMPs RETRIEVE step from DF2.t4 by way of an

     all-rows scan with no residual conditions into Spool 3 (all_amps),

     which is duplicated on all AMPs.  The size of Spool 3 is estimated

     with low confidence to be 4 rows.  The estimated time for this

     step is 0.03 seconds.

  5) We do an all-AMPs JOIN step from DF2.t3 by way of an all-rows

     scan with no residual conditions, which is joined to Spool 3 (Last

     Use) by way of an all-rows scan. DF2.t3 and Spool 3 are joined

     using a product join, with a join condition of ("(1=1)").  The

     result goes into Spool 2 (all_amps), which is built locally on the

     AMPs.  The size of Spool 2 is estimated with low confidence to be

     4 rows (116 bytes).  The estimated time for this step is 0.05

     seconds.

  6) We do an all-AMPs RETRIEVE step from Spool 2 (Last Use) by way of

     an all-rows scan and EXPAND ON (DF2.t3.pd) into Spool 1

     (group_amps), which is built locally on the AMPs.  The size of

     Spool 1 is estimated with no confidence to be 120 rows (5,400

     bytes).

  7) Finally, we send out an END TRANSACTION step to all AMPs involved

     in processing the request.

  -> The contents of Spool 1 are sent back to the user as the result of

     statement 1.

Example : Nullified EXPAND Operation

This example shows how when an expanded column is not specified in the select list of a query, but a DISTINCT operator is specified, the EXPAND operation is nullified.

     CREATE SET TABLE df2.t1, NO FALLBACK, NO BEFORE JOURNAL,

                              NO AFTER JOURNAL, CHECKSUM = DEFAULT,

                              DEFAULT MERGEBLOCKRATIO (

       i  INTEGER,

       j  INTEGER,

       pd PERIOD(DATE) FORMAT 'yyyy-mm-dd')

     PRIMARY INDEX (i);

The first SELECT statement in this example does not specify the DISTINCT operator.

     EXPLAIN SELECT i,j

             FROM t1

             EXPAND ON pd AS expd BY INTERVAL '1' DAY;

 *** Help information returned. 17 rows.

 *** Total elapsed time was 1 second.

Explanation

------------------------------------------------------------------------

  1) First, we lock a distinct DF2."pseudo table" for read on a

     RowHash to prevent global deadlock for DF2.t1.

  2) Next, we lock DF2.t1 for read.

  3) We do an all-AMPs RETRIEVE step from DF2.t1 by way of an

     all-rows scan with no residual conditions into Spool 2 (all_amps),

     which is built locally on the AMPs.  The size of Spool 2 is

     estimated with low confidence to be 2 rows (74 bytes).  The

     estimated time for this step is 0.01 seconds.

  4) We do an all-AMPs RETRIEVE step from Spool 2 (Last Use) by way of

     an all-rows scan and EXPAND ON (DF2.t1.pd) into Spool 1

     (all_amps), which is built locally on the AMPs.  The size of Spool

     1 is estimated with no confidence to be 60 rows (2,580 bytes).

  5) Finally, we send out an END TRANSACTION step to all AMPs involved

     in processing the request.

  -> The contents of Spool 1 are sent back to the user as the result of

     statement 1.

The following SELECT statement does specify the DISTINCT operator. Note the difference in the EXPLAIN report, where you can see that this output is identical to the previous output with the exception that step 4 of the previous EXPLAIN output (highlighted in boldface type) is missing because the DISTINCT operator has nullified the effect of the EXPAND ON specification, so it is not performed.

     EXPLAIN SELECT DISTINCT i,j

             FROM t1

             EXPAND ON pd AS expd BY INTERVAL '1' day;

 *** Help information returned. 15 rows.

 *** Total elapsed time was 1 second.

Explanation

---------------------------------------------------------------------------

  1) First, we lock a distinct DF2."pseudo table" for read on a

     RowHash to prevent global deadlock for DF2.t1.

  2) Next, we lock DF2.t1 for read.

  3) We do an all-AMPs RETRIEVE step from DF2.t1 by way of an

     all-rows scan with no residual conditions into Spool 1 (all_amps),

     which is built locally on the AMPs.  Then we do a SORT to order

     Spool 1 by the sort key in spool field1 (DF2.t1.i, DF2.t1.j)

     eliminating duplicate rows.  The size of Spool 1 is estimated with

     low confidence to be 2 rows (102 bytes).  The estimated time for

     this step is 0.01 seconds.

  4) Finally, we send out an END TRANSACTION step to all AMPs involved

     in processing the request.

  -> The contents of Spool 1 are sent back to the user as the result of

     statement 1.  The total estimated time is 0.01 seconds.

Example : Null Expansion Period Producing a Null Expanded Value

This example shows how a null expansion period produces a null expanded value. In this example, the value for PERIOD(DATE) in column pd is null, so the expansion on pd, expd, is also null.

     CREATE SET TABLE DF2.t4, NO FALLBACK, NO BEFORE JOURNAL,

                              NO AFTER JOURNAL, CHECKSUM = DEFAULT (

       x  INTEGER NOT NULL,

       y  INTEGER NOT NULL,

       pd PERIOD(DATE))

     PRIMARY INDEX (x);

First show that column pd is null.

     SELECT *

     FROM t4;

      *** Query completed. One row found. 3 columns returned.

      *** Total elapsed time was 1 second.

      x            y           pd

     -----------  -----------  -----------------------------------------

              10           30  ?

Then show that the expansion on pd aliased as expd, is also null.

     SELECT x, expd

     FROM t4

     EXPAND ON pd AS expd;

      *** Query completed. One row found. 2 columns returned.

      *** Total elapsed time was 1 second.

               x  expd

     -----------  ------------------------------------------------

              10  ?

For More Information

For more information about the EXPAND ON clause, see Temporal Table Support.