Modularity Syntax Elements - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
8.10
1.1
Published
October 2019
Language
English (United States)
Last Update
2019-12-31
dita:mapPath
ima1540829771750.ditamap
dita:ditavalPath
jsj1481748799576.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢
CommunityEdgeTable
[Optional] Specify the name of the table that the function creates to output the weights of the edges between the communities at different resolution levels. If you omit this syntax element, the function does not create this table.
Tip: Teradata recommends using this element only if this information is required. Creating this table can greatly increase function run time.
TargetKey
Specify the key of the target vertex of an edge. The key consists of the names of one or more Edges table columns.
EdgeWeight
[Optional] Specify the name of the Edges table column that contains edge weights. Each edge_weight is a positive value.
Default behavior: The weight of each edge is 1 (that is, the graph is unweighted).

This syntax element determines how the function treats duplicate edges (that is, edges with the same source and destination, which might have different weights). For a weighted graph, the function treats duplicate edges as a single edge whose weight is the sum of the weights of the duplicate edges. For an unweighted graph, the function uses only one of the duplicate edges.

CommunityAssociation
[Optional] Specify the name of the column that represents the community association of the vertices. Use this syntax element if you know some vertex communities.
Resolution
[Optional] Specify hierarchical-level information for the communities. For details, see Resolution. If you specify a list of resolution values, the function incrementally finds the communities for each value and for the default value.
Each resolution must be a distinct DOUBLE PRECISION value in the range [0.0, 1000000.0]. The value 0.0 puts each node in its own community of size 1. You can specify a maximum of 500 resolution values.
Default: 1.0
Seed
[Optional] Specify the random seed the algorithm uses for repeatable results. The seed must be a LONG value greater than or equal to 1. The algorithm multiplies seed by the hash code of vertex_key to create a unique seed for each vertex.
For repeatable results, use both the Seed and UniqueID syntax elements. For more information, see Nondeterministic Results and UniqueID Syntax Element.
The function also uses seed for these purposes:
  • To break ties between different vertices during community formation
  • To determine when to stop community formation

    If seed is 1 (default value), community formation continues until the change in modularity score drops below a threshold value. The threshold value is set in the function code; you cannot change it.

    If seed is set to any other value, community formation continues until the change in modularity score between iterations is 0. This deeper analysis of the graph can improve community formation, but can also increase execution time.

Default: 1
Accumulate
[Optional] Specify the names of the Vertices columns to copy to the community Vertices table.
Default behavior: The function copies the vertex_key columns to the output Vertices table for each vertex, changing the column names to id, id_1, id_2, and so on.