Unweighted, Undirected Network (BUN) - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
9.02
9.01
2.0
1.3
Published
February 2022
Language
English (United States)
Last Update
2022-02-10

The local clustering coefficient was originally defined on an unweighted, undirected graph—also called a bi-directed network (BUN). A simple BUN has no self-loops and no multiple edges.

Let G = (V, E) be a simple BUN with a set of nodes (vertices) V and a set of edges E.

The degree di of node i is the number of nodes in V that are adjacent to i. A complete subgraph of three nodes of G is called a triangle. This is the formula for the number of triangles of node i:


δi = (1/2) Σj Σh aij aih ajh

where aij = 1 if there is an edge between nodes i and j; otherwise aij = 0. Because G is undirected, aij = aji.
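
The triangle count can be illustrated with a minimal Python sketch. This is not the LocalClusteringCoefficient implementation: it assumes a 0/1 adjacency matrix with a zero diagonal and the usual definition in which each unordered pair of mutually adjacent neighbors of i contributes one triangle. The function name count_triangles and the four-node sample graph are hypothetical.

```python
def count_triangles(adj, i):
    """Count triangles through node i in a simple undirected graph.

    adj is a symmetric 0/1 adjacency matrix (list of lists) with a zero
    diagonal, so self-loops and multiple edges cannot occur.
    """
    n = len(adj)
    total = 0
    for j in range(n):
        for h in range(n):
            # Nonzero only when i-j, i-h, and j-h are all edges.
            total += adj[i][j] * adj[i][h] * adj[j][h]
    # Each triangle is counted twice, once as (j, h) and once as (h, j).
    return total // 2


# Hypothetical sample graph with edges 0-1, 0-2, 1-2, 2-3.
adj = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
triangles = [count_triangles(adj, i) for i in range(len(adj))]
print(triangles)
```

In the sample graph, nodes 0, 1, and 2 form the only triangle, so each of them has a count of 1 and node 3 has a count of 0.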

A triple Ƴ at a node i is a path of length two for which i is the center node. This is the formula for the maximum number of triples of node i:


τi = di (di − 1) / 2

The Machine Learning Engine function LocalClusteringCoefficient uses this formula.

The maximum number of triples occurs when every neighbor of node i is connected to every other neighbor of node i.
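
As a hedged illustration, the sketch below computes the degree of each node and the corresponding maximum number of triples. It assumes that this maximum equals the number of unordered pairs of neighbors of node i, that is, di (di − 1) / 2. The function names degree and max_triples and the sample graph are hypothetical.

```python
def degree(adj, i):
    """Degree of node i: the number of nodes adjacent to i."""
    return sum(adj[i])


def max_triples(adj, i):
    """Number of unordered neighbor pairs of node i, d * (d - 1) / 2."""
    d = degree(adj, i)
    return d * (d - 1) // 2


# Hypothetical sample graph with edges 0-1, 0-2, 1-2, 2-3.
adj = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
degrees = [degree(adj, i) for i in range(len(adj))]
triples = [max_triples(adj, i) for i in range(len(adj))]
print(degrees, triples)
```

Node 2 has three neighbors and therefore three possible neighbor pairs, while node 3 has a single neighbor and no pairs at all.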

This is the formula for the clustering coefficient for a node i with di ≥ 2:

ci = δi / τi
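
The following minimal sketch combines both quantities into the coefficient for a small graph stored as adjacency sets. It is an illustration under stated assumptions, not the LocalClusteringCoefficient implementation: the name local_clustering, the sample graph, and the choice to return None for nodes with di < 2 (where the formula is not defined) are all hypothetical.

```python
from itertools import combinations


def local_clustering(neighbors, i):
    """Return delta_i / tau_i for node i, or None when the degree is below 2.

    neighbors maps each node to the set of nodes adjacent to it in a
    simple undirected graph.
    """
    nbrs = neighbors[i]
    d = len(nbrs)
    if d < 2:
        # The formula requires d >= 2; returning None is an assumption.
        return None
    tau = d * (d - 1) // 2
    # A triangle exists for each pair of neighbors that are themselves adjacent.
    delta = sum(1 for j, h in combinations(sorted(nbrs), 2) if h in neighbors[j])
    return delta / tau


# Hypothetical sample graph with edges 0-1, 0-2, 1-2, 2-3.
neighbors = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
coeffs = {i: local_clustering(neighbors, i) for i in neighbors}
print(coeffs)
```

Here nodes 0 and 1 have a coefficient of 1 because their two neighbors are connected to each other, node 2 has one triangle out of three possible neighbor pairs, and node 3 has no defined coefficient.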