In physics — and in data analysis — the fundamental question is: what shape does a field assume when we want to minimise its energy? We are not looking for a single point (as in gradient descent); we are looking for an entire function. This is the problem of the calculus of variations.
Define the informational energy functional of a field ψ(x):
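With α the smoothness coefficient and V(x) the data potential, the functional takes the form

$$
E[\psi] \;=\; \int \Bigl( \alpha\,\lvert\nabla\psi(x)\rvert^{2} \;+\; V(x)\,\lvert\psi(x)\rvert^{2} \Bigr)\, dx .
$$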
The two terms have clean interpretations. The kinetic term α|∇ψ|² is regularisation: a penalty for abrupt changes in the field between neighbouring data points. The coefficient α (in physics ℏ²/2m, in QIFT controlled by ℏZC) sets the smoothness scale. The potential term V(x)|ψ|² is the data landscape: where V(x) is low (dense clusters of records), the field ψ is attracted.
1. Variation. Add a perturbation δψ* and compute the change in energy:
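To first order in the perturbation,

$$
\delta E \;=\; \int \Bigl( \alpha\,\nabla\delta\psi^{*} \cdot \nabla\psi \;+\; V(x)\,\delta\psi^{*}\,\psi \Bigr)\, dx .
$$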
2. Integration by parts (Green’s identity). Transfer the gradient from δψ* to ψ:
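Assuming the boundary term vanishes (the field decays at the edges of the domain),

$$
\delta E \;=\; \int \delta\psi^{*} \Bigl( -\alpha\,\nabla^{2}\psi \;+\; V(x)\,\psi \Bigr)\, dx .
$$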
3. Functional derivative. Everything multiplying δψ* under the integral is the functional derivative:
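Explicitly,

$$
\frac{\delta E}{\delta\psi^{*}} \;=\; -\alpha\,\nabla^{2}\psi \;+\; V(x)\,\psi .
$$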
4. Stationary states. We seek an extremum subject to the normalisation constraint ∫|ψ|² dx = 1. The Lagrange multiplier λ yields an eigenvalue equation:
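Extremising E[ψ] − λ(∫|ψ|² dx − 1) gives

$$
-\alpha\,\nabla^{2}\psi \;+\; V(x)\,\psi \;=\; \lambda\,\psi .
$$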
This is the Schrödinger equation — but its meaning here is informational, not physical. We are looking for the eigenfunctions of a data operator. Each ψₙ is a “vibrational mode” of the dataset: ψ₀ captures the broadest trend, and higher modes pick up increasingly fine-grained correlations.
But here a crucial question arises: the derivation above assumes a continuous field ψ(x) living on a smooth manifold. We never start with that. We start with a data table — rows of records, columns of features, nothing more. The field and its geometry are objects we must construct, not given. The next section shows how.
The starting point of any data analysis is a table: N rows (records) and d columns (features). There is no field, no graph, no manifold in sight. These are constructions we impose on the data to reveal its structure.
The first construction is a similarity graph. Given a table X with rows x₁, …, xₙ, we define pairwise similarities — for instance, via a Gaussian kernel Aᵢⱼ = exp(−||xᵢ − xⱼ||² / 2σ²) — and obtain a weighted adjacency matrix A. The graph is not in the data; it is a lens through which we choose to view the data. Different kernels, different σ, different graphs — different structure revealed.
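A minimal numpy sketch of this construction. The median-distance bandwidth used here is one common heuristic, not something prescribed by the text:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))             # toy table: N=8 records, d=3 features

# Pairwise squared distances ||x_i - x_j||^2 via broadcasting
sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)

sigma = np.median(np.sqrt(sq[sq > 0]))  # bandwidth heuristic: median pairwise distance
A = np.exp(-sq / (2 * sigma**2))        # Gaussian kernel similarities
np.fill_diagonal(A, 0.0)                # no self-loops

# A is now a symmetric weighted adjacency matrix constructed from the table
```

Changing the kernel or σ changes A, and with it everything downstream: the graph really is a lens, not a property of the data.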
The second construction is the Graph Laplacian. With adjacency A and degree matrix D = diag(k₁, …, kₙ):
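In matrix form,

$$
L \;=\; D - A, \qquad L_{ij} \;=\; k_i\,\delta_{ij} \;-\; A_{ij}.
$$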
This operator acts on functions f : V → ℝ defined on the nodes of the constructed graph in exactly the same way that −∇² acts on continuous functions:
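Acting on a node function f,

$$
(Lf)(i) \;=\; \sum_{j} A_{ij}\,\bigl(f(i) - f(j)\bigr).
$$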
It measures how much the value at node i differs from its neighbours — the discrete second derivative. The variational equation from Part I becomes a matrix eigenvalue problem:
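With the continuous operator replaced by L,

$$
L\,\psi_n \;=\; \lambda_n\,\psi_n, \qquad 0 = \lambda_0 \le \lambda_1 \le \dots \le \lambda_{N-1}.
$$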
The eigenvectors ψₙ are the harmonic modes of the constructed graph. ψ₀ (λ₀ = 0) is constant — the mean field. ψ₁ (the Fiedler vector) optimally bisects the graph into two communities. Higher modes detect progressively finer structure.
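These properties can be checked on a toy graph. The two-triangle graph below is an illustrative choice of mine, not an example from the text:

```python
import numpy as np

# Toy graph: two triangles {0,1,2} and {3,4,5} joined by the bridge edge (2,3)
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
N = 6
A = np.zeros((N, N))
for i, j in edges:
    A[i, j] = A[j, i] = 1.0

L = np.diag(A.sum(axis=1)) - A       # combinatorial Laplacian L = D - A
lam, psi = np.linalg.eigh(L)         # eigenpairs, eigenvalues ascending

# lam[0] is (numerically) zero and psi[:, 0] is the constant mode
fiedler = psi[:, 1]                  # psi_1: the Fiedler vector
# Its sign pattern separates the two triangles: nodes 0-2 on one side, 3-5 on the other
```

For this graph the Fiedler vector has one sign on {0, 1, 2} and the opposite sign on {3, 4, 5}: the optimal bisection is exactly the two communities.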
It is tempting to say “we have a graph.” We do not. We have a table, and we constructed a graph from it. The Laplacian L inherits every modelling choice we made (kernel type, bandwidth, k-nearest-neighbours threshold). This is not a weakness — it is exactly the degree of freedom that QIFT exploits: different constructions correspond to different encodings of the data into the information field, and the choice of encoding determines which quantum features become accessible.
Now QIFT enters. In Quantum Information Field Theory, the adjacency matrix A — constructed from the data table — is promoted to the 1-Body Reduced Density Matrix (1-RDM) of the network information field. The 1-RDM is defined as the expectation value of the bosonic field operators: γᵢⱼ = ⟨a†ᵢ aⱼ⟩. This gives it two distinct components:
Off-diagonal elements are amplitudes of virtual excitations: quanta of correlation propagating between nodes. Diagonal elements are occupation numbers — how many excitations “rest” at each node. The graph we constructed from the data table is now reinterpreted as a field configuration, and degree is the expectation value of a number operator.
The key identity follows. The normalised Laplacian and the hopping part of the normalised 1-RDM (i.e. the off-diagonal block, which is the adjacency) share exactly the same eigenvectors, and their eigenvalues are related by a simple transformation:
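With $\gamma_{\text{norm}} = D^{-1/2} A\, D^{-1/2}$ (the hopping part) and $L_{\text{norm}} = I - D^{-1/2} A\, D^{-1/2}$,

$$
\lambda_n\bigl(L_{\text{norm}}\bigr) \;=\; 1 \;-\; \lambda_n\bigl(\gamma_{\text{norm}}\bigr).
$$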
We test this identity on six graph types: Karate Club, Erdős–Rényi, Barabási–Albert, star, cycle, and complete graph. In every case, λ(L_norm) = 1 − λ(γ_norm) holds to machine precision (~10⁻¹⁰).
This is not an approximation — it follows from the definition. But the consequence is deep: everything known from spectral graph theory (spectral clustering, diffusion maps, Laplacian eigenmaps) translates directly into the language of the 1-RDM. And the 1-RDM gives more — it carries quantum diagnostics that the Laplacian alone cannot provide.
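The identity is easy to check numerically. Here it is on a random graph generated for illustration (any graph without isolated nodes works):

```python
import numpy as np

rng = np.random.default_rng(1)
N = 20
# Random symmetric graph; a cycle backbone guarantees every degree >= 1
A = (rng.random((N, N)) < 0.2).astype(float)
A = np.triu(A, 1)
A = A + A.T
for i in range(N):
    A[i, (i + 1) % N] = A[(i + 1) % N, i] = 1.0

d = A.sum(axis=1)
Dinv_sqrt = np.diag(1.0 / np.sqrt(d))
gamma_norm = Dinv_sqrt @ A @ Dinv_sqrt   # normalised hopping part of the 1-RDM
L_norm = np.eye(N) - gamma_norm          # normalised Laplacian

lam_L = np.sort(np.linalg.eigvalsh(L_norm))
lam_g = np.sort(np.linalg.eigvalsh(gamma_norm))
# Identity: ascending lam(L_norm) matches 1 - lam(gamma_norm) in reverse order
err = np.max(np.abs(lam_L - (1.0 - lam_g[::-1])))
```

Since L_norm = I − γ_norm by construction, `err` sits at machine precision: the two operators share eigenvectors, and the eigenvalue map is exact.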
In 1991 Scott Feld observed something counterintuitive: most people have fewer friends than their friends do. The classical explanation is straightforward — it is a degree-weighted sampling bias:
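A random friend is a degree-weighted random node, so

$$
\mathbb{E}[k_{\text{friend}}] \;=\; \frac{\mathbb{E}[k^{2}]}{\mathbb{E}[k]} \;=\; \bar{k} \;+\; \frac{\operatorname{Var}(k)}{\bar{k}} \;\ge\; \bar{k}.
$$

The excess $\operatorname{Var}(k)/\bar{k}$ is the "FP excess" reported in the table below.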
But this explanation uses only the degree distribution — a one-dimensional shadow of a high-dimensional correlation structure. QIFT provides a deeper explanation through the spectral structure of the 1-RDM.
Define the off-diagonal coherence of the 1-RDM as the total squared magnitude of all inter-node amplitudes:
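In symbols,

$$
C_{\text{od}} \;=\; \sum_{i \ne j} \lvert\gamma_{ij}\rvert^{2}.
$$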
This quantity measures the total correlation power of the network field. It vanishes if and only if γ is diagonal — that is, if and only if nodes are informationally independent (a product state). When C_od is large, the field carries strong inter-node coherence.
However, C_od alone does not determine the paradox. For unweighted simple graphs, C_od = 2|E| = N·k̄, which depends only on edge count, not on the distribution of degrees. The friendship paradox is controlled by something finer: the spectral decomposition of the 1-RDM.
Decompose the 1-RDM into its eigenmodes: γ = Σₖ nₖ |φₖ⟩⟨φₖ|. Each eigenvalue nₖ is the occupation number of mode k, and each eigenvector φₖ is a collective oscillation pattern. Define the overlap of mode k with the uniform vector: sₖ = ⟨1|φₖ⟩ = Σᵢ φₖ(i)/√N. Then the variance of the degree sequence decomposes spectrally:
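One form consistent with these definitions (taking $k_i = \sum_j \gamma_{ij}$, with γ the adjacency) is

$$
\operatorname{Var}(k) \;=\; \sum_{k} n_k^{2}\, s_k^{2} \;-\; \Bigl( \sum_{k} n_k\, s_k^{2} \Bigr)^{2},
$$

which reproduces, for example, the star row of the table below: $\sum_k n_k^2 s_k^2 = 33$ and $\bar{k} = \sum_k n_k s_k^2 \approx 1.94$ give Var(k) ≈ 29.23.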
Each eigenmode contributes to the paradox in proportion to its squared occupation number nₖ², weighted by how strongly it couples to the uniform distribution. The paradox is large when a single mode dominates — when the network field undergoes bosonic condensation.
Define the condensate fraction as the ratio of the dominant 1-RDM eigenvalue to the sum of all eigenvalues:
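One normalisation consistent with the values reported below (the adjacency is traceless, so the plain eigenvalue sum would vanish) is

$$
f_1 \;=\; \frac{n_1}{\sum_k \lvert n_k \rvert}.
$$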
When f₁ is large, the network is dominated by a single mode — a “bosonic condensate” concentrated on the hubs. When f₁ → 0, the field is dispersed (homogeneous network, no paradox).
| Graph | k̄ | Var(k) | FP excess | n₁ | f₁ | η_G |
|---|---|---|---|---|---|---|
| Karate Club | 13.59 | 136.18 | 10.02 | 21.69 | 0.142 | 0.980 |
| Erdős–Rényi | 4.76 | 2.83 | 0.59 | 5.52 | 0.090 | 0.992 |
| Barabási–Albert | 3.76 | 12.89 | 3.42 | 5.82 | 0.129 | 0.983 |
| Star | 1.94 | 29.23 | 15.06 | 5.74 | 0.500 | 0.750 |
| Cycle | 2.00 | 0.00 | 0.00 | 2.00 | 0.046 | 0.998 |
| Complete | 33.00 | 0.00 | 0.00 | 33.00 | 0.500 | 0.750 |
The cycle and the complete graph both have Var(k) = 0 and FP = 0, but for diametrically opposite reasons. The cycle has f₁ ≈ 0.05 (dispersed spectrum, no condensation). The complete graph has f₁ = 0.5 (maximal condensation), but Var(k) = 0 because all degrees are identical. Classical statistics cannot distinguish these cases. The 1-RDM can — and the informational hardness η_G reflects this: 0.998 vs. 0.750.
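Under this reading of f₁ (dominant eigenvalue over the summed absolute eigenvalues of the adjacency), the contrast is reproducible in a few lines of numpy. N = 34 is inferred from the complete-graph row (k̄ = 33):

```python
import numpy as np

N = 34
# Cycle C_N: each node linked to its two neighbours
A_cycle = np.zeros((N, N))
for i in range(N):
    A_cycle[i, (i + 1) % N] = A_cycle[i, (i - 1) % N] = 1.0
# Complete graph K_N
A_complete = np.ones((N, N)) - np.eye(N)

def condensate_fraction(A):
    """f1 = dominant eigenvalue over the total spectral weight sum_k |n_k|."""
    n = np.linalg.eigvalsh(A)
    return n.max() / np.abs(n).sum()

f1_cycle = condensate_fraction(A_cycle)        # dispersed spectrum
f1_complete = condensate_fraction(A_complete)  # maximal condensation
```

Both graphs have Var(k) = 0, yet f₁ ≈ 0.046 for the cycle and f₁ = 0.5 for the complete graph: the spectral diagnostic separates what the degree statistics cannot.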
We have traversed the full circle: from a data table to a constructed graph, from the graph to its Laplacian, from the Laplacian to the 1-RDM of the information field, and from its spectrum back to an observable property of the data.
The calculus of variations says: “find the smoothest function on the graph that is compatible with the data.” The solutions are the Laplacian eigenmodes — and simultaneously the eigenmodes of the 1-RDM of the information field. The Friendship Paradox is not a combinatorial curiosity — it is a spectral observable of the network field, driven by condensation of bosonic excitations into the dominant eigenmode.
But this is only the beginning. The Graph Laplacian is a single self-adjoint operator, so everything built from it commutes. When we replace it with the non-commutative Hamiltonian of QIFT — with the full CCR structure and operators Q̂, P̂ — we enter a regime where eigenmodes become quantum in the sense of Haag: unitarily inequivalent to classical ones, capable of interference inaccessible to any Graph Laplacian.
And it all begins with a table.