Sequential RC-TGAN: Generating Relational Time Series with Spectral Envelope Loss
^†^†thanks: This work was supported by Mitacs through the Mitacs Accelerate program.

Mohamed Gueye^1,2, Yazid Attabi¹, Manuel Morales², and Maxime Dumas¹

Abstract

The generation of synthetic relational databases often involves modeling complex temporal dynamics, such as transaction logs or event sequences. A significant challenge in this domain is the handling of categorical time series (e.g., status codes), where standard encoding methods like one-hot encoding fail to capture intrinsic frequency-domain features such as seasonality and cyclicity. In this paper, we introduce Sequential RC-TGAN (Seq. RC-TGAN), a temporal extension of the RC-TGAN framework, equipped with a novel integrated loss function based on the Spectral Envelope Theory. This differentiable loss allows the generator to directly optimize the preservation of latent periodic structures via backpropagation. While spectral envelope theory is inherently designed for categorical sequences, we extend this frequency-domain regularization to continuous time series by employing a Variational Gaussian Mixture Model (VGM) discretization strategy. To establish a mathematically rigorous evaluation standard, we simulate categorical time series governed by a parameter $\alpha$ , with exactly known theoretical spectral envelopes. Integrating these dynamic sequences into the child tables of a relational database yields a robust ground-truth benchmark for evaluating the frequency-domain fidelity of our generative framework. Furthermore, we address the lack of robust evaluation standards for relational time series by proposing two new metrics: Spectral Density Divergence and Spectral Envelope Divergence. Experimental results on real-world datasets, as well as our simulated benchmarks, demonstrate that our end-to-end approach significantly outperforms state-of-the-art systems in reproducing cyclic patterns and long-term seasonality across both categorical and continuous features.

I Introduction

Synthetic data generation has rapidly evolved from a niche privacy-preserving technique into a foundational pillar of modern machine learning, addressing critical bottlenecks related to data scarcity and algorithmic fairness while circumventing stringent privacy regulations. Early generative paradigms primarily focused on static, single-table tabular data [21, 5, 15]. Architectures such as TabGPT [10] and Tabular Transformer GAN (TT-GAN) [3] adapted NLP techniques to generate tabular rows via autoregressive next-token prediction.

In practice, contemporary enterprise data is predominantly structured within complex Relational Databases (RDBs), consisting of interconnected networks of tables governed by strict primary key (PK) and foreign key (FK) constraints. Consequently, multi-table generation models have emerged to address this structural complexity. Early approaches to relational data generation relied on statistical baselines, such as the Synthetic Data Vault (SDV) [12], which utilized hierarchical Gaussian copulas to model cross-table distributions. The transition to deep learning in this domain was pioneered by the Row Conditional Tabular GAN (RC-TGAN) [1], which leveraged Generative Adversarial Networks (GANs) to explicitly maintain referential integrity between parent and child tables. More recently, diverse architectures have been introduced, including transformer-based sequence-to-sequence models like REaLTabFormer [17], standard tabular diffusion models like ClavaDDPM [11], and graph-based diffusion frameworks such as RelDiff [2]. Despite their structural sophistication, these relational models are fundamentally static. They treat data as fixed snapshots and obliterate the complex longitudinal dynamics of multivariate time series (e.g., financial transaction logs) embedded within these schemas.

Synthesizing dynamic temporal sequences that are structurally embedded within a relational database requires conditioning child time series trajectories on the static parent table. Models such as TimeGAN [22] and DoppelGANger [7] pioneered this domain. However, these methods operate strictly in the time domain and are not focused on categorical time series, which are ubiquitous in real-world relational tables. Because standard representations like one-hot encoding map categories to orthogonal, equidistant vectors (where the Euclidean distance is always $\sqrt{2}$ ), the neural network becomes completely blind to ordinal, hierarchical, or periodic relationships, preventing the generator from understanding the cyclical nature of categorical states. Apprehending these complex cyclical patterns in the time domain is inherently difficult, often leading to models missing crucial structural amplitudes. Consequently, enriching this sequence analysis with the frequency domain provides a significantly better approach, allowing the model to explicitly uncover and optimize the latent periodic structures underlying the discrete categories.

To overcome the intersecting limitations of time-domain optimization and categorical data modeling, we propose a profound paradigm shift by directly integrating Spectral Envelope Theory [18, 19, 16] into a relational generative architecture. We introduce Sequential RC-TGAN (Seq. RC-TGAN), equipped with a novel, differentiable spectral envelope loss that explicitly exploits the frequency domain to optimize the generator’s ability to model the complex pattern of categorical time series. The principle of spectral envelope in this context is to find an optimal scalar transformation that maximizes the spectral density of categorical time series; through this principle, we successfully translate discrete categories into continuous numerical representations. While recent literature has also pivoted toward frequency-domain regularization with architectures such as the Frequency-Markov Diffusion GAN (FMD-GAN) [8], FDEDiff [23], and TIFO [13] introducing highly innovative frequency-aware denoising, these methods are primarily designed for single-table generation and continuous time series, processing categorical time series using standard one-hot encoding representations. Consequently, they fail to resolve the challenge of categorical periodicity, as they cannot natively assign spectral meaning to orthogonal vectors without extensive feature engineering.

Our main contributions are as follows:

•

We integrate spectral envelope theory into a conditional sequential GAN framework by introducing a novel spectral loss term ( $\mathcal{L}_{spec}$ ). This loss explicitly minimizes the distance between the spectral envelopes of real and synthetic data, overcoming the orthogonality of one-hot encodings to preserve latent periodic structures in categorical time series.
•

We extend this spectral methodology to continuous numerical features by employing beforehand a discretization strategy based on Gaussian Mixture Models (GMM) [21], allowing the spectral envelope to capture and enforce frequency-domain features across mixed data types simultaneously.
•

We analytically derive the exact theoretical spectral envelope for Markov chains governed by circulant transition matrices. This provides a mathematically tractable and rigorous "gold standard" benchmark simulated dataset to evaluate the frequency-domain fidelity of sequential generative models without relying on empirical periodograms.
•

We propose a new set of evaluation metrics rooted in spectral analysis : Spectral Density Divergence ( $\overline{\mathcal{D}}_{spec}$ ) and Spectral Envelope Divergence ( $\overline{\mathcal{D}}_{env}$ ). These metrics are designed to rigorously assess the temporal fidelity and cyclic consistency of generated continuous and categorical time series, addressing the blindspots of traditional time-domain metrics.

The remainder of this paper is organized as follows. Section II provides the necessary background on spectral analysis. Section III formulates spectral envelopes within a metric space. Section IV introduces the proposed GAN framework for multi-table time series synthesis. Section V outlines the design of simulated data for our experiments. Section VI defines the new evaluation metrics based on spectral analysis. Finally, Section VII presents the experimental setup and results, followed by concluding remarks.

II Background on Spectral Analysis

II-A Spectral Density

Let $\{S_{t},t\in\mathbb{Z}\}$ be a weakly stationary process with value on $\mathbb{R}$ , with mean $\mu\in\mathbb{R}$ and autocovariance function $\gamma(h)=\text{cov}(S_{t+h},S_{t})$ . The spectral density $f(\omega)$ describes how the variance of the process is distributed across frequencies $\omega\in[-1/2,1/2]$ . The spectral density is defined as the Fourier transform of the autocovariance:

f(\omega)=\sum_{h=-\infty}^{\infty}\gamma(h)e^{-2\pi i\omega h}.

(1)

Inversely : $\gamma(h)=\int_{-1/2}^{1/2}{f(\omega)e^{2\pi i\omega h}d\omega}.$

In practice, for a finite time series $\{s_{1},\dots,s_{T}\}$ , the spectral density is estimated using the periodogram $I(\omega_{k})$ , calculated at Fourier frequencies $\omega_{k}=k/T$ :

I(\omega_{k})=\left|d(\omega_{k})\right|^{2}=\frac{1}{T}\left|\sum_{t=1}^{T}s_{t}e^{-2\pi i\omega_{k}t}\right|^{2},

where $d(\omega_{k})=\frac{1}{\sqrt{T}}\sum_{t=1}^{\prime}s_{t}e^{-2\pi i\omega_{k}t}$ , the Discrete Fourier Transform (DFT).

The concept of spectral density can be extended to the multivariate case. Let $\{\textbf{S}_{t}\}_{t\in\mathbb{Z}}$ be a weakly stationary process in $\mathbb{R}^{q}$ with mean $\mu$ and autocovariance matrix $\Gamma(h)=\mathbb{E}[(\textbf{S}_{t+h}-\mu)(\textbf{S}_{t}-\mu)^{\prime}]$ . The $(j,p)$ -th entry of this matrix is the cross-covariance function $\gamma_{jp}(h)=\text{cov}(\textbf{S}_{j,t+h},\textbf{S}_{p,t})$ , which measures the covariance between component $j$ at time $t+h$ and component $p$ at time $t$ .
The spectral density matrix $\textbf{f}(\omega)\in\mathbb{C}^{q\times q}$ is defined as the Fourier transform of $\Gamma(h)$ :

\textbf{f}(\omega)=\sum_{h=-\infty}^{\infty}\Gamma(h)e^{-2\pi i\omega h},\quad\omega\in[-1/2,1/2].

The diagonal elements $\textbf{f}_{jj}(\omega)$ represent the univariate spectral densities, while the off-diagonal elements $\textbf{f}_{jp}(\omega)$ denote the cross-spectral densities.
For a finite observation $\{\textbf{s}_{1},\dots,\textbf{s}_{T}\}$ , the spectral density is estimated using the periodogram. Let $\textbf{d}(\omega_{k})\in\mathbb{C}^{q}$ be the Discrete Fourier Transform (DFT) at frequency $\omega_{k}=k/T$ :

\textbf{d}(\omega_{k})=\frac{1}{\sqrt{T}}\sum_{t=1}^{T}\textbf{s}_{t}e^{-2\pi i\omega_{k}t}.

The multivariate periodogram matrix $\textbf{I}(\omega_{k})$ is defined as the outer product:

\textbf{I}(\omega_{k})=\textbf{d}(\omega_{k})\textbf{d}(\omega_{k})^{*},

(2)

where ^∗ denotes the conjugate transpose. While $\textbf{I}(\omega_{k})$ is an asymptotically unbiased estimator of $\textbf{f}(\omega_{k})$ , it is not consistent; its variance does not vanish as $T\to\infty$ . Consequently, consistent estimation requires smoothing techniques, such as windowing or averaging over frequency bands.

II-B Spectral Envelope for Categorical Time Series

Consider a categorical time series $X_{t}$ taking values in a finite set $\textbf{a}=\{a_{0},\dots,a_{K-1}\}$ that is stationary. Because standard frequency-domain tools cannot be directly applied to discrete qualitative symbols, we assign a vector of numerical scaling values $\beta=(\beta_{0},\dots,\beta_{K-1})^{\prime}\in\mathbb{R}^{K}$ to the categories in a. This transformation results in a real-valued numerical process, denoted $X_{t}(\beta)\in\mathbb{R}$ , where $X_{t}(\beta)=\beta_{k}$ whenever the original series is in state $X_{t}=a_{k}$ . By explicitly mapping the qualitative categories to quantitative scalars, we convert the discrete sequence into a standard univariate continuous-state time series. This mathematical conversion is a strict prerequisite, as it enables the calculation of autocovariance functions and the subsequent computation of the spectral density via the Fourier transform.

Instead of assigning arbitrary numbers to categories, the spectral envelope framework systematically derives optimal numerical values that expose hidden periodicities within a categorical time series. The primary objective is to find a scaling vector $\beta$ that maximizes the spectral density relative to the total variance at each specific frequency $\omega$ . Formally, the Spectral Envelope $\lambda(\omega)$ is defined as:

\lambda(\omega)=\sup_{\beta\not\propto\mathbf{1}}\left\{\frac{f(\omega;\beta)}{\sigma^{2}(\beta)}\right\},\quad\forall\omega\in[-1/2,1/2],

(3)

where $f(\omega;\beta)$ and $\sigma^{2}(\beta)$ represent the spectral density and the variance of the transformed numerical process $X_{t}(\beta)$ , respectively [18]. The condition $\beta\not\propto\mathbf{1}$ explicitly excludes trivial scalings where every category is assigned the exact same numerical value. If $\beta$ is proportional to a vector of all ones ( $\beta\propto\mathbf{1}$ ), the transformed sequence $X_{t}(\beta)$ would merely become a flat, constant series. This would result in a variance of zero ( $\sigma^{2}(\beta)=0$ ), thereby rendering the objective ratio undefined.

This optimization problem can be solved by representing the categorical process as a multivariate point process $Y_{t}\in\mathbb{R}^{K}$ (using one-hot vectors). Let $f_{Y}(\omega)$ be the spectral density matrix and $V_{Y}$ be the variance matrix of stationary process $Y_{t}$ . The optimization problem in (3) can be re-written :

\lambda(\omega)=\sup_{\beta\not\propto\mathbf{1}}\left\{\frac{\beta^{\prime}f_{Y}(\omega)\beta}{\beta^{\prime}V_{Y}\beta}\right\},\quad\forall\omega\in[-1/2,1/2].

(4)

This expression is a generalized Rayleigh quotient. The solution $\lambda(\omega)$ is the largest eigenvalue of $f_{Y}(\omega)$ in the metric of $V_{Y}$ . The corresponding eigenvector $\beta(\omega)$ is called the optimal scaling at frequency $\omega$ .
The value $\lambda(\omega)$ is called the spectral envelope because it envelopes the normalized spectrum of any scaled process $X_{t}(\beta)$ . In other words, for any normalized scaling $\beta$ (such that $\sigma^{2}(\beta)=1$ ), we have $f(\omega;\beta)\leq\lambda(\omega)$ , with equality achieved if and only if $\beta$ is proportional to the optimal scaling $\beta(\omega)$ .

While the spectral envelope provides a robust mechanism for uncovering the latent periodicities of a single categorical time series, leveraging this concept within a deep generative framework requires systematically comparing the structural properties of real and synthesized processes. To formulate a differentiable objective that minimizes the frequency-domain discrepancy between these temporal dynamics, we cannot merely view $\lambda(\omega)$ as a collection of point-wise maxima. Instead, we must formalize the spectral envelope as a distinct mathematical object residing within a well-defined functional space. This theoretical shift naturally motivates the construction of a metric space for spectral envelopes, providing the foundational distance metrics required to optimize our generative model via backpropagation.

III A Metric Space Formulation for Spectral Envelopes

Consider a stationary categorical process $X_{t}$ taking values in the finite set a with spectral envelope $\lambda(\omega)$ . Let $X^{(\theta)}_{t}$ be a parametric stationary categorical process (e.g., a synthetic process generated by a model) with values in a, parameters $\theta$ , and spectral envelope $\lambda^{(\theta)}(\omega)$ .

In a generative context, the goal is to ensure that the synthetic process $X^{(\theta)}_{t}$ approximates the real process $X_{t}$ . A fundamental question arises: how can we quantify the discrepancy between these processes in the frequency domain? By defining a metric distance between $\lambda(\omega)$ and $\lambda^{(\theta)}(\omega)$ , we can formulate an optimization problem where minimizing this distance with respect to $\theta$ forces the synthetic process to recover the latent periodic structures of the real data. We first formalize the space in which these spectral envelopes reside.

Definition 1.

Let $\mathcal{S}_{K}$ be the set of spectral envelopes corresponding to stationary categorical processes with $K$ categories that possess a continuous spectral density matrix associated with their one-hot encoding representation (i.e. $f_{Y}(\omega)$ ).

Lemma 1.

Every element $\lambda\in\mathcal{S}_{K}$ is a continuous function on the interval $[-1/2,1/2]$ .
The proof of this lemma is in the appendix.

Consequently, $\mathcal{S}_{K}$ is a subset of $C^{0}\left([-1/2,1/2]\right)$ , the space of continuous functions on the fundamental frequency domain. This inclusion implies that $\mathcal{S}_{K}$ resides within the Hilbert space $L^{2}\left([-1/2,1/2]\right)$ .

The Hilbert space $L^{2}\left([-1/2,1/2]\right)$ consists of square-integrable functions defined on $[-1/2,1/2]$ equipped with the inner product:

\langle h,g\rangle=\int_{-1/2}^{1/2}{h(\omega)g(\omega)d\omega},\quad\forall h,g\in L^{2}.

This induces the $L^{2}$ norm, representing the total energy of the function:

\|h\|_{2}=\sqrt{\langle h,h\rangle}=\sqrt{\int_{-1/2}^{1/2}{h(\omega)^{2}d\omega}}.

From this functional space definition, we derive metrics to measure the distance between the real spectral envelope $\lambda$ and the synthetic spectral envelope $\lambda^{(\theta)}$ :

\|\lambda-\lambda^{(\theta)}\|_{2}=\sqrt{\int_{-1/2}^{1/2}{\left(\lambda(\omega)-\lambda^{(\theta)}(\omega)\right)^{2}d\omega}}

(5)

The $L^{2}$ distance in (5) aggregates the error over the entire frequency domain. It is differentiable (assuming $\lambda^{(\theta)}$ is differentiable with respect to $\theta$ ) and provides non-zero gradients for deviations across all frequencies simultaneously. This "smoothness" makes the $L^{2}$ metric significantly tractable as a loss function for backpropagation in deep neural networks. Therefore, we adopt the square of the $L^{2}$ distance as our objective function to minimize the divergence between the real and synthetic spectral envelopes.

Lemma 2.

For all $\lambda\in\mathcal{S}_{K}$ , the following norm properties hold:

(i)

$1\leq\|\lambda\|_{1}\leq K-1$ .
(ii)

$1\leq\|\lambda\|_{2}<\infty$ .

The proof of this lemma is in the appendix.

The $L^{1}$ upper bound ( $\|\lambda\|_{1}\leq K-1$ ) reflects the dimensionality constraint of a categorical variable with $K$ states, where the rank of the associated variance-covariance matrix is at most $K-1$ . Regarding the lower bound $\|\lambda\|_{2}\geq 1$ , this property constitutes a fundamental energy constraint for any non-trivial stationary process. Because the spectral density $f(\omega)$ decomposes the total variance of the process across frequencies, the integral of $f(\omega)$ must equal the variance $\gamma(0)$ . Given that $\lambda(\omega)$ is defined as the supremum that envelopes the normalized spectrum of any scaled process, its integral (the $L^{1}$ norm) cannot be less than the variance of a standardized process ( $\sigma^{2}=1$ ). By the relationship between norms on a compact domain of length 1, we have $\|\lambda\|_{2}\geq\|\lambda\|_{1}\geq 1$ . This lower bound represents the "white noise" baseline where the spectral mass is uniformly distributed. In the context of deep learning, this ensures the loss function is anchored; the generator cannot minimize the spectral distance by simply reducing the synthetic process to a trivial or zero-variance state, as it must maintain the minimum spectral energy inherent to a categorical distribution.

IV Conditional GAN for Multi-table Time Series Synthesis

IV-A Formalization and Notation

Our formulation is grounded in the Probabilistic Relational Model (PRM) framework [4]. We consider a relational schema $\mathcal{S}=\{W,U\}$ containing two classes (tables) $W$ and $U$ , where $W$ acts as the parent entity and $U$ as the child entity.

Let $\mathcal{A}(U)$ denote the set of attributes for table $U$ . This set is partitioned into continuous attributes $\mathcal{A}_{cont}(U)=\{c_{1},\dots,c_{I}\}$ and categorical attributes $\mathcal{A}_{cat}(U)=\{d_{1},\dots,d_{J}\}$ . The attribute space (or domain) for $U$ is defined as the Cartesian product of the domains of its individual attributes: $\mathcal{V}(U)=\bigotimes_{A\in\mathcal{A}(U)}\mathcal{V}(A)$ . Similarly, we define $\mathcal{V}(W)$ as the attribute space for the parent table $W$ .

In this relational structure, specific dependencies exist between instances of $W$ and $U$ . Let $w\in W$ denote a specific row (instance) in the parent table, with feature values $w.\mathcal{A}\in\mathcal{V}(W)$ . We define $\text{Children}(w)\subset U$ as the set of child rows in table $U$ that reference the parent $w$ .

In the context of time series synthesis, the set $\text{Children}(w)$ is not merely a bag of rows but an ordered sequence associated with the parent entity. We denote this sequence as $\text{Children}(w).\mathcal{A}=(u_{1},\dots,u_{T})$ , where each $u_{t}\in\mathcal{V}(U)$ represents the state of the child entity at time step $t$ , and $T$ is the sequence length. Thus, the dataset consists of tuples $\left(w.\mathcal{A},\text{Children}(w).\mathcal{A}\right)$ , pairing static parent features with dynamic child sequences.

IV-B Sequential RC-TGAN Architecture

To address the challenge of generating relational time series, we introduce the Sequential RC-TGAN (see Fig. 1), an extension of the Row Conditional-TGAN (RC-TGAN) [1] model enhanced by the temporal dimension modeling. The original RC-TGAN primarily focused on modeling inter-table relationships, employing a generator $\mathcal{G}$ to model the conditional distribution of a single child row given its parent: $\mathbb{P}(u|w.\mathcal{A})$ .

The Sequential RC-TGAN adapts this paradigm to support inter-row relationships modeling inside a tabular data. Rather than mapping parent feature values $w.\mathcal{A}$ and a noise vector $z$ to a static point in the feature space $\mathcal{V}(U)$ , our generator learns to map them to a temporal trajectory within $\mathcal{V}(U)^{T}$ . Formally, the model approximates the conditional joint distribution of the child sequence given the parent attributes:

\mathbb{P}(\text{Children}(w).\mathcal{A}|w.\mathcal{A})=\mathbb{P}(u_{1},\dots,u_{T}|w.\mathcal{A}).

(6)

This formulation ensures that the generation process is explicitly conditioned on the static characteristics of the parent entity, thereby guaranteeing that the synthesized temporal dynamics remain consistent with their relational context.

IV-B1 Conditional Recurrent Generator

To capture temporal dependencies effectively, we replace the fully connected layers of the original RC-TGAN with a Recurrent Neural Network (RNN) generator [9].

The generation process is conditioned on the static parent attributes $w.\mathcal{A}$ at every time step, ensuring the generated sequence adheres to the specific constraints of the parent entity. At each time step $t$ , the generator receives a concatenated input consisting of a random noise vector $z_{t}\sim\mathcal{N}(0,I)$ and the parent vector $w.\mathcal{A}$ :

	$\displaystyle h_{t}$	$\displaystyle=\text{RNN}(h_{t-1},[z_{t}\oplus w.\mathcal{A}])$		(7)
	$\displaystyle\hat{u}_{t}$	$\displaystyle=\text{MLP}(h_{t})$		(8)

where $\oplus$ denotes concatenation, $h_{t}$ represents the hidden state, and $\hat{u}_{t}$ is the generated attribute vector at time $t$ . By reinjecting $w.\mathcal{A}$ at each step, this architecture ensures that the static relational constraints (e.g., Store Type, Location) exert a persistent influence over the entire dynamic trajectory of the child sequence.

IV-B2 Conditional Discriminator

In contrast to the generator, the discriminator $D_{\phi}$ is implemented as a fully connected network (MLP) designed to assess the global coherence of the sequence. It models the joint probability of the entire sequence conditioned on the parent attributes.

Assuming a fixed sequence length $T$ during training, the input to the discriminator is constructed by flattening the sequence $\{u_{1},\dots,u_{T}\}$ into a single vector and concatenating it with the parent attributes $w.\mathcal{A}$ . The discriminator then maps this joint representation $[u_{1}\oplus\dots\oplus u_{T}\oplus w.\mathcal{A}]$ to a validity score, determining whether the complete temporal trajectory constitutes a plausible instance given the specific parent context.

Refer to caption — Figure 1: Architecture schema of the Sequential RC-TGAN with Spectral Loss. The diagram illustrates the generation process conditioned on parent attributes, and the dual optimization setup where the generator receives adversarial feedback from the discriminator and frequency-domain feedback via the spectral envelope loss ( $\mathcal{L}_{spec}$ ).

IV-C Spectral Adaptation for Continuous Features

The spectral envelope theory in [18] is inherently designed for categorical time series. However, relational datasets frequently contain continuous numerical attributes $\mathcal{A}_{cont}(U)=\{c_{1},\dots,c_{I}\}$ that exhibit significant periodic behavior (e.g., sales volume, temperature). To incorporate these attributes into our frequency-domain regularization, we first employ a discretization strategy based on Variational Gaussian Mixture Models (VGM) [21].

For each continuous attribute $c_{i}\in\mathcal{A}_{cont}(U)$ , we fit a VGM to the training data to estimate the optimal number of modes $K_{c_{i}}$ and their parameters. The probability distribution of a value $u_{t,c_{i}}$ is modeled as a mixture of Gaussians:

\mathbb{P}(u_{t,c_{i}})=\sum_{k=1}^{K_{c_{i}}}\tau_{k}\mathcal{N}(u_{t,c_{i}};\mu_{k},\sigma_{k}).

(9)

To compute the spectral envelope for a continuous sequence $\mathbf{u}_{c_{i}}=(u_{1,c_{i}},\dots,u_{T,c_{i}})$ , we transform it into a discrete sequence of mode indicators $\mathbf{m}_{c_{i}}=(m_{1,c_{i}},\dots,m_{T,c_{i}})$ . At each time step $t$ , the value $u_{t,c_{i}}$ is assigned to the mode $k$ that maximizes the posterior probability:

m_{t,c_{i}}=\arg\max_{k}\left(\tau_{k}\mathcal{N}(u_{t,c_{i}};\mu_{k},\sigma_{k})\right).

(10)

This process effectively maps the continuous domain $\mathcal{V}(c_{i})$ to a finite categorical set $\{1,\dots,K_{c_{i}}\}$ . Consequently, we can calculate the spectral envelope $\lambda(\omega;c_{i})$ on this discretized sequence, allowing the spectral loss $\mathcal{L}_{spec}$ to enforce periodic consistency across both naturally categorical attributes $\mathcal{A}_{cat}(U)$ and discretized continuous attributes $\mathcal{A}_{cont}(U)$ .

Beyond this discrete mode assignment, each continuous value is concurrently represented by a normalized scalar that captures its relative position within the assigned mode. Specifically, if the value $u_{t,c_{i}}$ is assigned to mode $k$ , we compute an intra-mode scalar $v_{t,c_{i}}=\frac{u_{t,c_{i}}-\mu_{k}}{4\sigma_{k}}$ . By concatenating the one-hot encoded discrete mode indicator $m_{t,c_{i}}$ with this normalized continuous scalar $v_{t,c_{i}}$ , the model retains the complete information necessary to fully reconstruct the original continuous feature $u_{t,c_{i}}$ . Therefore, while the categorical mode sequence $\mathbf{m}_{c_{i}}$ explicitly drives the frequency-domain regularization via the spectral envelope, the supplementary scalar sequence $v_{t,c_{i}}$ ensures no loss of localized continuous variance in the time domain.

Note that another way to incorporate the continuous attributes into frequency domain is to use the power spectrum of the signal. However as it will be shown in the ablation study section, we find that the discretization method is more effective.

IV-D Generator Losses

The training of the generator is guided by a hybrid objective function designed to satisfy two complementary requirements: global statistical realism (via adversarial feedback) and frequency-domain fidelity (via spectral envelope matching).

IV-D1 Adversarial Loss ( $\mathcal{L}_{adv}$ )

The primary objective of the generator is to produce relational sequences that are indistinguishable from real data. To achieve stable training dynamics, we employ the Wasserstein GAN (WGAN) objective.

Let $\mathbb{P}_{r}$ denote the real data distribution and $\mathbb{P}_{g}$ the generator distribution conditioned on parent attributes $w.\mathcal{A}$ . The discriminator $D$ (or critic) aims to maximize the divergence between its scoring of real and synthetic sequences. Conversely, the generator $G$ minimizes this divergence. The adversarial loss for the generator is defined as:

\mathcal{L}_{adv}=-\mathbb{E}_{\textbf{z}\sim p(\textbf{z}),w.\mathcal{A}\sim p(w.\mathcal{A})}\left[D_{\phi}(G_{\theta}(\textbf{z},w),w.\mathcal{A})\right].

(11)

Minimizing this term encourages the generator to capture general temporal correlations and the joint distribution of the sequence conditioned on the parent $w$ .

IV-D2 Spectral Envelope Loss ( $\mathcal{L}_{spec}$ )

Standard adversarial losses often fail to capture frequency patterns in categorical time series because discriminators tend to focus on local transitions rather than global frequency structures. To remedy this, we introduce a regularization term based on the spectral envelope.

Sequence-wise Spectral Estimation: Since the spectral envelope is a statistical property, we estimate it over mini-batches to ensure stability. Let $\mathcal{B}=\{\mathbf{u}^{(1)},\dots,\mathbf{u}^{(B)}\}$ be a mini-batch of $B$ sequences. For a specific categorical feature $d_{j}$ , we compute the spectral envelope $\lambda\left(\omega;\mathbf{u}^{(b)}_{d_{j}}\right)$ for the $b$ -th sequence at frequency $\omega$ (as defined in (3)).

We calculate the mean spectral envelope for the real batch, $\bar{\lambda}^{(real)}$ , and the synthetic batch, $\bar{\lambda}^{(synth)}$ , by averaging the envelopes across the batch dimension:

\bar{\lambda}^{(real)}(\omega;d_{j})=\frac{1}{B}\sum_{b=1}^{B}\lambda\left(\omega;\mathbf{u}^{(b)}_{d_{j}}\right).

(12)

This batch-averaging step reduces the variance of the periodogram estimator and provides a robust target frequency profile for the generator. We adapt this estimation for the continuous numerical features $c_{i}$ by relying on their discrete mode indicators (as detailed in Section IV-C). Let $\mathbf{m}^{(b)}_{c_{i}}$ denote the discretized sequence for the $b$ -th instance of feature $c_{i}$ . The mean spectral envelope is correspondingly calculated as:

\bar{\lambda}^{(real)}(\omega;c_{i})=\frac{1}{B}\sum_{b=1}^{B}\lambda\left(\omega;\mathbf{m}^{(b)}_{c_{i}}\right).

(13)

The synthetic counterparts, $\bar{\lambda}^{(synth)}(\omega;d_{j})$ and $\bar{\lambda}^{(synth)}(\omega;c_{i})$ , are computed analogously over the generated batch.

Loss Formulation: To enforce periodic consistency across the entire relational dataset, we partition our frequency-domain objective into two components. The categorical spectral loss, $\mathcal{L}_{spec}^{(cat)}$ , minimizes the average $L_{2}$ distance between the real and synthetic mean spectral envelopes across all $J$ categorical features:

\mathcal{L}_{spec}^{(cat)}=\frac{1}{J}\sum_{j=1}^{J}\left\|\bar{\lambda}^{(real)}(\cdot;d_{j})-\bar{\lambda}^{(synth)}(\cdot;\theta,d_{j})\right\|_{2}.

(14)

Likewise, the continuous spectral loss, $\mathcal{L}_{spec}^{(cont)}$ , computes the average $L_{2}$ distance across the $I$ discretized numerical features:

\mathcal{L}_{spec}^{(cont)}=\frac{1}{I}\sum_{i=1}^{I}\left\|\bar{\lambda}^{(real)}(\cdot;c_{i})-\bar{\lambda}^{(synth)}(\cdot;\theta,c_{i})\right\|_{2}.

(15)

The total spectral envelope loss, $\mathcal{L}_{spec}$ , is constructed as the weighted sum of these two terms, distributed proportionally to the number of features of each type:

\mathcal{L}_{spec}=\frac{J}{J+I}\mathcal{L}_{spec}^{(cat)}+\frac{I}{J+I}\mathcal{L}_{spec}^{(cont)}.

(16)

Minimizing this unified term explicitly forces the generator to align the latent periodicities (e.g., seasonality, cyclic trends) of the synthetic sequences with the ground truth across both mixed data types (see Fig. 2).

IV-E Training Loop

The training procedure employs an alternating optimization strategy to balance the competing objectives. In each epoch, we execute the following three distinct phases:

1.

Discriminator Update: First, we optimize the discriminator $D$ to distinguish between real sequences and the current synthetic output. We perform $n_{critic}$ updates to the discriminator for every generator update to maintain an optimal gradient approximation for the WGAN objective.
2.

Adversarial Generator Update: Second, we update the generator $G$ by minimizing $\mathcal{L}_{adv}$ . In this step, the generator weights are adjusted to fool the discriminator, ensuring global statistical coherence and adherence to the parent conditioning.
3.

Spectral Generator Update: Finally, we perform a specialized refinement step focused on frequency-domain fidelity. We update the generator by minimizing $\mathcal{L}_{spec}$ . This update is repeated $n_{steps\_for\_spec}$ times per epoch.

V Design of Simulated Data for Experiments

Validating generative models on real-world categorical time series is inherently difficult. Because real-world data lacks a definitive "ground truth" for its underlying stochastic frequencies, evaluations often rely on noisy periodogram estimates. To rigorously evaluate whether a generative model genuinely learns complex frequency-domain features, rather than merely memorizing local transitions, it is crucial to employ benchmark time series where the spectral properties are known beforehand.

To this end, we turn to stationary Markov chains. These stochastic processes provide a controlled, "gold standard" evaluation environment for two primary reasons: first, they can be easily and exactly simulated to generate massive, customized datasets for model training; second, they allow for the exact analytical derivation of their theoretical spectral envelope. By comparing the empirical spectral envelope of the generated sequences against this mathematically known ground truth, we can accurately measure the frequency fidelity of our synthetic approximations.

We map the categorical series $X_{t}$ into the multivariate point process $Y_{t}$ (one-hot vector). Specifically, $Y_{t}$ takes values in the set of standard basis vectors $\{e_{0},\dots,e_{K-1}\}\subset\mathbb{R}^{K}$ . In this one-hot encoded representation, $e_{j}$ is a vector with a $1$ at the $j+1$ -th position and $0$ everywhere else, corresponding exactly to the event that $X_{t}$ is in state $a_{j}$ . The process is characterized by the Transition Matrix Function, denoted as $\mathcal{T}(h)$ . This matrix-valued function describes the conditional probability of the process transitioning from one basis state to another over a given time lag $h\geq 1$ .

For a stationary categorical process, the entry $(i,j)$ of the transition matrix function at lag $h$ , $\mathcal{T}_{ij}(h)$ , represents the probability of transitioning from state $e_{i}$ to state $e_{j}$ after $h$ steps:

\mathcal{T}_{ij}(h)=\mathbb{P}[Y_{t+h}=e_{j}\mid Y_{t}=e_{i}].

(17)

This matrix captures the "flow" of probability mass across the state space over time.

In the context of a first-order Markov chain, the behavior of the transition matrix function $\mathcal{T}(h)$ is strictly governed by the immediate 1-step transitions. Consequently, the transition probabilities at any lag $h$ are entirely determined by the $h$ -th power of the 1-step transition matrix $P=\mathcal{T}(1)$ :

\mathcal{T}(h)=P^{h}.

This transition relationship is the key to computing the temporal covariance of the process. For a stationary categorical process $Y_{t}$ characterized by a stationary distribution vector $\pi$ (row vector), let $\Pi=\text{diag}(\pi)$ denote the diagonal matrix of its marginal probabilities. The autocovariance matrix function, $\Gamma(h)$ , is directly related to the transition matrix function $\mathcal{T}(h)$ by the following equation:

\Gamma(h)=\Pi\mathcal{T}(h)-\pi^{\prime}\pi,\quad\text{for }h\geq 0.

(18)

By substituting the property established above for a first-order Markov chain where the multi-step transition is simply the matrix power $\mathcal{T}(h)=P^{h}$ , this general relation simplifies significantly. The autocovariance matrix function reduces to a geometric decay governed entirely by the 1-step transition matrix $P$ :

\Gamma(h)=\Pi P^{h}-\pi^{\prime}\pi.

Equation (18) reveals that the spectral properties of the process are entirely governed by the relaxation of the transition mechanism $\mathcal{T}(h)$ .

For the remainder of this section, we assume that the stationary categorical process $X_{t}$ is a first-order Markov chain characterized by the one-step transition matrix $P=\mathcal{T}(1)$ .

V-A Spectral Properties of Circulant Transitions

Deriving the spectral envelope for a general transition matrix requires numerically solving the eigenvalue problem at every frequency. For the class of circulant transition matrices, we can derive an exact analytical form that links the stochastic parameters directly to the spectral shape.

A transition matrix $P$ is circulant if every row is a cyclic right shift of the preceding row. Consequently, the entire matrix is fully characterized by its first row vector $\textbf{b}=[b_{0},b_{1},\dots,b_{K-1}]$ , where $b_{j}=\mathbb{P}(Y_{t+1}=e_{j}\mid Y_{t}=e_{0})$ .

The general form of such a matrix is:

P=\begin{bmatrix}b_{0}&b_{1}&b_{2}&\dots&b_{K-1}\\ b_{K-1}&b_{0}&b_{1}&\dots&b_{K-2}\\ b_{K-2}&b_{K-1}&b_{0}&\dots&b_{K-3}\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ b_{1}&b_{2}&b_{3}&\dots&b_{0}\end{bmatrix}.

(19)

This structural symmetry serves as a mathematical bridge between the time domain and the frequency domain: circulant matrices are diagonalized by the Inverse Discrete Fourier Transform matrix [6], a property we leverage to derive analytical spectral envelopes. Then, the eigenvalues $\gamma_{k}$ of the circulant matrix in (19) is given by:

\gamma_{k}=\sum_{j=0}^{K-1}b_{j}e^{i\frac{2\pi jk}{K}}\text{ for }k=0,\ldots,K-1.

(20)

The magnitude ( $|\gamma_{k}|$ ) is determined by the concentration of the probability mass in b. If b is highly concentrated (low entropy), the magnitude approaches 1 ( $|\gamma_{k}|\approx 1$ ), implying long memory, whereas a uniform b (high entropy) yields $|\gamma_{k}|\approx 0$ , which is characteristic of a white noise process.

Lemma 3 (Spectral Envelope of Circulant Chains).

Let $X_{t}$ be a stationary categorical process with $K$ states governed by a circulant transition matrix $P$ . Let $\gamma_{k}=r_{k}e^{i\phi_{k}}$ be the eigenvalues of $P$ expressed in polar form i.e. $r_{k}=|\gamma_{k}|$ and $\phi_{k}=\arg(\gamma_{k})$ . The spectral envelope $\lambda(\omega)$ is the upper boundary of the spectral densities of the $K-1$ non-trivial eigenmodes:

\lambda(\omega)=\max_{k\in\{1,\dots,K-1\}}\left(\frac{1-r_{k}^{2}}{1-2r_{k}\cos(2\pi\omega-\phi_{k})+r_{k}^{2}}\right).

(21)

The proof of this lemma is in the appendix.

Using this lemma, we analyze two types of circulant chains representing distinct temporal dynamics: periodicity and inertia.

V-B The Noisy Cyclic Process (Periodicity)

This process models periodic behavior with phase noise, serving as a robust benchmark for capturing seasonality and cyclic constraints.

A Noisy Cyclic Process (NCP) is defined by the transition matrix:

P_{ij}=\begin{cases}\alpha&\text{if }j\equiv(i+1)\pmod{K}\\ 1-\alpha&\text{if }j=i\end{cases}

where $\alpha\in(0.5,1)$ is the switching state parameter. An NCP is a circulant chain where $\textbf{b}=[1-\alpha,\alpha,0,\dots]$ . Its spectral envelope is given by equation (21) where :

	$\displaystyle r_{k}$	$\displaystyle=\sqrt{1-2\alpha(1-\alpha)\left[1-\cos\left(\frac{2\pi k}{K}\right)\right]},$
	$\displaystyle\phi_{k}$	$\displaystyle=\arctan\left(\frac{\alpha\sin(2\pi k/K)}{(1-\alpha)+\alpha\cos(2\pi k/K)}\right).$

The NCP serves as a robust benchmark for modeling periodic behavior and cyclic constraints under varying degrees of phase noise. Its temporal dynamics are primarily controlled by the switching state parameter $\alpha\in(0.5,1)$ , which dictates the strictness of the cycle. Figure 3 visualizes the spectral envelope of the NCP with a state space of $K=7$ for different values of $\alpha$ . As shown, the process naturally exhibits distinct resonant peaks clustered around the fundamental frequency of $1/7\approx 0.14$ and its associated harmonics. When the cycle strength $\alpha$ approaches 1 (represented by the darker lines), the system mimics a deterministic cycle, concentrating the spectral energy into very sharp, Dirac-like peaks. Conversely, as $\alpha$ decreases toward its lower bound, the process introduces greater phase noise, which progressively broadens these sharp harmonic peaks into wide spectral hills, reflecting a more stochastic and relaxed periodic progression.

V-C The Symmetric Sticky Process (Inertia)

This process models systems with inertia, where the state tends to persist over time with no preferred direction of change.

A Symmetric Sticky Process (SSP) is defined by the transition matrix:

P_{ij}=\begin{cases}\alpha&\text{if }i=j\\ \frac{1-\alpha}{K-1}&\text{if }i\neq j\end{cases}

where $\alpha\in(1/K,1)$ is the switching state parameter. We can remark that the transition matrix of the SSP is circulant such that the first row $\textbf{b}=[\alpha,\frac{1-\alpha}{K-1},\dots]$ . We can derive the spectral envelope of the SSP from equation (21):

\lambda(\omega)=\frac{1-\gamma^{2}}{1-2\gamma\cos(2\pi\omega)+\gamma^{2}},

where $\gamma=\frac{\alpha K-1}{K-1}$ . The non-trivial eigenvalues are identical and real corresponding to $\gamma$ .

The asymptotic behavior of the process is highly sensitive to the switching state parameter $\alpha$ . As $\alpha\to 1$ , the eigenvalue $\gamma\to 1$ , causing the process to become extremely "sticky" and rarely switch states. In this regime, the spectral envelope forms a sharp peak at $\omega=0$ , ultimately approaching a Dirac delta. Conversely, as $\alpha\to 1/K$ , the eigenvalue $\gamma\to 0$ , which reduces the process to pure random noise (see Figure 3). Consequently, the spectral energy becomes uniformly distributed, and the envelope flattens to a constant line where $\lambda(\omega)\approx 1$ .

V-D Bayesian Hierarchical Sampling for generating simulated relational databases

To rigorously evaluate the conditional generation capabilities of our model, we elaborate a method for building a synthetic relational database using a Bayesian hierarchical framework. Unlike using a single fixed parameter for the entire dataset, we model the switching parameter $\alpha$ as a random variable associated with each parent entity. This setup forces the generative model to learn the mapping $\alpha\mapsto\text{Spectral Envelope}(\alpha)$ rather than memorizing a static distribution.

Following the relational schema $\mathcal{S}$ introduced in Section IV, we construct a framework where the parent table $W$ governs the stochastic dynamics of the child time series in table $U$ . Note that for any single simulated database, we select only one of these two types of circulant chains (either the Symmetric Sticky or the NCP) and fix the total number of categorical states $K$ to drive the temporal dynamics. Because both processes are fully parameterized by the switching parameter $\alpha$ for a given $K$ , we apply a Bayesian hierarchical modeling to each choice: first, we sample and store $N$ values of $\alpha$ in the parent table; second, we generate a corresponding categorical time series for each of these $N$ values using the chosen Markov process.

For the NCP, where $\alpha$ represents the probability of advancing the cycle, we use a uniform prior over the valid range of directed cycles:

\alpha_{i}\sim\mathcal{U}(0.5,1.0),

where $\alpha_{i}$ is the random variable illustrating the value of $\alpha$ for the $i$ -th row in the parent table. For the Symmetric Sticky, we simply replace $\mathcal{U}(0.5,1.0)$ distribution by $\mathcal{U}(1/K,1.0)$ .

For each parent row $i$ , we generate a categorical time series $\textbf{u}^{(i)}=\{u^{(i)}_{1},\dots,u^{(i)}_{T}\}$ of length $T$ , which acts as part of the child table $X$ . The dynamics of this series are conditioned strictly on the parent’s parameter $\alpha_{i}$ .

The sequence is generated via the transition matrix $P_{\alpha_{i}}$ specific to the chosen type of circulant chain (Symmetric Sticky or Noisy Cyclic):

\mathbb{P}(u^{(i)}_{t+1}|u^{(i)}_{t},\alpha_{i})=[P_{\alpha_{i}}]_{u^{(i)}_{t},u^{(i)}_{t+1}}.

(22)

This hierarchical construction provides a "gold standard" dataset for conditional generative modeling. Since the true spectral envelope $\lambda(\omega;\alpha_{i})$ is analytically known for every parent $i$ (via Lemma 3), we can compute the exact expected spectral error. A successful generative model must produce synthetic children $\hat{\textbf{u}}^{(i)}$ such that their empirical spectral envelopes match the theoretical envelopes dictated by their sampled parent attributes $\hat{\alpha}_{i}$ .

VI New Metrics Based on Spectral Analysis

Evaluating the fidelity of synthetic relational time series requires going beyond simple marginal distributions or static correlations. Standard metrics often fail to accurately detect if the synthetic data preserves the specific frequency-domain characteristics (such as seasonality and cyclic constraints) inherent to the real process. To address this, we propose two metrics rooted in spectral analysis.

Let $\{w_{1},\dots,w_{M}\}$ be the set of parent instances in the real database, and $\{\hat{w}_{1},\dots,\hat{w}_{M^{\prime}}\}$ be the set of parent instances in the synthetic database. Each parent $w_{m}$ (or $\hat{w}_{m^{\prime}}$ ) identifies a specific sub-population of children rows forming a multivariate time series.

For a numerical attribute $c_{i}$ , let $f(\omega;c_{i},w_{m})$ denote the spectral density of the series associated with parent $w_{m}$ . For a categorical attribute $d_{j}$ , let $\lambda(\omega;d_{j},w_{m})$ denote its spectral envelope. We define the mean spectral densities and mean spectral envelopes for the real data as:

	$\displaystyle\bar{f}^{(real)}(\omega;c_{i})$	$\displaystyle=\frac{1}{M}\sum_{m=1}^{M}f(\omega;c_{i},w_{m}),$		(23)
	$\displaystyle\bar{\lambda}^{(real)}(\omega;d_{j})$	$\displaystyle=\frac{1}{M}\sum_{m=1}^{M}\lambda(\omega;d_{j},w_{m}).$		(24)

The synthetic counterparts, $\bar{f}^{(synth)}$ and $\bar{\lambda}^{(synth)}$ , are defined analogously over the $M^{\prime}$ synthetic parents.

VI-A Spectral Density Divergence ( $\overline{\mathcal{D}}_{spec}$ )

To evaluate the temporal fidelity of continuous features, we measure the divergence between the average power spectrums. We first normalize the spectral densities so that $\int_{-1/2}^{1/2}f(\omega)d\omega=1$ , considering them as probability distributions over the frequency domain.

The individual divergence $div(\cdot,\cdot)$ for an attribute $c_{i}$ is defined as the divergence (e.g., Wasserstein or Kullback-Leibler) between the real and synthetic mean densities. The global Spectral Density Divergence (SDD) is the average over all continuous attributes:

\overline{\mathcal{D}}_{spec}=\frac{1}{I}\sum_{i=1}^{I}{\mathcal{D}_{spec}(c_{i})},

(25)

where $\mathcal{D}_{spec}(c_{i})=div\left(\bar{f}^{(real)}(\cdot;c_{i}),\bar{f}^{(synth)}(\cdot;c_{i})\right)$ . It is worth noting that the aggregated quantities $\bar{f}^{(real)}(\cdot;c_{i})$ and $\bar{f}^{(synth)}(\cdot;c_{i})$ constitute well-defined normalized spectral densities. Mathematically, the set of valid normalized spectral densities is closed under convex combinations; since each individual $f(\omega;c_{i},w_{m})$ is a non-negative, real-valued, its integral under interval $[-1/2,1/2]$ is equal to one, and even function (for real-valued processes), their arithmetic mean preserves these fundamental properties. Consequently, $\bar{f}^{(real)}$ effectively represents the spectral density of a "representative" process for the class $U$ , averaging out local idiosyncrasies to reveal the global frequency structure of the population. This validity allows us to treat these patterns as probability distributions and rigorously apply divergence metrics $div$ such as Kullback-Leibler (KL) or Wasserstein distance.

VI-B Spectral Envelope Divergence ( $\overline{\mathcal{D}}_{env}$ )

For categorical attributes, we assess the preservation of latent periodicities using the spectral envelope. The divergence for a single attribute $d_{j}$ is calculated as the $L^{2}$ distance between the mean envelopes:

\mathcal{D}_{env}(d_{j})=\frac{\big\|\bar{\lambda}^{(real)}(\cdot;d_{j})-\bar{\lambda}^{(synth)}(\cdot;d_{j})\big\|_{2}}{K-1}.

(26)

The global Spectral Envelope Divergence (SED) is the mean over all categorical attributes:

\overline{\mathcal{D}}_{env}=\frac{1}{J}\sum_{j=1}^{J}\mathcal{D}_{env}(d_{j}).

(27)

The aggregated spectral envelope $\bar{\lambda}^{(real)}(\omega;d_{j})$ inherits the fundamental algebraic properties of its constituents. Since $\bar{\lambda}^{(real)}$ is constructed as a finite linear combination of individual envelopes, and given that each component $\lambda(\cdot;d_{j},w_{m})$ is continuous on the compact interval $[-1/2,1/2]$ (Lemma 1), the mean envelope itself is necessarily a continuous function in $C^{0}([-1/2,1/2])$ . Furthermore, the fundamental norm constraints established in Lemma 2 are preserved under this averaging operation. Specifically, by the convexity of the norm, the mean envelope satisfies the dimensionality constraint $1\leq\|\bar{\lambda}^{(real)}(\cdot;d_{j})\|_{1}\leq K-1$ (If $K$ is the number of categories of $d_{j}$ ) and maintains finite energy $1\leq\|\bar{\lambda}^{(real)}(\cdot;d_{j})\|_{2}<\infty$ . These properties ensure that $\bar{\lambda}^{(real)}$ remains a well-defined term within the functional $L^{2}\left([-1/2,1/2]\right)$ and verify important properties of spectral envelopes, guaranteeing that the divergence metric $\overline{\mathcal{D}}_{env}$ is both bounded and mathematically stable.

VII Experiments

In this section, we empirically evaluate the performance of our proposed generative framework. The primary objective is to demonstrate that integrating the spectral envelope loss into a sequential GAN architecture significantly improves the preservation of latent periodicities and temporal dynamics in both continuous and categorical time series conveyed by rows of child tables of relational databases. To this end, we conduct comprehensive experiments on both simulated data and real-world transactional datasets. We compare our model against several state-of-the-art generative approaches, followed by an ablation study to evaluate the contribution of specific components within our proposed model.

VII-A Experimental setup

VII-A1 Simulated Data (Bayesian Hierarchical Benchmarks)

We first evaluate our models on simulated relational databases generated via our Bayesian Hierarchical Benchmarking framework. We simulate two distinct categorical Markov processes:

•

NCP: Tests the model’s ability to capture periodic behaviors of categorical time series. The transition parameter is sampled from the prior $\alpha\sim\mathcal{U}(0.5,1.0)$ .
•

SSP: Tests the model’s ability to capture low-frequency, high-inertia dynamics (low-pass filter behavior). The transition parameter is sampled from the prior $\alpha\sim\mathcal{U}(1/K,1.0)$ .

Specifically, we synthesize a parent table comprising 100 independent entities (rows), where each entity’s attribute $\alpha$ is drawn from the corresponding uniform prior. For each parent row, we simulate a sequence of 10,000 child rows, generated using the Markov transition matrix governed by that specific $\alpha$ . This hierarchical construction dictates the exact theoretical spectral envelope of the generated child sequences, providing a rigorous "gold standard" to isolate and evaluate frequency-domain fidelity across varying state space sizes ( $K=7,12,21$ ).

VII-A2 Real-World Datasets

We also evaluate our method on two real-world relational databases containing complex temporal dynamics: Rossmann [14] and Walmart [20]. Each database adheres to a two-level hierarchical relational schema. The parent table contains static metadata representing the set of individual stores (e.g., store type, location), while the child table contains the multivariate time series for each store, encompassing both continuous numerical features (e.g., daily or weekly sales volume) and highly periodic categorical features (e.g., day of the week, promotional events).

VII-A3 Baseline models

We compare our proposed method (Seq. RC-TGAN) against state-of-the-art systems spanning standard relational generation and dedicated time series generative adversarial networks:

•

SDV [12]: A standard probabilistic relational model that builds generative models of relational databases by computing statistics at the intersection of related tables.
•

ClavaDDPM [11]: A recent diffusion-based approach for multi-relational data synthesis.
•

DoppelGANger [7]: A state-of-the-art GAN designed for networked time series that tackles mode collapse and long-term dependencies.
•

TimeGAN [22]: A time series GAN that combines the unsupervised adversarial paradigm with the control of supervised training through a jointly optimized latent embedding space.

VII-A4 Metrics

To rigorously assess both temporal fidelity and the preservation of complex temporal patterns conveyed by child tables rows structures, we employ the following metrics:

•

MSE (ACF): The Mean Squared Error between the autocorrelation of the real and synthetic data. This measures the model’s ability to preserve time-domain dependencies and localized temporal structures.
•

SDD ( $\overline{\mathcal{D}}_{spec}$ ): Defined in (25), we specifically use the KL divergence as $div$ function to evaluate categorical time series.
•

SED ( $\overline{\mathcal{D}}_{env}$ ): Defined in (27), used to evaluate numerical time series.

VII-B Experimental Results

The results are summarized in Tables II and I, alongside visual analyses in Figures 5 and 4. To assess model performance, we highlight relative improvements over the second-best baselines and evaluate statistical significance using a two-sample t-test ( $p<0.05$ ).

VII-B1 Performance on Simulated Data

Table I demonstrates that Seq. RC-TGAN achieves statistically significant reductions in SED ( $\overline{\mathcal{D}}_{env}$ ) across all tested state space sizes ( $K=7,12,21$ ). For the NCP, our proposed method yields relative improvements of 45.8%, 43.3%, and 28.2% over the next best baselines at $K=7,12,$ and $21$ , respectively. This trend holds for the SSP, where Seq. RC-TGAN significantly outperforms the second-best models by 32.7% ( $K=7$ ), 52.8% ( $K=12$ ) and 37.48% ( $K=21$ ).

Figure 4 provides a qualitative spectral analysis of the generated sequences against the mathematical ground truth (black dashed line). A rigorous visual evaluation of synthetic categorical time series in the frequency domain necessitates assessing two critical criteria: (1) frequency localization, ensuring the synthetic envelope successfully exhibits peaks at all theoretical fundamental and harmonic frequencies; (2) spectral purity, verifying that every peak in the synthetic envelope corresponds to a true theoretical peak without introducing spurious periodic artifacts.

Our proposed model consistently satisfies both criteria. For the SSP (top row), it accurately captures the theoretical low-pass filter behavior, concentrating the precise magnitude of spectral mass exactly at $\omega=0$ without hallucinating higher-frequency artifacts as the persistence parameter $\alpha$ approaches 1. For the NCP (bottom row), the model successfully isolates the true fundamental harmonic peaks (e.g., near $\omega\approx 0.14$ for $K=7$ ), maintains strict spectral purity by avoiding false peaks, and faithfully reproduces the theoretical amplitudes to accurately model the phase noise. In stark contrast, competing baselines universally fail these criteria, failing to detect true periodicities and typically collapsing into flat, white-noise-like representations.

VII-B2 Performance on Real-World Data

Table II details the performance of the generative models on the Rossmann and Walmart databases. The proposed framework establishes state-of-the-art performance across multiple facets of temporal generation. Most notably, it demonstrates a unique ability to model complex categorical time series, yielding statistically significant divergence reductions in SED ( $\overline{\mathcal{D}}_{env}$ ) compared to the second-best baselines: 37.8% and 59.3% relative gain on the Rossmann and Walmart databases, respectively ( $p<0.05$ ).

Furthermore, although the spectral envelope theory described in section II is inherently designed for categorical sequences, our framework extends this frequency-domain regularization to continuous numerical attributes by employing a Variational Gaussian Mixture Model (VGM) discretization strategy. By mapping continuous values to discrete mode indicators, the spectral loss successfully enforces periodic consistency across all feature types. This adaptation directly contributes to our model achieving a statistically significant 33.7% relative improvement in continuous SDD ( $\overline{\mathcal{D}}_{spec}$ ) on the Rossmann dataset, as well as a relative gain of 24.8% over DoppelGANger on the Walmart dataset ( $p<0.05$ for both).

SDV, ClavaDDPM, and TimeGAN consistently demonstrate the weakest performance, struggling to capture both local time-domain structures and global frequency distributions. In contrast, DoppelGANger proves to be a highly competitive state-of-the-art baseline for numerical data. On the Walmart dataset, DoppelGANger and our proposed model are statistically tied for the best time-domain performance (MSE ACF of 0.012 vs. 0.013, respectively; $p>0.05$ ). However, despite DoppelGANger’s proficiency with continuous variables, it struggles significantly when tasked with modeling the periodic dynamics of categorical time series. Our proposed method overcomes this important limitation via our unified spectral envelope regularization.

This dynamic is visually corroborated by the Autocorrelation Function (ACF) analysis of numerical time series presented in Figure 5. As seen in the top row (a, b), static tabular models like SDV and ClavaDDPM completely fail to capture temporal dependencies, resulting in flat ACF curves. This visual analysis highlights a critical limitation of the commonly used MSE (ACF) metric. On the Rossmann dataset, the static SDV model achieves a nominally better MSE (ACF) compared to the sequential DoppelGANger model (0.0700 vs. 0.0868). However, visual inspection reveals that SDV merely produces a flat line near zero; this mathematically minimizes the mean squared error across all lags by "playing it safe," but fails entirely to capture underlying temporal dynamics. DoppelGANger successfully reproduces the oscillating shape of the sales autocorrelation but misses the exact amplitude of certain peaks, resulting in a harsher point-wise MSE penalty.

The spectral metrics ( $\overline{\mathcal{D}}_{spec}$ and $\overline{\mathcal{D}}_{env}$ ) correct this incompleteness by evaluating the global frequency structure rather than local point-wise errors. Ultimately, our proposed Seq. RC-TGAN framework bridges all these gaps, demonstrating the ability to accurately capture local point-wise dependencies, global continuous periodicities via VGM adaptation, and crucially, the complex structural harmonics inherent to categorical time series.

TABLE I: Perfomance on simulated datasets: Comparison of

\overline{\mathcal{D}}_{env}

between our proposed method and baselines across different state space sizes (

K

). Lower is better.

Noisy Cyclic
	State Space Size ( $K$ )
Model	K=7	K=12	K=21
SDV	$\underline{0.7830\pm 0.0582}$	$0.9459\pm 0.1804$	$0.8418\pm 0.0794$
ClavaDDPM	$0.8729\pm 0.0660$	$0.8922\pm 0.1036$	$0.9101\pm 0.1797$
DoppelGANger	$0.9467\pm 0.0579$	$\underline{0.8786\pm 0.1071}$	$\underline{0.8336\pm 0.0452}$
TimeGAN	$0.9100\pm 0.1302$	$0.9135\pm 0.1427$	$0.9076\pm 0.0860$
Seq. RC-TGAN	$\mathbf{0.4246\pm 0.0117}$	$\mathbf{0.4984\pm 0.0319}$	$\mathbf{0.5988\pm 0.0202}$
Symmetric Sticky
SDV	$0.2182\pm 0.0297$	$0.1012\pm 0.0144$	$0.0661\pm 0.0112$
ClavaDDPM	$0.1873\pm 0.0266$	$\underline{0.0949\pm 0.0050}$	$0.0569\pm 0.0065$
DoppelGANger	$0.7691\pm 0.1404$	$0.2123\pm 0.0240$	$0.1258\pm 0.0173$
TimeGAN	$\underline{0.1813\pm 0.0208}$	$0.1007\pm 0.0069$	$\underline{0.0555\pm 0.0074}$
Seq. RC-TGAN	$\mathbf{0.1221\pm 0.0023}$	$\mathbf{0.0448\pm 0.0018}$	$\mathbf{0.0347\pm 0.0009}$

TABLE II: Performance Metrics on real-world datasets: MSE (ACF),

\overline{\mathcal{D}}_{spec}

, and

\overline{\mathcal{D}}_{env}

Dataset	Model	MSE (ACF)	$\overline{\mathcal{D}}_{spec}$	$\overline{\mathcal{D}}_{env}$
Rossmann	SDV	$\underline{0.0700\pm 0.0000}$	$50.00\%\pm 0.00\%$	$\underline{0.7359\pm 0.0000}$
	ClavaDDPM	$0.0702\pm 0.0001$	$50.30\%\pm 0.08\%$	$0.7407\pm 0.0006$
	DoppelGANger	$0.0868\pm 0.0293$	$\underline{46.22\%\pm 4.34\%}$	$0.7792\pm 0.5640$
	TimeGAN	$0.0951\pm 0.0317$	$57.20\%\pm 5.21\%$	$1.2507\pm 0.3600$
	Seq. RC-TGAN	$\mathbf{0.0340\pm 0.0072}$	$\mathbf{30.66\%\pm 3.86\%}$	$\mathbf{0.4578\pm 0.1630}$
Walmart	SDV	$0.1223\pm 0.0000$	$50.99\%\pm 0.00\%$	$0.0757\pm 0.0000$
	ClavaDDPM	$0.1195\pm 0.0015$	$45.04\%\pm 0.40\%$	$0.0727\pm 0.0008$
	DoppelGANger	$\mathbf{0.0120\pm 0.0052}$	$\underline{6.88\%\pm 0.89\%}$	$\underline{0.0118\pm 0.0028}$
	TimeGAN	$0.1250\pm 0.0208$	$16.32\%\pm 2.90\%$	$0.1446\pm 0.0108$
	Seq. RC-TGAN	$\underline{0.0130\pm 0.0112}$	$\mathbf{5.17\%\pm 0.58\%}$	$\mathbf{0.0048\pm 0.0023}$

VII-C Ablation Study

To rigorously isolate the contributions of our architectural design choices, specifically the recurrent temporal generation and the proposed frequency-domain loss, we conducted an ablation study. We compared four variants of our framework: RC-TGAN [1] (no temporal dimension modeling), the Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ ) that is the recurrent baseline without any spectral loss, the Seq. RC-TGAN (psd) which models based on the spectral density loss for the numerical columns instead of the spectral envelope loss (based on VGM), and our proposed full model, Seq. RC-TGAN.

The necessity of the defined spectral loss is most starkly evident in our highly controlled simulated environments (Table III). While the model Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ ) demonstrates improvements on empirical real-world data, it completely fails to reproduce pure mathematical periodicities. The results show that without explicit frequency-domain guidance, the recurrent baseline performs almost identically to the static RC-TGAN model on both the NCP and SSP across all state space sizes.

This reveals that standard adversarial training in the time domain, even with an RNN-based architecture, is insufficient to prevent white-noise-like spectra when faced with periodic constraints. By incorporating the spectral envelope loss, the divergence ( $\overline{\mathcal{D}}_{env}$ ) is reduced by approximately 50% across almost all configurations (e.g., a statistically significant divergence drop from 0.8483 to 0.4246 for the NCP at $K=7$ ). These results confirm that the proposed spectral loss is not merely an incremental tuning parameter for real-world data, but a fundamentally essential component for generative models to successfully reconstruct latent harmonics and system inertia in categorical time series.

Table IV details the performance of these variants on the Rossmann and Walmart datasets. Transitioning from a static generator (RC-TGAN) to a recurrent architecture (Seq. RC-TGAN (w/o $\mathcal{L}_{spec}$ )) yields statistically significant improvements ( $p<0.05$ ) across both time and frequency domains on real-world data. For instance, on the Walmart dataset, introducing the recurrent structure reduces the time-domain MSE (ACF) by 54.6% (from 0.1200 to 0.0545) and drastically reduces the continuous frequency divergence ( $\overline{\mathcal{D}}_{spec}$ ) by 81.8% (from 45.26% to 8.22%).

Evaluating the progression of our ablation study highlights the fundamental necessity of the spectral envelope loss for mixed-type columns in relational databases. First, comparing the unregularized recurrent baseline (Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ )) to the Seq. RC-TGAN (psd) variant demonstrates the advantage of the latter, as applying a standard spectral density loss successfully provides frequency-domain guidance for continuous numerical columns. However, this approach remains fundamentally insufficient for reliably capturing the complex periodic dynamics inherent to categorical data. Subsequently, comparing the Seq. RC-TGAN (psd) variant to our full model (Seq. RC-TGAN) illustrates the critical impact of our proposed approach. Implementing the full Spectral Envelope loss forces the generator to comprehensively learn overarching periodic patterns across all data types natively. This yields a statistically significant 23.9% relative reduction in categorical SED ( $\overline{\mathcal{D}}_{env}$ ) compared to the psd variant on the Rossmann dataset (dropping from 0.6014 to 0.4578). Furthermore, on the Walmart dataset, the full model drives the SED down to 0.0048, outperforming the PSD-only variant and achieving a massive 93.4% overall improvement compared to the initial recurrent baseline (dropping from 0.0732 to 0.0048).

These findings are visually confirmed by the Autocorrelation Function (ACF) analysis (Figure 5, bottom row). The static RC-TGAN completely fails to capture temporal dependencies, resulting in a flat ACF line. While the Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ ) successfully begins to capture localized temporal transitions, it still misses the global structural amplitudes. The integration of the full spectral loss acts as the definitive catalyst, enabling the generator to accurately reproduce long-range seasonal correlations rather than just step-by-step localized transitions.

TABLE III: Ablation study on simulated datasets: Comparison of

\overline{\mathcal{D}}_{env}

between RC-TGAN variants across different state space sizes (

K

). Lower is better.

Benchmark: Noisy Cyclic
	State Space Size ( $K$ )
Model	K=7	K=12	K=21
RC-TGAN	$\underline{0.8215\pm 0.0864}$	$\underline{0.9719\pm 0.1127}$	$0.9652\pm 0.3088$
Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ )	$0.8483\pm 0.1094$	$0.9775\pm 0.1155$	$\underline{0.8260\pm 0.0261}$
Seq. RC-TGAN	$\mathbf{0.4246\pm 0.0117}$	$\mathbf{0.4984\pm 0.0319}$	$\mathbf{0.5988\pm 0.0202}$
Benchmark: Symmetric Sticky
RC-TGAN	$\underline{0.1962\pm 0.0159}$	$0.0957\pm 0.0049$	$\underline{0.0546\pm 0.0065}$
Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ )	$0.1988\pm 0.0169$	$\underline{0.0926\pm 0.0086}$	$0.0588\pm 0.0029$
Seq. RC-TGAN	$\mathbf{0.1221\pm 0.0023}$	$\mathbf{0.0448\pm 0.0018}$	$\mathbf{0.0347\pm 0.0009}$

TABLE IV: Ablation study on real-world datasets: MSE (ACF),

\overline{\mathcal{D}}_{spec}

, and

\overline{\mathcal{D}}_{env}

Dataset	Model	MSE (ACF)	$\overline{\mathcal{D}}_{spec}$	$\overline{\mathcal{D}}_{env}$
Rossmann	RC-TGAN	$0.0703\pm 0.0002$	$50.29\%\pm 0.06\%$	$0.7404\pm 0.0003$
	Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ )	$0.0687\pm 0.0037$	$\underline{43.72\%\pm 2.54\%}$	$\underline{0.5790\pm 0.1574}$
	Seq. RC-TGAN (psd)	$\underline{0.0681\pm 0.0322}$	$43.99\%\pm 4.78\%$	$0.6014\pm 0.1771$
	Seq. RC-TGAN	$\mathbf{0.0340\pm 0.0072}$	$\mathbf{30.66\%\pm 3.86\%}$	$\mathbf{0.4578\pm 0.1630}$
Walmart	RC-TGAN	$0.1200\pm 0.0012$	$45.26\%\pm 0.58\%$	$0.0833\pm 0.0017$
	Seq. RC-TGAN (w\o $\mathcal{L}_{spec}$ )	$0.0545\pm 0.0227$	$\underline{8.22\%\pm 0.96\%}$	$0.0732\pm 0.0016$
	Seq. RC-TGAN (psd)	$\underline{0.0389\pm 0.0212}$	$16.23\%\pm 8.16\%$	$\underline{0.0067\pm 0.0102}$
	Seq. RC-TGAN	$\mathbf{0.0130\pm 0.0112}$	$\mathbf{5.17\%\pm 0.58\%}$	$\mathbf{0.0048\pm 0.0023}$

VIII Conclusion

In this paper, we addressed the critical challenge of generating high-fidelity time series within relational databases by introducing Seq. RC-TGAN, a sequential generative adversarial network enhanced with a novel, integrated spectral envelope loss. Rather than relying solely on static encodings, our framework explicitly optimizes the network to preserve the complex frequency-domain features of both categorical and continuous time series during training. Furthermore, we established a mathematically rigorous evaluation paradigm by analytically deriving the spectral envelope for circulant Markov chains, providing a "gold standard" for categorical time series alongside two novel spectral divergence metrics. Extensive experiments on these simulated data and real-world datasets (Rossmann and Walmart) demonstrate that our approach significantly outperforms state-of-the-art baselines in capturing latent periodicities, strict cyclic constraints, and long-term seasonality.

References

[1] M. Gueye, Y. Attabi, and M. Dumas (2023) Row conditional-TGAN for generating synthetic relational databases. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. Cited by: §I, §IV-B, §VII-C.
[2] V. Hudovernik, M. Xu, J. Shi, L. Šubelj, S. Ermon, E. Štrumbelj, and J. Leskovec (2025) RelDiff: relational data generative modeling with graph-based diffusion models. arXiv preprint arXiv:2506.00710. Cited by: §I.
[3] H. Y. J. Kang, M. Ko, and K. S. Ryu (2025) Tabular transformer generative adversarial network for heterogeneous distribution in healthcare. Scientific Reports 15 (1), pp. 10254. Cited by: §I.
[4] D. Koller (1999) Probabilistic relational models. In International Conference on Inductive Logic Programming, pp. 3–13. Cited by: §IV-A.
[5] A. Kotelnikov, D. Baranchuk, I. Rubachev, and A. Babenko (2024-10) TabDDPM: Modelling Tabular Data with Diffusion Models. arXiv. Note: arXiv:2209.15421 External Links: Link, Document Cited by: §I.
[6] I. Kra and S. R. Simanca (2012) On circulant matrices. Notices of the AMS 59 (3), pp. 368–377. Cited by: §V-A.
[7] Z. Lin, A. Jain, C. Wang, G. Fanti, and V. Sekar (2020) Using gans for sharing networked time series data: challenges, initial promise, and open questions. In Proceedings of the ACM internet measurement conference, pp. 464–483. Cited by: §I, 3rd item.
[8] Y. Ma, D. Qu, and Y. Wang (2026) Dynamic community detection using class preserving time series generation with fourier markov diffusion. Scientific Reports. Cited by: §I.
[9] L. R. Medsker, L. Jain, et al. (2001) Recurrent neural networks. Design and applications 5 (64-67), pp. 2. Cited by: §IV-B1.
[10] I. Padhi, Y. Schiff, I. Melnyk, M. Rigotti, Y. Mroueh, P. Dognin, J. Ross, R. Nair, and E. Altman (2021) Tabular transformers for modeling multivariate time series. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3565–3569. Cited by: §I.
[11] W. Pang, M. Shafieinejad, L. Liu, S. Hazlewood, and X. He (2024-11) ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models. arXiv. Note: arXiv:2405.17724 External Links: Link, Document Cited by: §I, 2nd item.
[12] N. Patki, R. Wedge, and K. Veeramachaneni (2016-10) The Synthetic Data Vault. 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA) 2016, pp. 399–410 (en). External Links: Document Cited by: §I, 1st item.
[13] X. Piao, Z. Chen, L. Zhu, Y. Dong, Y. Matsubara, and Y. Sakurai (2026) TIFO: time-invariant frequency operator for stationarity-aware representation learning in time series. arXiv preprint arXiv:2602.17122. Cited by: §I.
[14] Rossmann Store Sales. External Links: Link Cited by: §VII-A2.
[15] R. Shi, Y. Wang, M. Du, X. Shen, and X. Wang (2025) A Comprehensive Survey of Synthetic Tabular Data Generation. arXiv preprint arXiv:2504.16506. Cited by: §I.
[16] R. H. Shumway and D. S. Stoffer (2006) Time series analysis and its applications: with r examples. Springer. Cited by: §I.
[17] A. V. Solatorio and O. Dupriez (2023-02) REaLTabFormer: Generating Realistic Relational and Tabular Data using Transformers. arXiv. Note: arXiv:2302.02041 External Links: Link, Document Cited by: §I.
[18] D. S. Stoffer, D. E. Tyler, and A. J. McDougall (1993) Spectral analysis for categorical time series: scaling and the spectral envelope. Biometrika 80 (3), pp. 611–622. Cited by: §I, §II-B, §IV-C.
[19] D. S. Stoffer, D. E. Tyler, and D. A. Wendt (2000) The spectral envelope and its applications. Statistical Science, pp. 224–253. Cited by: §I.
[20] T. Wilczek Walmart. External Links: Link Cited by: §VII-A2.
[21] L. Xu, M. Skoularidou, A. Cuesta-Infante, and K. Veeramachaneni (2019-10) Modeling Tabular data using Conditional GAN. arXiv:1907.00503 [cs, stat] (en). Note: arXiv: 1907.00503 External Links: Link Cited by: 2nd item, §I, §IV-C.
[22] J. Yoon, D. Jarrett, and M. Van der Schaar (2019) Time-series generative adversarial networks. Advances in neural information processing systems 32. Cited by: §I, 4th item.
[23] Z. Zhang, Q. Ouyang, Z. Yu, D. Pei, and T. Xiao Frequency decomposition and enhancement for time series generation using diffusion models. Cited by: §I.

Proof of Lemma 1

By assumption, the spectral density matrix of the one-hot encoded process, $f_{Y}(\omega)$ , is continuous. Because the categories are mutually exclusive and exhaustive, the covariance matrix $V$ has rank $K-1$ . We can apply any $K\times(K-1)\text{-dim}$ projection matrix $Q$ to obtain a full-rank covariance matrix $\overline{V}=Q^{\prime}VQ$ and a projected spectral density matrix $\overline{f}_{Y}(\omega)=Q^{\prime}f_{Y}(\omega)Q$ .

The spectral envelope $\lambda(\omega)$ is given by the largest eigenvalue of the matrix $C(\omega)=\overline{V}^{-1/2}\overline{f}_{Y}(\omega)\overline{V}^{-1/2}$ . Since $f_{Y}(\omega)$ is continuous with respect to $\omega$ , the matrix $C(\omega)$ is also continuous.

The eigenvalues of $C(\omega)$ are the roots of its characteristic polynomial, which can be defined as $P(x,\omega)=\det(xI-C(\omega))$ . The coefficients of this polynomial are continuous functions of the entries of $C(\omega)$ which are continuous functions of $\omega$ . Then, polynomial $P(x,\omega)$ is continuous function of $\omega$ .

According to standard mathematical theorem in [ross2022yet] regarding the roots of polynomials, the roots of a polynomial are continuous functions of its coefficients. Consequently, the largest root, $\lambda(\omega)$ , is a continuous function on the fundamental frequency domain $[-1/2,1/2]$ .

Proof of Lemma 2

We prove the two properties sequentially based on the definitions of the spectral envelope and the norms.

Let $\overline{f}_{Y}(\omega)$ be the projected spectral density matrix of dimension $(K-1)\times(K-1)$ , and let $\overline{V}$ be the corresponding full-rank covariance matrix. The integral of the spectral density matrix over the fundamental frequency domain yields the covariance matrix:

\int_{-1/2}^{1/2}\overline{f}_{Y}(\omega)d\omega=\overline{V}.

Recall that the spectral envelope is defined as $\lambda(\omega)=\mu_{\max}(C(\omega))$ , where $\mu_{\max}(\cdot)$ is the highest eigenvalue function and $C(\omega)=\overline{V}^{-1/2}\overline{f}_{Y}(\omega)\overline{V}^{-1/2}$ .

Upper Bound: Integrating $C(\omega)$ over the frequency domain yields:

	$\displaystyle\int_{-1/2}^{1/2}C(\omega)d\omega$	$\displaystyle=\overline{V}^{-1/2}\left(\int_{-1/2}^{1/2}\overline{f}_{Y}(\omega)d\omega\right)\overline{V}^{-1/2}$
		$\displaystyle=I_{K-1}.$

Taking the trace of both sides, we get:

	$\displaystyle\int_{-1/2}^{1/2}\text{tr}(C(\omega))d\omega$	$\displaystyle=\text{tr}\left(\int_{-1/2}^{1/2}C(\omega)d\omega\right)$
		$\displaystyle=\text{tr}(I_{K-1})=K-1.$

Since $C(\omega)$ is positive semi-definite, its maximum eigenvalue is bounded by its trace, $\mu_{\max}(C(\omega))\leq\text{tr}(C(\omega))$ for all $\omega$ . Therefore:

\|\lambda\|_{1}=\int_{-1/2}^{1/2}\lambda(\omega)d\omega\leq\int_{-1/2}^{1/2}\text{tr}(C(\omega))d\omega=K-1.

Lower Bound: Using the variational characterization of the spectral envelope, for any non-zero vector $\bar{\beta}_{0}\in\mathbb{R}^{K-1}$ , we have:

\lambda(\omega)=\sup_{\bar{\beta}\in\mathbb{R}^{K-1}}\frac{\bar{\beta}^{\prime}\overline{f}_{Y}(\omega)\bar{\beta}}{\bar{\beta}^{\prime}\overline{V}\bar{\beta}}\geq\frac{\bar{\beta}_{0}^{\prime}\overline{f}_{Y}(\omega)\bar{\beta}_{0}}{\bar{\beta}_{0}^{\prime}\overline{V}\bar{\beta}_{0}}.

Integrating this inequality over the frequency domain gives:

	$\displaystyle\\|\lambda\\|_{1}$	$\displaystyle=\int_{-1/2}^{1/2}\lambda(\omega)d\omega$
		$\displaystyle\geq\int_{-1/2}^{1/2}\frac{\bar{\beta}_{0}^{\prime}\overline{f}_{Y}(\omega)\bar{\beta}_{0}}{\bar{\beta}_{0}^{\prime}\overline{V}\bar{\beta}_{0}}d\omega$
		$\displaystyle=\frac{\bar{\beta}_{0}^{\prime}\left(\int_{-1/2}^{1/2}\overline{f}_{Y}(\omega)d\omega\right)\bar{\beta}_{0}}{\bar{\beta}_{0}^{\prime}\overline{V}\bar{\beta}_{0}}$
		$\displaystyle=\frac{\bar{\beta}_{0}^{\prime}\overline{V}\bar{\beta}_{0}}{\bar{\beta}_{0}^{\prime}\overline{V}\bar{\beta}_{0}}=1.$

Thus, $1\leq\|\lambda\|_{1}\leq K-1$ .

From Lemma 1, $\lambda(\omega)$ is a continuous function on the compact interval $[-1/2,1/2]$ . Therefore, it is bounded, which implies $\lambda\in L^{\infty}([-1/2,1/2])$ and consequently $\lambda\in L^{2}([-1/2,1/2])$ , meaning $\|\lambda\|_{2}<\infty$ .

For the lower bound, we apply Jensen’s inequality (or the Cauchy-Schwarz inequality) on the probability space defined by the interval $[-1/2,1/2]$ with length 1:

\|\lambda\|_{2}^{2}=\int_{-1/2}^{1/2}\lambda(\omega)^{2}d\omega\geq\left(\int_{-1/2}^{1/2}\lambda(\omega)d\omega\right)^{2}=\|\lambda\|_{1}^{2}.

Since we established in Part (i) that $\|\lambda\|_{1}\geq 1$ , it strictly follows that $\|\lambda\|_{2}\geq 1$ .

Proof of Lemma 3

Because $P$ is a circulant matrix, it is a normal matrix ( $PP^{\prime}=P^{\prime}P$ ). A fundamental property of circulant matrices is that they are diagonalized by the Discrete Fourier Transform (DFT) matrix. Therefore, the eigenvectors $v_{k}$ are the fixed, orthogonal Fourier basis vectors. This orthogonality allows us to project the multivariate one-hot encoded categorical process $Y_{t}$ into $K$ uncorrelated scalar processes, defined as $Z_{t}^{(k)}=v_{k}^{*}Y_{t}$ .

The conditional expectation of each projected scalar process is exactly governed by its corresponding eigenvalue: $\mathbb{E}[Z_{t+1}^{(k)}\mid Z_{t}^{(k)}]=\gamma_{k}Z_{t}^{(k)}$ . This equation defines a complex Autoregressive Process of order 1 (AR(1)). For such an AR(1) process, the temporal dependence decays geometrically. Consequently, the normalized autocorrelation function at lag $h$ is given exactly by the corresponding eigenvalue raised to the absolute lag: $R_{k}(h)=\gamma_{k}^{|h|}$ .

By the Wiener-Khinchin theorem, the spectral density $f_{k}(\omega)$ of a stationary discrete-time process is the Discrete-Time Fourier Transform (DTFT) of its autocorrelation sequence. Substituting the geometric autocorrelation $R_{k}(h)=\gamma_{k}^{|h|}$ into the Fourier sum yields an explicitly solvable infinite geometric series:

	$\displaystyle f_{k}(\omega)$	$\displaystyle=\sum_{h=-\infty}^{\infty}R_{k}(h)e^{-2\pi i\omega h}$
		$\displaystyle=\sum_{h=-\infty}^{\infty}\gamma_{k}^{\|h\|}e^{-2\pi i\omega h}=\frac{1-\|\gamma_{k}\|^{2}}{\|1-\gamma_{k}e^{-2\pi i\omega}\|^{2}}.$

Using the polar representation of the eigenvalue $\gamma_{k}=r_{k}e^{i\phi_{k}}$ , we can expand the squared norm in the denominator:

|1-r_{k}e^{i(\phi_{k}-2\pi\omega)}|^{2}=1-2r_{k}\cos(2\pi\omega-\phi_{k})+r_{k}^{2}

This yields the explicit polar form for the spectral density: $f_{k}(\omega)=\frac{1-r_{k}^{2}}{1-2r_{k}\cos(2\pi\omega-\phi_{k})+r_{k}^{2}}$ .

The spectral envelope is defined as the supremum of the normalized spectral density over all possible projection vectors $\beta$ : $\lambda(\omega)=\sup_{\beta}\frac{f_{Z}(\omega;\beta)}{\text{Var}(Z)}$ . Because the scalar modes $Z_{t}^{(k)}$ are mutually uncorrelated (their cross-covariance is zero for all lags), the total spectral density of any linear combination is simply the sum of the individual mode densities.

To maximize a weighted average of independent components at any specific frequency $\omega$ , the optimal strategy is to assign all weight to the single component with the largest value. Thus, evaluating the envelope simplifies to finding the point-wise maximum of the individual harmonic densities (excluding the trivial stationary DC component $k=0$ ):

\lambda(\omega)=\max_{k\in\{1,\dots,K-1\}}f_{k}(\omega)

Sequential RC-TGAN: Generating Relational Time Series with Spectral Envelope Loss ††thanks: This work was supported by Mitacs through the Mitacs Accelerate program.