Core Concepts

QILaplace.jl maps length-$N$ signals to tensor networks so you can run Fourier- and Laplace-family transforms without materializing dense arrays. This page provides the minimum intuition and practical choices. For step-by-step usage, start with the tutorials.

Why it matters

Work with $N=2^n$ signals without $O(N)$ memory.
Control accuracy with explicit truncation (cutoff, maxdim).
Extract coefficients directly from compressed states.

Matrix Product States (MPS) and Operators (MPO)

At its core, QILaplace.jl does not treat a signal as a long, flat array of numbers. Instead, it reshapes that data into a high-dimensional tensor and compresses it into an MPS.

What are MPS and MPO?

A Matrix Product State (MPS) is a mathematical factorization that breaks down a massive tensor into a chain of smaller, interconnected local tensors. While originally popularized in quantum physics to represent many-body wavefunctions, in signal processing, an MPS serves as a highly efficient compressed data structure.

Efficiency: A signal of size $N = 2^n$ usually requires $O(N)$ memory. An MPS can represent the same signal using only $O(n \cdot \chi^2)$ parameters, where $\chi$ is the bond dimension.
Correlation: The bond dimension $\chi$ represents the "information complexity" or the amount of correlation between different segments of the signal. For many physical and mathematical signals, $\chi$ remains small even as the signal size $N$ grows exponentially.

A Matrix Product Operator (MPO) is the operator equivalent of an MPS. It allows us to apply linear transformations—such as the Fourier or Laplace transform—directly to the compressed MPS without ever needing to decompress the data back into a flat array.

Compressing Data: The Quantics Representation

The bridge between a standard 1D signal and a compressed MPS is a process called Binary Encoding (often referred to in literature as the Quantics representation).

Binary-encoding of Signal

To process a signal of length $N = 2^n$, we treat the index of each data point as a binary string of length $n$. We then reshape the vector into an $n$-dimensional tensor of shape $(2, 2, \dots, 2)$.

In QILaplace.jl, we follow a big-endian encoding convention. For a signal vector $x$, the index $j$ is represented as a bit-string $j = (b_1, b_2, \dots, b_n)$.

This mapping is such that:

Slow index $b_1$ is assigned to the first tensor, representing the "coarse" global structure of the signal.
Fast index $b_n$ is assigned to the last tensor, capturing the "fine" high-frequency details.

This big-endian layout ensures that long-range correlations in the signal map directly to the physical structure of the tensor chain. In the animation below, you can see an array of 8 entries ($x_0$ to $x_7$) being reshaped into a 3D tensor. By following the $j = (b_1, b_2, b_3)$ indexing, every element in this high-dimensional space corresponds to a unique point in the original signal.

MPS Conversion Algorithms

Transforming the "tensorized" signal into a compressed MPS is the most computationally intensive phase of the entire pipeline. This stage serves as the primary bottleneck because it requires decomposing the exponential information of the full signal into a chain of local correlations. Regardless of the specific algorithm used, the conversion process relies on the decay of Singular Values. As we decompose the signal, we encounter a spectrum of singular values at each bond connecting the tensor sites.

Compression: We "truncate" the representation by keeping only the most significant singular values.
The Threshold ($\tau$): Users can define a relative cutoff $\tau$. Any singular value smaller than this threshold is discarded.
Accuracy-Compression Trade-off: A smaller $\tau$ leads to higher fidelity but larger bond dimensions ($\chi$), while a larger $\tau$ achieves massive compression at the cost of some numerical precision.

To perform this decomposition, QILaplace.jl provides two primary algorithms, each meticulously optimized for different hardware constraints and signal complexities.

Quick Decision Rules for MPS Compression

Use method=:rsvd for large $n$ or exploratory runs. Switch to :svd when you need exact singular values at smaller $n$.
Start with cutoff=1e-10 to 1e-12 for high accuracy. For faster, smaller MPS, you can relax to 1e-6 to 1e-8.
Set maxdim to limit worst-case bond dimension growth. If you hit the cap and errors are too large, raise maxdim or lower cutoff.
For small $n$, sanity-check by comparing a few coefficient samples against dense references and monitoring maxbond growth.

1. The Standard SVD (Sequential Sweep)

The standard approach involves a sequential sweep from one end of the tensor chain to the other.

Mechanism: At each site, the algorithm performs a Singular Value Decomposition (SVD) to split the current tensor into a local MPS site and a remainder that is passed to the next site.
Cost: The bottleneck occurs at the central bond of the chain (where the matricized tensor is largest). For an $n$-qubit system, the complexity at the center is $O(2^{3n/2})$, as it requires computing the full singular value spectrum of a $2^{n/2} \times 2^{n/2}$ matrix in the worst case.
Best for: Small to medium $n$ where the exact spectrum is needed for high-fidelity representation.

2. Randomized SVD (Divide and Conquer)

The Randomized SVD (RSVD) algorithm uses a "divide-and-conquer" strategy to significantly speed up the conversion.

Mechanism: Instead of sweeping linearly, RSVD divides the tensor at the middle into a "left" and "right" block. It then "conquers" by iteratively splitting these blocks into single-site tensors.
Approximation: Unlike standard SVD, RSVD finds an approximation of the top-$k$ singular values using random projections. This avoids the need to find the full spectrum.
Cost: By targeting only the relevant $k$ singular values (where $k \approx \chi$), the complexity at the central bond is reduced to $O(k \cdot 2^{n})$ in the worst case, offering a massive speedup for large signals.
Best for: Large-scale signals (e.g., $n > 20$) where the signal is known to have a low rank ($k \ll N$).

This dual-algorithm approach allows QILaplace.jl to balance the trade-off between absolute numerical precision and the ability to process exponentially large data sets on standard hardware.

Quantum-Inspired Signal Processing

The "Quantum-Inspired" nature of this library is defined by how we construct our operators. In traditional quantum computing, a circuit is a sequence of unitary gates acting on qubits. In QILaplace.jl, we reinterpret these circuits as Matrix Product Operators (MPOs) that perform linear transformations directly on an MPS.

From Circuits to Compressed Operators

By "zipping" circuit gates into an MPO, we can compress an entire transformation into a compact form with low bond dimension.

Unitary Transforms ($Q\hat{F}T$): The Quantum Fourier Transform is the foundation of this approach. As introduced by Chen, Stoudenmire, and White (2023), the QFT circuit can be highly compressed into an MPO where the bond dimension does not increase with the number of qubits $n$. This allows spectral analysis on exponentially large data in logarithmic time.
Non-Unitary Transforms ($\hat{DT}$): QILaplace.jl extends this by incorporating non-unitary maps. The Discrete Laplace Transform requires exponential damping, which we implement via a Damping Transform ($\hat{DT}$). This circuit uses non-unitary damping gates that are likewise compressed into an efficient MPO.

By operating on classical hardware, we gain a unique "digital advantage." We can merge the $\hat{DT}$ and $Q\hat{F}T$ into a single, combined $z\hat{T}$ MPO. This unified operator allows us to probe the complex $z$-plane and identify the poles and zeros of a signal at scales reaching $M = 2^{60}$ points, far exceeding the limits of traditional FFT-based methods.

Check out the Tutorials to get hands-on with constructing these MPS and MPOs, and refer to the Benchmarking page to see the actual performance results verified on a MacBook M2 Pro and reproducible on your own hardware.

Why Quantum-Inspired?

We borrow the "algorithmic structure" of quantum gates but implement them as compressed classical tensors. This allows us to execute non-unitary maps that are often difficult for real quantum computers to handle, while maintaining the exponential scaling benefits of quantum algorithms.

Quantum Fourier Transform Circuit

The Quantum Fourier Transform (QFT) is one of the many foundational quantum algorithms that demonstrate an exponential speedup. While the classical FFT requires $O(N \log N)$ operations for a signal of size $N$, the QFT algorithm theoretically operates in $O(\log^2 N)$ time. In our context, we don't execute gates on a physical device; instead, we represent the entire circuit structure as a single, static MPO.

To turn the QFT circuit into an efficient MPO, we use the Zip-Up algorithm followed by a sweep of the Orthogonality Center (OC) with truncation, as introduced by Chen, Stoudenmire, and White (2023). Instead of multiplying all gates together (which would lead to an exponentially large matrix), the algorithm "zips" the gates into a chain of local tensors and moves the OC cleverly while truncating so that the changes are global.

The process follows a specific sequence as seen in the animation below:

Gate Combination: Tensors representing individual gates are combined sequentially across the qubits. These gates lie in the same qubit line.

SVD with Truncation: At each step, a Singular Value Decomposition (SVD) is performed on the bonds. This identifies the "entanglement" the operator introduces between sites and zips up two adjacent tensor trains. Since at every point in the algorithm, the SVD is performed at the OC, any truncations performed here will not affect the global properties of the operator upto the defined tolerance.

Orthogonality Center Sweep: We move down the chain to set the OC to the last site and repeat the process of SVD with truncation on the next controlled-phase gate.

Theoretical Guarantee: The efficiency of this compression relies on the fact that the singular values $\sigma_k$ for the QFT and Laplace operators decay exponentially with distance. Specifically, the singular values at a bond often follow the relationship:

\[\sigma_k \sim e^{-\alpha k}\]

where $k$ is the index of the singular value and $\alpha > 0$ is a decay constant. This ensure that our QFT circuit is represented in an MPO that does not scale with system size. Without this guarantee, we could attempt to compress any arbitrary quantum circuit, but the bond dimensions would grow uncontrollably with qubit size and circuit depth.

Damping Transform Circuit

While the QFT handles the oscillatory (phase) components of a signal, the discrete Laplace transform also requires exponential damping. To capture this, QILaplace.jl introduces the Damping Transform ($\hat{DT}$).

Unlike the QFT algorithm, the Damping Transform is a non-unitary circuit. This fundamental difference changes how we approach the compression process. For a unitary circuit, the intermediate zip-up steps can naturally maintain a canonical form, allowing for safe truncation. However, because the damping gates are non-unitary, zipping them together does not inherently construct a canonical MPO at each step. If we attempted to truncate the bonds at these intermediate, non-canonical steps, we would introduce uncontrolled global errors and the resulting bond dimension would not correspond to a properly minimized state.

Furthermore, because the real exponent in the Laplace transform lacks the periodicity of the Fourier transform, the $\hat{DT}$ requires a specialized paired-register layout (occupying $2n$ qubits). By encoding the signal on two registers ($|j\rangle|j'\rangle$, where $j=j'$), the second register serves as a static copy that provides the necessary controls for all bits $m$ ($m<l$ and $m>l$) without the exponential bond dimension growth associated with long-range correlations in a single register.

To combine non-unitary controlled damping gates (and the local damping $H_d$ operations) into a well-behaved MPO, we rely on a specialized two-pass algorithm:

QR Zipping (Forward Pass): As we combine the gates, we perform sequential QR decompositions to move the Orthogonality Center (OC) from one end of the chain to the other. Crucially, we do not perform any truncation at this stage. The QR sweep ensures the MPO is placed into a strict canonical form.

SVD Truncation (Backward Pass): Once the correct canonical form is established, we perform an SVD sweep to move the OC back to the initial site. Because the MPO is now canonical, we can safely apply truncations based on the singular values.

In the animation below, you can watch the sequential QR and SVD sweeps compress the damping circuit into a canonical MPO. The non-unitary gates are marked as hatched squares. As you watch, keep track of the following elements:

Site $T_i$: Represents the Orthogonality Center (OC).
Matrix $Q_i$: The matrices generated during the QR decomposition sweep.
Isometries $U_i$ and $V_i$: The left and right isometries generated during the SVD truncation sweep. The left isometries are colored yellow, while the right isometries are colored purple.
Red Bonds: Untruncated, intermediate bonds that grow large during the QR pass.
Blue Bonds: Truncated bonds that are compressed during the SVD pass.

Theoretical Guarantee

The efficiency of this compression relies on the fact that the singular values $\sigma_k$ for these non-unitary operators also decay exponentially with distance, similar to the QFT. For more details, refer to the original paper Noufal Jaseem et al. (2026). As a result, the final compressed MPO is guaranteed to have a bond dimension that saturates with system size for a given error threshold, leading to highly efficient and well-behaved memory scaling.

The Discrete Laplace Transform

The discrete Laplace transform (or $z$-transform) is achieved by the sequential application of the damping and phase transformations. In the quantum-inspired framework of QILaplace.jl, this is formulated as:

\[z\hat{T} \equiv Q\hat{F}T \circ \hat{DT}\]

The transformation is executed on the paired-register layout $|j\rangle |j'\rangle$:

The Damping Stage ($\hat{DT}$): The Damping Transform implements the real-valued exponential damping of the signal ($e^{-(\omega_r k / N) j}$), which corresponds to the radial factors ($r_k^{-j}$) in the transform. This is achieved by applying the local damping operators $H_d$ and controlled-damping gates to the main register ($|j\rangle$). During this phase, the copy register ($|j'\rangle$) serves as a control, facilitating the necessary interactions for the non-periodic damping gates without increasing bond complexity.
The Phase Stage ($Q\hat{F}T$): The Quantum Fourier Transform is then applied to the copy register ($|j'\rangle$). This stage implements the oscillatory components ($e^{-i(\omega_i l / N) j'}$) of the transform, using standard unitary Hadamard ($H$) and controlled-phase ($P_{lm}$) gates.

By composing these two MPOs and performing a final truncation sweep, we produce a single, highly compressed $z\hat{T}$ MPO. This operator allows for the direct evaluation of a signal on a dense polar grid in the $z$-plane, enabling efficient pole identification and system analysis for signals of unprecedented scale.