Population Aware Diffusion for Time Series Generation
Abstract
Diffusion models have shown promising ability in generating high-quality time series (TS) data. Despite this initial success, existing works mostly focus on the authenticity of data at the individual level and pay less attention to preserving the population-level properties of the entire dataset. Such population-level properties include the value distribution of each dimension and the distributions of certain functional dependencies (e.g., cross-correlation, CC) between different dimensions. For instance, when generating house energy consumption TS data, the value distributions of the outside temperature and the kitchen temperature should be preserved, as well as the distribution of CC between them. Preserving such TS population-level properties is critical for maintaining the statistical insights of the datasets, mitigating model bias, and augmenting downstream tasks like TS prediction. Yet it is often overlooked by existing models, so the data they generate often exhibit distribution shifts relative to the original data. We propose Population-aware Diffusion for Time Series (PaD-TS), a new TS generation model that better preserves the population-level properties. The key novelties of PaD-TS include 1) a new training method that explicitly incorporates TS population-level property preservation, and 2) a new dual-channel encoder model architecture that better captures the TS data structure. Empirical results on major benchmark datasets show that PaD-TS can improve the average CC distribution shift score between real and synthetic data by 5.9x while maintaining performance comparable to state-of-the-art models on individual-level authenticity.
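To make the notion of a population-level CC distribution concrete, here is a minimal Python sketch. It is not the PaD-TS implementation or its evaluation metric. It assumes data shaped `(n_samples, seq_len, n_dims)`, uses lag-zero Pearson correlation as a stand-in for CC, and compares the real and synthetic CC distributions with a simple sorted-value gap (the 1-D Wasserstein-1 distance for equal-size samples). The function names `per_sample_cc` and `sorted_gap` and the toy data are hypothetical.

```python
import numpy as np


def per_sample_cc(data, i, j):
    """Lag-zero Pearson correlation between dimensions i and j, one value per series.

    data has shape (n_samples, seq_len, n_dims). This is an illustrative
    stand-in for cross-correlation, not the paper's exact definition.
    """
    x = data[:, :, i] - data[:, :, i].mean(axis=1, keepdims=True)
    y = data[:, :, j] - data[:, :, j].mean(axis=1, keepdims=True)
    denom = np.sqrt((x ** 2).sum(axis=1) * (y ** 2).sum(axis=1))
    # Guard against division by zero for constant series.
    return (x * y).sum(axis=1) / np.maximum(denom, 1e-12)


def sorted_gap(a, b):
    """Mean absolute difference of sorted values.

    For two equal-size 1-D samples this equals the empirical
    Wasserstein-1 distance between them.
    """
    a, b = np.sort(a), np.sort(b)
    if a.shape != b.shape:
        raise ValueError("samples must have the same size")
    return float(np.mean(np.abs(a - b)))


# Toy data (assumption): "real" series have two strongly correlated
# dimensions, while "synthetic" series have independent dimensions.
rng = np.random.default_rng(0)
n_samples, seq_len = 200, 24
base = rng.normal(size=(n_samples, seq_len))
real = np.stack([base, base + 0.3 * rng.normal(size=(n_samples, seq_len))], axis=-1)
synthetic = rng.normal(size=(n_samples, seq_len, 2))

cc_real = per_sample_cc(real, 0, 1)
cc_syn = per_sample_cc(synthetic, 0, 1)

shift = sorted_gap(cc_real, cc_syn)        # large: CC distribution not preserved
self_shift = sorted_gap(cc_real, cc_real)  # zero: identical distributions
print(f"CC distribution shift (real vs. synthetic): {shift:.3f}")
```

In this toy setup each synthetic series can look plausible on its own, yet the population of CC values collapses toward zero, which is the kind of distribution shift the abstract describes.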
Community
AAAI-25
Github: https://github.com/wmd3i/PaD-TS
We are excited to share our new TS generation model that achieves state-of-the-art performance in population-level property preservation and comparable individual-level authenticity.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- FM-TS: Flow Matching for Time Series Generation (2024)
- UTSD: Unified Time Series Diffusion Model (2024)
- Clustering Time Series Data with Gaussian Mixture Embeddings in a Graph Autoencoder Framework (2024)
- LoyalDiffusion: A Diffusion Model Guarding Against Data Replication (2024)
- On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models (2024)
- Understanding and Mitigating Memorization in Diffusion Models for Tabular Data (2024)
- Learning Latent Spaces for Domain Generalization in Time Series Forecasting (2024)