基于图转换器的时序预测

基于图转换器的时序预测
Time Series Forecasting with Graph Transformers

原始链接: https://kumo.ai/research/time-series-forecasting/

这篇博文探讨了图结构数据的时序预测，利用现实世界数据集中（通常存储在关系数据库中）数据之间相互关联的特性。提出的端到端流程利用图Transformer来预测图中特定节点的时序，从整个图结构中提取信号，而这些信号可以使用关系深度学习 (RDL) 从关系数据库中自动提取。核心框架根据过去的时序数据、日期时间编码、日历特征和图实体编码进行预测，后者通过图Transformer或图神经网络获得。它强调了捕捉时间和关系依赖性的重要性。博文还比较了传统的基于回归的预测方法和使用条件扩散模型的生成方法。虽然回归的目标是点预测，并且存在均值坍塌的问题，但生成模型可以生成未来值的分布，从而实现概率推理，捕捉高频细节，并在无需对数据分布进行先验假设的情况下量化不确定性。博文展示了与基线方法相比有前景的初步结果。

Hacker News上的一篇帖子讨论了Kumo.ai的博文“使用图神经网络进行时间序列预测”。 cye131批评了这种方法，认为图神经网络不适合时间序列预测，并且这家初创公司在推广其销售的技术时存在利益冲突。rusty1s，这篇论文的作者之一，澄清说该模型关注的是通过将历史数据与以图结构组织的外部信息相结合来丰富预测，而不是仅仅依赖于图神经网络。他们强调了模块化架构和学术同行评审。其他用户讨论了Facebook Prophet作为基准的实用性，一些人称赞它处理季节性的能力，另一些人则推荐其他方法，如L1正则化自回归特征。也有人批评该网站具有侵入性的滚动行为。最后，一个用户质疑这项技术在股市上赚取数十亿美元的潜力，而另一个用户认为其价值在于简化ETL、特征工程和模型设计流程，潜在地减少了企业的劳动力需求。

原文

Time series forecasting is a cornerstone in modern business analytics, whether it is concerned with anticipating market trends, user behavior, optimizing resource allocation, or planning for future growth. As such, a wide range of different approaches have been introduced and investigated for forecasting, lately data-driven approaches using machine learning and generative models.

This blog post will dive into forecasting on graph structured entities, e.g., as obtained from a relational database, utilizing not only the individual time series as signal but also related information. As most of the world’s data is stored in relational structures, this topic is of particular interest for real world applications. We describe an end-to-end pipeline to perform forecasting using graph transformers and specifically discuss predictive vs. generative paradigms.

Forecasting on Graph-structured Data

Forecasting is the process of making predictions about future events based on historical data and current observations, requiring detecting patterns, trends, and seasonal variations.

Traditional forecasting methods often treat time series data in isolation, focusing solely on temporal patterns within a single sequence. However, in real-world applications, valuable predictive signals often exist in related data sources. For instance, when forecasting product sales, factors such as marketing campaigns, competitor pricing, or regional economic indicators can significantly impact the accuracy of predictions. Graphs are a natural structure to represent such inter-connected data sources. They represent a set of inter-connected nodes of different entities, where some entities can have time series that can be forecasted. Each node can potentially hold a variety of features that hold important signal for forecasting tasks on other nodes. Further, they lend themselves to a wide arrange of machine learning methods, e.g., Graph Transformers.

A prominent option for obtaining graphs directly from an underlying business problem on a relational database is Relational Deep Learning (RDL), which automatically discovers and utilizes cross-table relationships and data in connected tables. The RDL scheme allows to automatically extract a graph structure from the relational database, allowing us to treat timeseries forecasting as a graph learning task. We will use the graph obtained via RDL as an example below. However, our graph forecasting techniques are not limited to graphs obtained via RDL but can be applied on arbitrary forecasting tasks where time series have to be forecasted for a subset of graph nodes.

Example. Consider the task of forecasting the sales per day for all products stored in a product table (yellow). Further tables containing transactions (blue), customers (green), product marketing (red) can provide additional signals that help solving the task. Using the RDL scheme, we can automatically transform the relational tables into a graph with node features. Then, the task is to perform forecasting on the subset of product nodes via graph machine learning.

Notation. We denote the input to our graph forecasting task as a graph