ProvSQL：一个用于追踪数据来源与概率的通用系统 (ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data) - 专知论文

会员服务 ·

0

系统 · 概率 · PostgreSQL · 数据库 · 数据库系统 ·

ProvSQL: A General System for Keeping Track of the Provenance and Probability of Data

翻译：ProvSQL：一个用于追踪数据来源与概率的通用系统

Aryak Sen,Silviu Maniu,Pierre Senellart

We present the data model, design choices, and performance of ProvSQL, a general and easy-to-deploy provenance tracking and probabilistic database system implemented as a PostgreSQL extension. ProvSQL's data and query models closely reflect that of a large core of SQL, including multiset semantics, the full relational algebra, and aggregation. A key part of its implementation relies on generic provenance circuits stored in memory-mapped files. We propose benchmarks to measure the overhead of provenance and probabilistic evaluation and demonstrate its scalability and competitiveness with respect to other state-of-the-art systems.

翻译：本文介绍了ProvSQL的数据模型、设计选择与性能表现，该系统作为一个PostgreSQL扩展实现，是一个通用且易于部署的来源追踪与概率数据库系统。ProvSQL的数据与查询模型紧密反映了SQL核心的大部分特性，包括多重集语义、完整的关系代数以及聚合操作。其实现的关键部分依赖于存储在内存映射文件中的通用来源电路。我们提出了用于衡量来源追踪与概率评估开销的基准测试，并展示了该系统相较于其他前沿系统在可扩展性与竞争力方面的表现。

0

相关内容

DeepSeek模型综述：V1 V2 V3 R1-Zero

DeepSeek模型综述：V1 V2 V3 R1-Zero

专知会员服务

116+阅读 · 2月11日

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

专知会员服务

28+阅读 · 2022年2月20日

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

专知会员服务

27+阅读 · 2020年7月24日

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

专知会员服务

195+阅读 · 2020年5月31日

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

专知会员服务

11+阅读 · 2019年11月2日

NLG任务评价指标BLEU与ROUGE

NLG任务评价指标BLEU与ROUGE

AINLP

21+阅读 · 2020年5月25日

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图与推荐

10+阅读 · 2020年3月28日

PointNet系列论文解读

PointNet系列论文解读

人工智能前沿讲习班

17+阅读 · 2019年5月3日

误差反向传播——CNN

误差反向传播——CNN

统计学习与视觉计算组

30+阅读 · 2018年7月12日

SSD: Single Shot MultiBox Detector 深度学习笔记之SSD物体检测模型

SSD: Single Shot MultiBox Detector 深度学习笔记之SSD物体检测模型

AI研习社

18+阅读 · 2017年8月31日

“模块化自组装”DNA计算模型的研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于犹豫模糊语言信息的定性决策理论与方法

国家自然科学基金

2+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

面向时空变化的GIS数据模型

国家自然科学基金

6+阅读 · 2014年12月31日

DyFuLM: An Advanced Multimodal Framework for Sentiment Analysis

Arxiv

0+阅读 · 12月1日

LLaMCAT: Optimizing Large Language Model Inference with Cache Arbitration and Throttling

Arxiv

0+阅读 · 11月26日

ViMoNet: A Multimodal Vision-Language Framework for Human Behavior Understanding from Motion and Video

Arxiv

0+阅读 · 11月16日

LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure

Arxiv

0+阅读 · 11月10日

MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving

Arxiv

0+阅读 · 11月4日

VIP会员

文章信息

相关主题

数据库系统

相关VIP内容

DeepSeek模型综述：V1 V2 V3 R1-Zero

DeepSeek模型综述：V1 V2 V3 R1-Zero

专知会员服务

116+阅读 · 2月11日

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

Artificial Intelligence: Ready to Ride the Wave? BCG 28页PPT

专知会员服务

28+阅读 · 2022年2月20日

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

复杂的序列数据分析：现有算法的系统文献综述，Complex Sequential Data Analysis: A Systematic Literature Review of Existing Algorithms

专知会员服务

27+阅读 · 2020年7月24日

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

专知会员服务

195+阅读 · 2020年5月31日

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

专知会员服务

11+阅读 · 2019年11月2日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

NLG任务评价指标BLEU与ROUGE

NLG任务评价指标BLEU与ROUGE

AINLP

21+阅读 · 2020年5月25日

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图机器学习 2.2-2.4 Properties of Networks, Random Graph

图与推荐

10+阅读 · 2020年3月28日

PointNet系列论文解读

PointNet系列论文解读

人工智能前沿讲习班

17+阅读 · 2019年5月3日

误差反向传播——CNN

误差反向传播——CNN

统计学习与视觉计算组

30+阅读 · 2018年7月12日

SSD: Single Shot MultiBox Detector 深度学习笔记之SSD物体检测模型

SSD: Single Shot MultiBox Detector 深度学习笔记之SSD物体检测模型

AI研习社

18+阅读 · 2017年8月31日

相关论文

DyFuLM: An Advanced Multimodal Framework for Sentiment Analysis

Arxiv

0+阅读 · 12月1日

LLaMCAT: Optimizing Large Language Model Inference with Cache Arbitration and Throttling

Arxiv

0+阅读 · 11月26日

ViMoNet: A Multimodal Vision-Language Framework for Human Behavior Understanding from Motion and Video

Arxiv

0+阅读 · 11月16日

LLMServingSim2.0: A Unified Simulator for Heterogeneous Hardware and Serving Techniques in LLM Infrastructure

Arxiv

0+阅读 · 11月10日

MetAdv: A Unified and Interactive Adversarial Testing Platform for Autonomous Driving

Arxiv

0+阅读 · 11月4日

相关基金

“模块化自组装”DNA计算模型的研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于犹豫模糊语言信息的定性决策理论与方法

国家自然科学基金

2+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

面向时空变化的GIS数据模型

国家自然科学基金

6+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员