BEAVER: An Enterprise Benchmark for Text-to-SQL

Leaderboard

How to Submit:

Please send an email to peterbc@mit.edu, along with your method name, a brief description of the method, and, optionally, a link to your paper or codebase. We will follow up with detailed instructions.

Method

Model

Rank	Submission Date	Method	Model	Execution Accuracy

Subtask Metrics Across Settings

Citation

If you find our data, code, or the paper helpful, please cite the paper:

@article{chen2024beaver,
  title={BEAVER: an enterprise benchmark for text-to-sql},
  author={Chen, Peter Baile and Yang, Devin and Li, Weiyue and Wenz, Fabian and Zhang, Yi and Tatbul, Nesime and Cafarella, Michael and Demiralp, {\c{C}}a{\u{g}}atay and Stonebraker, Michael},
  journal={arXiv preprint arXiv:2409.02038},
  year={2024}
}

BEAVER is a large-scale enterprise text-to-SQL dataset containing 9128 queries spanning 812 tables across 19 diverse domains. Of these, 7978 queries are publicly released, while the remaining portion is held out as a private test set. Queries and databases were collected from private organizations.
To facilitate fine-grained evaluation and analysis, we provide

annotations for five subtasks: multi-table retrieval, join key detection, column mapping, domain knowledge extraction, and query decomposition
three categories of queries: complex queries without domain knowledge, domain-specific queries with minimal complexity, and domain-specific complex queries

Example data

Representative BEAVER tasks with question, SQL, and subtask annotations.

db source

Question

SQL

Multi-Table retrieval

Tables used in SQL

Join Keys

Connections among used tables

Column Mapping

Mapping from question phrases to table columns(s)

Domain knowledge

Domain-specific predictates used in SQL

Subquery Decomposition

Decomposition of SQL into simpler sub-queries

Citation

If you find our data, code, or the paper helpful, please cite the paper:

article{chen2024beaver,
  title={BEAVER: an enterprise benchmark for text-to-sql},
  author={Chen, Peter Baile and Yang, Devin and Li, Weiyue and Wenz, Fabian and Zhang, Yi and Tatbul, Nesime and Cafarella, Michael and Demiralp, {\c{C}}a{\u{g}}atay and Stonebraker, Michael},
  journal={arXiv preprint arXiv:2409.02038},
  year={2024}
}

🦫 BEAVER: An Enterprise Benchmark for Text-to-SQL

Leaderboard

How to Submit:

Subtask Metrics Across Settings

Citation

Example data

Question

SQL

Multi-Table retrieval

Tables used in SQL

Join Keys

Connections among used tables

Column Mapping

Mapping from question phrases to table columns(s)

Domain knowledge

Domain-specific predictates used in SQL

Subquery Decomposition

Decomposition of SQL into simpler sub-queries

Citation