enhance Spider benchmarking #154

minhyeong112 · 2025-01-31T04:21:02Z

This PR enhances Spider evaluation capabilities.

Key Changes:

Added Spider evaluation implementation
Enhanced SQL schema selection and SQLite connector
Improved database result handling with better error reporting
Added query normalization and validation utilities
Updated environment variable names for consistency
Expanded database engine support

- Add evaluation utilities for Spider dataset benchmarking - Implement SQLite connector for Spider database support - Update schema selection and query generation prompts - Add evaluation notebook with benchmarking results - Update dependencies in pyproject.toml files

…valuation

- Update SQL connectors in text_2_sql_core - Enhance AutoGen agents for parallel query solving - Update schema selection agents - Format code with black and fix linting issues - Update dependencies

- Resolve merge conflicts in autogen_text_2_sql.py - Update database result handling in parallel_query_solving_agent.py - Standardize message handling in sql_schema_selection_agent.py - Enhance evaluation utils with query normalization and validation - Update environment variable names in inner_autogen_text_2_sql.py - Expand database engine support in database.py - Apply black formatting to all modified files

Applied black code formatting to: - llm_model_creator.py - open_ai.py - sql.py - sqlite_sql.py - sql_schema_selection_agent.py - data_dictionary_creator.py - sqlite_data_dictionary_creator.py - create_spider_schema.py

Added evaluation functionality using Spider benchmark dataset to assess text-to-sql performance

…ewrite_agent.yaml content

minhyeong112 added 18 commits January 8, 2025 04:28

feat: Improved SQL schema selection and SQLite connector for Spider e…

cbc1435

…valuation

style: Fix trailing whitespace issues

65a7a90

style: Fix JSON formatting in Jupyter notebook

40c0f60

style: Apply black formatting to Python files

623b601

style: Apply Ruff fixes

13d6129

docs: Update Spider dataset and test suite download instructions

d628225

style: Fix JSON formatting in notebook

36fdd69

refactor: improve SQL connectors and agents for spider evaluation

7cd4aab

- Update SQL connectors in text_2_sql_core - Enhance AutoGen agents for parallel query solving - Update schema selection agents - Format code with black and fix linting issues - Update dependencies

feat: Add spider evaluation changes and schema improvements

ec2dd2b

chore: resolve merge conflicts with upstream/main

24ab162

style: fix trailing whitespace and formatting issues

33f016a

style: apply black formatting to remaining files

62201f6

Applied black code formatting to: - llm_model_creator.py - open_ai.py - sql.py - sqlite_sql.py - sql_schema_selection_agent.py - data_dictionary_creator.py - sqlite_data_dictionary_creator.py - create_spider_schema.py

style: apply ruff fixes to improve code quality

787175f

fix: resolve merge conflicts in uv.lock

4e6a379

chore: update dependencies in uv.lock

69a5b2e

feat: Add Spider evaluation for text-to-sql solution

e107868

Added evaluation functionality using Spider benchmark dataset to assess text-to-sql performance

minhyeong112 requested a review from a team as a code owner January 31, 2025 04:21

BenConstable9 and others added 3 commits January 31, 2025 15:30

Merge branch 'main' into spider-eval

e902e14

Add .env.example files, update .gitignore, and restore user_message_r…

47fbd83

…ewrite_agent.yaml content

Implement shared schema cache using Azure Cognitive Search

1e2f381

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enhance Spider benchmarking #154

enhance Spider benchmarking #154

minhyeong112 commented Jan 31, 2025

enhance Spider benchmarking #154

Are you sure you want to change the base?

enhance Spider benchmarking #154

Conversation

minhyeong112 commented Jan 31, 2025