Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

enhance Spider benchmarking #154

Open
wants to merge 21 commits into
base: main
Choose a base branch
from

Conversation

minhyeong112
Copy link
Contributor

This PR enhances Spider evaluation capabilities.

Key Changes:

  • Added Spider evaluation implementation
  • Enhanced SQL schema selection and SQLite connector
  • Improved database result handling with better error reporting
  • Added query normalization and validation utilities
  • Updated environment variable names for consistency
  • Expanded database engine support

- Add evaluation utilities for Spider dataset benchmarking
- Implement SQLite connector for Spider database support
- Update schema selection and query generation prompts
- Add evaluation notebook with benchmarking results
- Update dependencies in pyproject.toml files
- Update SQL connectors in text_2_sql_core
- Enhance AutoGen agents for parallel query solving
- Update schema selection agents
- Format code with black and fix linting issues
- Update dependencies
- Resolve merge conflicts in autogen_text_2_sql.py
- Update database result handling in parallel_query_solving_agent.py
- Standardize message handling in sql_schema_selection_agent.py
- Enhance evaluation utils with query normalization and validation
- Update environment variable names in inner_autogen_text_2_sql.py
- Expand database engine support in database.py
- Apply black formatting to all modified files
Applied black code formatting to:
- llm_model_creator.py
- open_ai.py
- sql.py
- sqlite_sql.py
- sql_schema_selection_agent.py
- data_dictionary_creator.py
- sqlite_data_dictionary_creator.py
- create_spider_schema.py
Added evaluation functionality using Spider benchmark dataset to assess text-to-sql performance
@minhyeong112 minhyeong112 requested a review from a team as a code owner January 31, 2025 04:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants