Tigran555 commited on
Commit
bae4479
·
verified ·
1 Parent(s): 51ac59d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -1
README.md CHANGED
@@ -13,4 +13,44 @@ library_name: transformers
13
  tags:
14
  - sql
15
  - text2sql
16
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  tags:
14
  - sql
15
  - text2sql
16
+ ---
17
+
18
+ Model Card: HuggingFace Text2SQL Model (Fine-tuned BART)
19
+
20
+ ### Overview
21
+ - **Model Name**: HuggingFace Text2SQL Model (Fine-tuned BART)
22
+ - **Model Version**: 1.0
23
+ - **Model Objective**: Text-to-SQL Translation
24
+ - **Accuracy on Train Dataset (Text-to-SQL)**: 50.4%
25
+ - **Accuracy on Dev Dataset (Text-to-SQL)**: 49.7%
26
+ - **SQL Accuracy on Dev Dataset**: 51.1%
27
+
28
+ ### Model Description
29
+ The HuggingFace Text2SQL model is based on the BART architecture, fine-tuned for the specific task of translating natural language questions into SQL queries. BART (Bidirectional and Auto-Regressive Transformers) is a transformer-based model originally developed by Facebook AI for various natural language processing tasks, including text generation and summarization. This model was fine-tuned on the SPIDER dataset, which provides pairs of natural language questions and their corresponding SQL queries. Additionally, the model leverages databases for executing the generated SQL queries, enabling it to verify the correctness of the generated queries.
30
+
31
+ ### Intended Use
32
+ This model is intended for applications where there's a need to convert natural language queries into SQL for querying databases. It can be applied in various domains where structured data is accessible via SQL interfaces.
33
+
34
+ #### Limitations
35
+ - **Performance**: While the model demonstrates moderate accuracy in translating natural language questions to SQL queries, the exact SQL accuracy on the development dataset should be provided for a comprehensive understanding of its performance.
36
+ - **Generalization**: The model's ability to generalize to out-of-domain or noisy data may vary. Users should evaluate its performance on their specific datasets and tasks.
37
+ - **SQL Accuracy Metric**: Detailed information about the SQL accuracy metric used for evaluation should be provided to understand the model's performance in generating accurate SQL queries.
38
+
39
+ ### Ethical Considerations
40
+ - **Bias**: Bias in the model's translations, particularly concerning sensitive attributes such as race, gender, or ethnicity, should be carefully monitored and mitigated.
41
+ - **Privacy**: Care should be taken to protect user privacy and data security when translating natural language queries into SQL, especially when sensitive information may be present in the queries or the underlying database.
42
+ - **Fairness**: Fairness considerations should be taken into account to ensure equitable performance across different demographic groups.
43
+
44
+ ### Dataset
45
+ The model was fine-tuned on the SPIDER dataset, which consists of natural language questions paired with their corresponding SQL queries. The dataset was split into training and development sets for fine-tuning and evaluation.
46
+
47
+ ### Evaluation Metrics
48
+ The model's performance was primarily evaluated based on accuracy in translating natural language questions to SQL queries. Additionally, SQL accuracy on the development dataset was considered for a more comprehensive assessment of the model's performance.
49
+
50
+ ### Citation
51
+ If you use or reference this model in your work, please cite the following:
52
+
53
+ [Reference to the original BART model, SPIDER dataset, and any additional sources relevant to the fine-tuning process]
54
+
55
+ ### Disclaimer
56
+ This model is provided as-is, without any warranties or guarantees of performance or suitability for any particular task or application. Users are encouraged to thoroughly evaluate the model's performance and suitability for their specific use case before deployment.