--- library_name: transformers datasets: - tner/mit_movie_trivia base_model: - distilbert/distilbert-base-uncased --- # Model Card for Model ID # DistilBERT NER - MIT Movie Trivia Dataset This model is a fine-tuned version of DistilBERT (distilbert-base-uncased) on the MIT Movie Trivia Dataset for Named Entity Recognition (NER). It is designed to tag movie-related named entities such as actors, directors, characters, genres, and more. ## Model Details Base Model: DistilBERT (distilbert-base-uncased) Dataset: MIT Movie Trivia NER Dataset Fine-tuning Task: Named Entity Recognition (NER) Framework: Hugging Face Transformers Tokenizer: distilbert-base-uncased Total Parameters: ~66M Training Time: Approx. 8 epochs Intended Use This model is intended for Named Entity Recognition (NER) on movie-related text, such as: Identifying actors, directors, characters, genres, release years, etc. Extracting structured information from movie-related questions or reviews. NER Labels (BIO Format) The model uses the BIO (Beginning-Inside-Outside) format for labeling: Tag Description B-Actor Beginning of an actor's name I-Actor Inside an actor's name B-Director Beginning of a director's name I-Director Inside a director's name B-Character_Name Beginning of a character's name I-Character_Name Inside a character's name B-Genre Beginning of a genre I-Genre Inside a genre B-Year Movie release year B-Plot Beginning of a plot description I-Plot Inside a plot description B-Quote Beginning of a movie quote I-Quote Inside a movie quote O Outside any named entity ### Model Description This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated. - **Developed by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Model type:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] - **Finetuned from model [optional]:** [More Information Needed] ### Model Sources [optional] - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses ### Direct Use [More Information Needed] ### Downstream Use [optional] [More Information Needed] ### Out-of-Scope Use [More Information Needed] ## Bias, Risks, and Limitations [More Information Needed] ### Recommendations Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. ## How to Get Started with the Model Use the code below to get started with the model. [More Information Needed] ## Training Details Training Details Optimizer: AdamW Learning Rate: 2e-5 Batch Size: 8 Evaluation Strategy: Epoch-based Loss Function: CrossEntropyLoss ### Training Data [More Information Needed] ### Training Procedure #### Preprocessing [optional] [More Information Needed] #### Training Hyperparameters - **Training regime:** [More Information Needed] #### Speeds, Sizes, Times [optional] [More Information Needed] ## Evaluation ### Testing Data, Factors & Metrics #### Testing Data [More Information Needed] #### Factors [More Information Needed] #### Metrics [More Information Needed] ### Results Which movies did Quentin Tarantino make?? Ans: Entity: quentin tarantino, Label: Director, Score: 1.00 Leonardo DiCaprio starred in which movies? Ans: Entity: leonardo dicaprio, Label: Actor, Score: 1.00 #### Summary ## Model Examination [optional] [More Information Needed] ## Environmental Impact Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). - **Hardware Type:** [More Information Needed] - **Hours used:** [More Information Needed] - **Cloud Provider:** [More Information Needed] - **Compute Region:** [More Information Needed] - **Carbon Emitted:** [More Information Needed] ## Technical Specifications [optional] ### Model Architecture and Objective [More Information Needed] ### Compute Infrastructure [More Information Needed] #### Hardware [More Information Needed] #### Software [More Information Needed] ## Citation [optional] **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional] [More Information Needed] ## More Information [optional] [More Information Needed] ## Model Card Authors [optional] [More Information Needed] ## Model Card Contact [More Information Needed]