Service Unavailable

#158
by Daswahre - opened

Hi, I just wanted to use this with transformers, but now I always get a 503 response, and the test widget here says the service is unavailable. I just wanted to ask whether this is an error on my side or whether something happened on yours.

Hello. A 503 error means the service is unavailable right now (codes 500 through 599 are server errors), so it's not an error on your side. I am also getting it.
Does anyone know how long it usually takes for a model like this one to go back to a warm state? I hope this gets solved soon. Thank you.
(https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.3 can also be used, and is working well at the time of writing this.)
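
For reference, here is a minimal sketch of calling the serverless Inference API with a retry on 503, assuming an `HF_TOKEN` environment variable holds your access token. The `wait_for_model` option asks the API to hold the request open until the model is loaded instead of failing immediately:

```python
import os
import time

import requests

API_URL = "https://api-inference.huggingface.co/models/mistralai/Mistral-7B-Instruct-v0.3"
HEADERS = {"Authorization": f"Bearer {os.environ['HF_TOKEN']}"}

payload = {
    "inputs": "[INST] What does a 503 status code mean? [/INST]",
    # Ask the API to wait for the model to load instead of returning 503 right away.
    "options": {"wait_for_model": True},
}

for attempt in range(5):
    response = requests.post(API_URL, headers=HEADERS, json=payload, timeout=300)
    if response.status_code == 503:
        # The 503 body usually includes an "estimated_time" hint in seconds.
        time.sleep(response.json().get("estimated_time", 30))
        continue
    response.raise_for_status()
    print(response.json())
    break
```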

To see which models are currently warm, you can use this query: https://huggingface.co/models?inference=warm&pipeline_tag=text-generation
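
A sketch of the same query against the Hub's REST API, assuming the `/api/models` endpoint accepts the same `inference` filter as the website URL above:

```python
import requests

resp = requests.get(
    "https://huggingface.co/api/models",
    # "inference" is assumed to mirror the website filter; "pipeline_tag" and
    # "limit" are standard query parameters on this endpoint.
    params={"inference": "warm", "pipeline_tag": "text-generation", "limit": 10},
    timeout=30,
)
resp.raise_for_status()
for model in resp.json():
    print(model["id"])  # e.g. "mistralai/Mistral-7B-Instruct-v0.3"
```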

If you need a model on demand, you can consider using Inference Endpoints. There is also a sleep feature that shuts an endpoint down after 15 minutes without calls to reduce costs.
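
Once an endpoint is created in the UI, calling it looks like this minimal sketch; the URL below is a placeholder for the one shown on your endpoint's page:

```python
from huggingface_hub import InferenceClient

client = InferenceClient(
    model="https://YOUR-ENDPOINT.endpoints.huggingface.cloud",  # placeholder URL
    token="hf_...",  # your HF access token
)
print(client.text_generation("[INST] Hello! [/INST]", max_new_tokens=128))
```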

(I work at HF)

I have been using the mistralai/Mistral-7B-Instruct-v0.2 model for a case study, and it was functioning perfectly until recently. However, I am now encountering an issue where I am unable to access the model. The error message I receive is as follows:

Model not loaded on the server: https://api-inference.huggingface.co/models/mistralai/Mistral-7B-Instruct-v0.2. Please retry with a higher timeout (current: 120).

Hugging Face team, could you please provide any suggestions or alternatives so that I can access this model again?

@charudatta-17,

You have a few options:

  1. Use the v3 model on the Inference API.
  2. Use Inference Endpoints to host a dedicated endpoint just for you.
  3. Download the model and run it on your own hardware (see the sketch after this list).
  4. Go to a service like together.ai and pay per token on their API.
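
For option 3, a minimal local-inference sketch with transformers (assumes a GPU with enough memory; adjust the dtype and device map for your hardware, and note that `from_pretrained` downloads every file the model needs on its own):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-Instruct-v0.2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{"role": "user", "content": "What does a 503 status code mean?"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```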

Bro, which file am I supposed to download for the model? There are so many files here, I am confused.
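
For what it's worth, you normally don't pick individual files by hand: `from_pretrained` fetches everything the model needs, and `snapshot_download` from `huggingface_hub` mirrors the whole repository. A minimal sketch:

```python
from huggingface_hub import snapshot_download

# Downloads every file in the repo and returns the local folder path;
# from_pretrained() can then be pointed at that folder.
local_dir = snapshot_download(repo_id="mistralai/Mistral-7B-Instruct-v0.2")
print("Model files are in:", local_dir)
```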

I have never understood why people rely on free services. Download it, do it yourself; then you are 100% in control. It also gets rid of issues during updates you are not ready for. Or if you can't do it locally, pay someone else to do it for you...
