Commit 3d262b5 (committed by ponytail)
1 Parent(s): 8f56020

Update README.md

Files changed (1):
  README.md (+3 -3)
README.md CHANGED
@@ -57,8 +57,8 @@ processor = AutoProcessor.from_pretrained(model_id,trust_remote_code=True)
 text = "Please describe this picture"
 prompt = "USER: <image>\n" + text + "\nASSISTANT:"
 image_file = "./test1.jpg"
-# raw_image = Image.open(image_file)
-raw_image = Image.open(requests.get(image_file, stream=True).raw)
+raw_image = Image.open(image_file)
+# raw_image = Image.open(requests.get(image_file, stream=True).raw)
 inputs = processor(images=raw_image, text=prompt, return_tensors='pt').to(cuda, torch.float16)
 
 output = model.generate(**inputs, max_new_tokens=400, do_sample=False)
@@ -66,7 +66,7 @@ predict = processor.decode(output[0][:], skip_special_tokens=True)
 print(predict)
 ```
 
-Our training code will be published publicly on github.[ddw2AIGROUP2CQUPT/Human-LLaVA-8B(github.com)]https://github.com/ddw2AIGROUP2CQUPT/Human-LLaVA-8B]
+Our training code has been released publicly on GitHub: [ddw2AIGROUP2CQUPT/Human-LLaVA-8B (github.com)](https://github.com/ddw2AIGROUP2CQUPT/Human-LLaVA-8B)
 ## Get the Dataset
 #### Dataset Example
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64259db7d3e6fdf87e4792d0/vRojQxm8IMybBV0X5CKbf.png)
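For reference, the lines touched by this commit sit inside the README's quick-start inference example: the change switches the image source from a remote download to a local file. The sketch below reassembles that example end to end under a few assumptions: the model is loaded with the generic `LlavaForConditionalGeneration` class from `transformers` (the repository may ship its own class via `trust_remote_code`), `model_id` is a placeholder for the repository id defined earlier in the README, and `./test1.jpg` is an illustrative local path.

```python
import torch
import requests
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

# Placeholder: use the model repository id given earlier in the README.
model_id = "<huggingface-repo-id>"

# Assumption: the generic LLaVA class is sufficient here; the repository may
# instead provide a custom implementation through trust_remote_code=True.
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, trust_remote_code=True
).to("cuda")
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

text = "Please describe this picture"
prompt = "USER: <image>\n" + text + "\nASSISTANT:"

# After this commit the README opens a local image file...
image_file = "./test1.jpg"
raw_image = Image.open(image_file)
# ...instead of fetching it over HTTP, which is what the removed line did:
# raw_image = Image.open(requests.get(image_file, stream=True).raw)

inputs = processor(images=raw_image, text=prompt, return_tensors="pt").to("cuda", torch.float16)

output = model.generate(**inputs, max_new_tokens=400, do_sample=False)
predict = processor.decode(output[0][:], skip_special_tokens=True)
print(predict)
```

Keeping both loading paths in the README, one commented out, lets users switch back to a remote image URL by swapping which line is active, as the diff above shows.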