add example of how to upload image for GPT4 vision in README.md #249

jkyngan · 2023-11-10T21:07:02Z

First ,thanks for this amazing project.
As GPT-4 vision chat completion endpoint was introduced in v0.7.8,
an update of example in README.md would be great.

ThibautPV · 2023-11-11T00:56:51Z

Here is an example :

$result = OpenAI::chat()->create([
        'model' => 'gpt-4-vision-preview',
        'messages' => [
            [
                'role' => 'user',
                'content' => [
                    ['type' => 'text', 'text' => "What’s in this image?"],
                    ['type' => 'image_url', 'image_url' => "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"],
                ],
            ]
        ],
        'max_tokens' => 900,
    ]);

return $result->choices[0]->message->content;

MohammadaliMirhamed · 2023-11-11T13:57:13Z

is it possibe to upload video for it ?

vesper8 · 2023-11-12T13:14:50Z

@MohammadaliMirhamed Vision does not support video, but what people are doing is splitting up their videos into still frames and uploading that.

aconital · 2023-11-12T22:48:51Z

Is there a way to upload the images? A lot of times it cannot read an image from a URL and image has to be uploaded.

ThibautPV · 2023-11-12T23:50:07Z

If I understood correctly, you need to upload an image to your server.

Then, you can analyze it with GPT Vision

MohammadaliMirhamed · 2023-11-20T09:30:54Z

I have sent https://m.media-amazon.com/images/I/81nUFx9sXoL._AC_UF894,1000_QL80_.jpg to GPT-4-VISION-PREVIEW
and asked for what time is it . and it responded The time on the clock is 10:10.

aconital · 2023-11-25T21:25:59Z

It seems we can directly pass the base64 data to the api instead of url according to the doc here https://platform.openai.com/docs/guides/vision/quick-start. Here is an example:

import base64
import requests

# OpenAI API Key
api_key = "YOUR_OPENAI_API_KEY"

# Function to encode the image
def encode_image(image_path):
  with open(image_path, "rb") as image_file:
    return base64.b64encode(image_file.read()).decode('utf-8')

# Path to your image
image_path = "path_to_your_image.jpg"

# Getting the base64 string
base64_image = encode_image(image_path)

headers = {
  "Content-Type": "application/json",
  "Authorization": f"Bearer {api_key}"
}

payload = {
  "model": "gpt-4-vision-preview",
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What’s in this image?"
        },
        {
          "type": "image_url",
          "image_url": {
            "url": f"data:image/jpeg;base64,{base64_image}"
          }
        }
      ]
    }
  ],
  "max_tokens": 300
}

GilbertrdzDev · 2023-11-28T07:35:40Z

Here I share the PHP version of @aconital:

function encodeImage($imagePath): string {
    $imageContent = file_get_contents($imagePath);
    return base64_encode($imageContent);
}

$imagePath = '/path/to/image.jpg';
$base64Image = encodeImage($imagePath);

$payload = [
    'model'      => 'gpt-4-vision-preview',
    'messages'   => [
        [
            'role'    => 'user',
            'content' => [
                [
                    'type' => 'text',
                    'text' => "What’s in this image?"
                ],
                [
                    'type' => 'image_url',
                    'image_url' => "data:image/jpeg;base64,$base64Image"
                ],
            ],
        ]
    ],
    'max_tokens' => 200,
];

$result  = OpenAI::chat()->create($payload);

echo "<figure style='font-family:sans-serif;width: 500px'>
         <img style='width: 100%' src='$imagePath' alt=''>
         <figcaption>{$result->choices[0]->message->content}</figcaption>
      </figure>";

Result:

LintonAchmad · 2024-06-13T21:03:04Z

updated payload for php now expecting an object for the image_url - gpt-4o:

$payload = [
            'model'      => 'gpt-4o',
            'messages'   => [
                [
                    'role'    => 'user',
                    'content' => [
                        [
                            'type' => 'text',
                            'text' => "What’s in this image?"
                        ],
                        [
                            'type' => 'image_url',
                            'image_url' => [
                                'url' => "data:image/png;base64,$base64Image"
                            ]
                        ],
                    ],
                ]
            ],
            'max_tokens' => 200,
 ];

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add example of how to upload image for GPT4 vision in README.md #249

add example of how to upload image for GPT4 vision in README.md #249

jkyngan commented Nov 10, 2023

ThibautPV commented Nov 11, 2023

MohammadaliMirhamed commented Nov 11, 2023

vesper8 commented Nov 12, 2023

aconital commented Nov 12, 2023

ThibautPV commented Nov 12, 2023 •

edited

Loading

MohammadaliMirhamed commented Nov 20, 2023 •

edited

Loading

aconital commented Nov 25, 2023 •

edited

Loading

GilbertrdzDev commented Nov 28, 2023 •

edited

Loading

LintonAchmad commented Jun 13, 2024

add example of how to upload image for GPT4 vision in README.md #249

add example of how to upload image for GPT4 vision in README.md #249

Comments

jkyngan commented Nov 10, 2023

ThibautPV commented Nov 11, 2023

MohammadaliMirhamed commented Nov 11, 2023

vesper8 commented Nov 12, 2023

aconital commented Nov 12, 2023

ThibautPV commented Nov 12, 2023 • edited Loading

MohammadaliMirhamed commented Nov 20, 2023 • edited Loading

aconital commented Nov 25, 2023 • edited Loading

GilbertrdzDev commented Nov 28, 2023 • edited Loading

LintonAchmad commented Jun 13, 2024

ThibautPV commented Nov 12, 2023 •

edited

Loading

MohammadaliMirhamed commented Nov 20, 2023 •

edited

Loading

aconital commented Nov 25, 2023 •

edited

Loading

GilbertrdzDev commented Nov 28, 2023 •

edited

Loading