Images
The Images API allows you to access DALL-E models for creating and modifying images.
- Creating images
- Analyzing images (Vision)
- Create variations on an image
- Perform inpainting
- Perform outpainting
Config
A default config file is provided for Chat endpoints at config/prompts/images.yml
.
There you can define:
- The model and fallback models used by chat, eg
dall-e-3
- Options to pass to OpenAI, eg
size
orquality
Most image requests are quite simple so don’t need a lot of configuration.
File storage
One drawback of OpenAIs image APIs is that any images created by them are deleted after 1 hour. To counter this in StartKit.AI the image is automatically uploaded to S3 after it is returned by OpenAI, and the resulting image URL is returned instead.
You can disable this by passing upload: false
to any of our image functions.
File uploads
To analyze an image with Vision, the AI needs access to an uploaded file. When the /api/images/detect
endpoint is called, we upload the image directly to S3 and pass the resulting URL to OpenAI Vision automatically.
The other image endpoints require base64
encoded images to be passed, so these images are not stored.
Image sizes
OpenAIs image endpoints are very strict on the images that are sent to create variations, inpainting, and outpainting. They must be .png
format, they must be square, and they must be below 4MB.