SeeSay Contextualizer

Upload an image to generate a caption, extract text, create audio from context, and determine the context using GPT-2 and Florence-2-base.

Upload an Image

Generated Audio

Generated Caption

Extracted Text (OCR)

Generated Context