How to Extract Text from Images in Code Interpreter by OpenAI

🗓️

OpenAI’s Code Interpreter is doing wonders ever since it got launched. And if you are eager to learn how to Extract Text from Images by using the tool, this short blog post is for you.

Let’s get to it…

Using OpenAI’s Code Interpreter to Extract Text from Images

Below, I will be providing you with 2 different example prompts along with working screenshots so that you can get it done easily.

🎉 You also need to upload the required files (JPG, PNG, or other image files) to the Code Interpreter for this operation. Just click on the small plus (+) icon in the left of the prompt input field to select and upload the files.

Prompt #1

Extract text from the uploaded PNG image and save it as a .txt file.

Prompt #1 to Extract Text from Images in OpenAI's Code Interpreter

This prompt instructs the Code Interpreter to perform Optical Character Recognition (OCR) on an uploaded PNG image, extracting any text it contains. The extracted text is then saved into a .txt file, effectively converting the image-based text into a machine-readable format.

Prompt #2

Analyze the uploaded image containing a restaurant bill and extract all the text present in the image. Then, convert the extracted text into a well-formatted CSV file, including relevant details such as item name, quantity, price, and total amount. Ensure that the CSV file is organized with appropriate headers and separators for easy data manipulation.

Prompt #2 to Extract Text from Images in OpenAI's Code Interpreter

This prompt instructs the Code Interpreter to process an image of a restaurant bill, extract the text, and format it into a CSV file. It ensures structured data organization for items, quantities, prices, and total amount for easy manipulation.

🎉 You can modify the above prompts as per your use case to get the desired output. OpenAI’s GPT-4 is powerful and properly understands any prompt you provide.

Also, check some other interesting things that you can do with the OpenAI’s Code Interpreter model:

I have also written a detailed post about the practical use cases of the Code Interpreter model that you may find helpful.

And if you get stuck somewhere, kindly feel free to let me know in the comments below.

Google Sheets + OpenAI API (Script)

A script to connect Google Sheets with the OpenAI API without third-party tools like Zapier or Make. This “script” allows instant access to responses from GPT-4 and GPT-3.5 inside the spreadsheet itself.

  • No 3rd party tools required
  • No subscription required, and
  • Works with a free Gmail account as well

🎉 Costs $10 (one time)



✦ Assistance from an AI engine was utilized in writing and/or editing this post to enhance its quality.

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *