Is this workflow free?

The n8n template is free to import, but OpenAI API usage is billed separately.

What do I need to run it?

An n8n account or instance, an OpenAI API key with GPT-4o access, and file uploads enabled.

How do I import the workflow?

Download the JSON file and use the Import workflow option in n8n.

Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs — n8n Automation Template (2026)

What this workflow does

This workflow creates a smart AI chat assistant that processes text, images, and PDFs with GPT-4o's multimodal capabilities and conversation memory.

It suits AI-driven support bots, personal assistants, and embedded chat widgets needing file analysis and contextual replies.

Who is this for?

Support teams, developers, and product teams building chat interfaces that need to process mixed media queries from users.

What problem it solves

Manually handling images and PDFs in chats breaks context and requires separate tools, slowing down responses and analysis.

Live workflow preview

Interactive canvas of every node and connection — scroll and click to explore. Powered by n8n's preview.

Open the template on n8n to import and run it. View source template →

What it automates

PDF Explainer Bot

Users upload contracts or reports and receive instant answers to questions about specific sections.

Image Analysis Assistant

Support agents receive photos from customers and get AI descriptions plus contextual replies in one thread.

Internal Knowledge Chat

Teams query mixed documents like screenshots and PDFs while keeping conversation history for follow-ups.

How the workflow works

The 6 nodes in this automation, in order.

1AI Agent@n8n/n8n-nodes-langchain.agent
2Basic LLM Chain@n8n/n8n-nodes-langchain.chainLlm
3OpenAI Chat Model@n8n/n8n-nodes-langchain.lmChatOpenAi
4Simple Memory@n8n/n8n-nodes-langchain.memoryBufferWindow
5Chat Memory Manager@n8n/n8n-nodes-langchain.memoryManager
6OpenAI@n8n/n8n-nodes-langchain.openAi

Apps & integrations used

AI AgentBasic LLM ChainOpenAI Chat ModelSimple MemoryChat Memory ManagerOpenAI

How to set up Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs

1Import the workflow JSON into your n8n instance
2Add your OpenAI API key to the Chat Model node and select GPT-4o
3Enable file uploads in the chatTrigger node settings
4Connect the memory nodes (Simple Memory and Chat Memory Manager)
5Activate the workflow and open the chat UI to test text/image/PDF inputs
6Embed the chat widget on your site or connect via webhook

How to customize this workflow

→Swap GPT-4o for GPT-4o mini to reduce token costs
→Replace chatTrigger with a webhook node for API-first use
→Add a Notion or Airtable node to log responses automatically
→Insert a Slack node to notify a channel on new file uploads

Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs: pros & cons

Pros

+Native GPT-4o vision support for images and PDFs
+Conversation memory preserves context across messages
+Modular nodes make it easy to embed or extend
+Works directly in n8n's hosted chat UI

Cons

–Requires paid OpenAI GPT-4o API access
–PDF handling relies on model vision rather than text extraction
–Base64 conversion step adds minor processing overhead

Did you find this helpful?

Frequently asked questions

It builds a chat assistant that accepts text, images, and PDFs and answers using GPT-4o with memory.

User reviews

Verified reviews from the community shape this listing's rating.

Loading reviews…

Sign in to review

Similar workflows

Other ai & llm automations worth a look.

✨🤖Automate Multi-Platform Social Media Content Creation with AI

AI & LLM · n8n

Verified

AI generates and posts optimized social content across multiple platforms.

Advanced👁 205KHTTP Request

Generate AI Viral Videos with Seedance and Upload to TikTok, YouTube & Instagram

AI & LLM · n8n

Verified

Automates AI video creation and uploads to TikTok, YouTube, and Instagram.

Intermediate👁 215KGoogle Sheets

Generate AI Videos with Google Veo3, Save to Google Drive and Upload to YouTube

AI & LLM · n8n

Verified

Automates video workflows from Google Sheets using OpenAI and Drive.

Beginner👁 156KGoogle Sheets

Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs

What this workflow does

Who is this for?

What problem it solves

Live workflow preview

What it automates

PDF Explainer Bot

Image Analysis Assistant

Internal Knowledge Chat

How the workflow works

Apps & integrations used

How to set up Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs

How to customize this workflow

Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs: pros & cons

Pros

Cons

Frequently asked questions

What does this workflow do?

Is this workflow free?

What do I need to run it?

How do I import the workflow?

User reviews

Similar workflows

✨🤖Automate Multi-Platform Social Media Content Creation with AI

Generate AI Viral Videos with Seedance and Upload to TikTok, YouTube & Instagram

Generate AI Videos with Google Veo3, Save to Google Drive and Upload to YouTube

Promote Multimodal Chat Assistant with GPT-4o for Text, Images, and PDFs