This guide explains how the new AI chatbots in WOZTELL work using OpenAI and the GPT Assistant v1.3.0 extension.
With this extension, you can create AI assistants connected to OpenAI to respond automatically across channels such as WhatsApp Business, WhatsApp Cloud API, Instagram, Facebook Messenger, and Webchat. You can also use AI to support human agents inside the WOZTELL Inbox through reply suggestions.
Important: Migration from Assistants APIPreviously, WOZTELL used configurations based on OpenAI’s Assistants API.
OpenAI has announced the deprecation of Assistants API, and its shutdown is scheduled for August 26, 2026. For this reason, legacy configurations must be migrated to the new chatbots based on Responses API before that date.
The new version of GPT Assistant in WOZTELL uses Responses API, which is OpenAI’s recommended standard for new implementations of assistants, conversational agents, and AI tools.
An AI chatbot in WOZTELL is an assistant connected to OpenAI that can interpret user messages, generate automatic responses, and use additional tools to provide better answers.
Depending on the configuration, the assistant can:
This makes it possible to automate repetitive conversations, improve response speed, support the sales team, and deliver more consistent answers across all conversational channels.
GPT Assistant v1.3.0 is the WOZTELL extension that allows you to connect an OpenAI account with WOZTELL’s conversational channels.
The main configuration is divided into three key steps:
Once configured, the assistant can be activated on the available channels within the WOZTELL account.
The new version of GPT Assistant uses OpenAI’s Responses API.
Responses API is the API recommended by OpenAI for new AI assistant implementations. It allows you to work with models, instructions, context, and tools such as File Search, Web Search, and Function Call.
In WOZTELL, conversational context is managed by WOZTELL. This means WOZTELL decides how many previous messages are sent to the model based on the assistant’s configuration.
The migration is important because legacy configurations based on Assistants API will no longer be supported by OpenAI starting on August 26, 2026.
For this reason, customers still using previous configurations must migrate their assistants to the new configuration based on Responses API.
The API Key is the credential that allows WOZTELL to connect with the customer’s OpenAI account.
Without a valid API Key, WOZTELL cannot send requests to the model or correctly activate the assistant configuration.
The Model is the OpenAI model that generates the assistant’s responses.
The list of available models is retrieved dynamically from the connected OpenAI account. This means not all customers will see exactly the same models.
A token is the unit of text processed by the model. It can be a word, part of a word, a number, a space, or a punctuation mark.
OpenAI costs are usually calculated based on input and output tokens.
These are the tokens sent to the model. They may include:
These are the tokens generated by the model in its response.
Context is all the information sent to the model so it can respond correctly.
It may include instructions, previous messages, Knowledge Base information, tool results, and external data.
The context window is the maximum amount of information a model can process in a single request.
The more context is sent, the higher the cost may be and, in some cases, the slower the response may become.
The choice of model depends on the use case, conversation volume, response complexity, and available budget.
| Model | Recommended use in WOZTELL | Quality | Relative cost | Recommendation |
|---|---|---|---|---|
| Advanced OpenAI models | Complex use cases, advanced technical support, reasoning, and high-precision assistants | Very high | High | Use when quality is more important than cost |
| Mini or lightweight models | FAQs, basic support, sales, onboarding, and high-volume conversations | High / medium-high | Low / medium-low | Recommended for most conversational use cases |
| Transcription models | Converting audio messages into text | Not applicable | Based on audio usage | Use when receiving voice notes or audio messages |
Before publishing prices or creating a commercial proposal, it is recommended to check OpenAI’s official pricing page, as models and costs may change.
Instructions, also known as the base prompt or system prompt, define how the assistant should behave.
A good prompt should include:
You are a presales assistant for WOZTELL.
Your goal is to help companies understand how WOZTELL can help them automate conversations across WhatsApp, Instagram, Facebook Messenger, and Webchat.
Be friendly, concise, and professional.
Only answer questions related to WOZTELL, WhatsApp Business Platform, AI chatbots, integrations, automation, and conversational channels.
If the user asks about something unrelated, politely explain that you can only help with WOZTELL-related topics.
Before qualifying the lead, ask for their name, company, website, and main use case.Negative constraints are rules that indicate what the assistant should not do.
Do not invent prices.
Do not promise implementation timelines unless confirmed.
Do not answer questions outside the scope of WOZTELL.
Do not provide legal, financial or medical advice.
Do not say you are an AI unless the user asks directly.Human Handover, or escalation to a human, happens when the assistant needs to transfer the conversation to an agent.
Escalation is recommended when:
File Search allows the assistant to search for information inside a Knowledge Base before responding.
It is useful when the assistant needs to answer based on official documentation, FAQs, product guides, internal processes, or commercial materials.
Web Search allows the assistant to search for up-to-date information on the internet.
It is recommended when the answer depends on information that may change, such as external documentation, prices, regulations, news, or recent public information.
Domain Filtering allows web searches to be limited to specific domains.
Correct example:
woztell.com
support.woztell.com
openai.comIncorrect example:
https://woztell.com/Function Call allows the assistant to call an external function to retrieve data or execute an action.
Use case examples:
The Knowledge Base is the set of documents the assistant can consult to answer questions.
It may include:
The Vector Store is the technical container where processed documents are stored so the assistant can search for relevant information.
A simple way to understand it:
Retrieval is the process through which the assistant retrieves relevant fragments from the Knowledge Base and uses them as additional context to respond.
| Feature | What it does | Who uses it | Replies automatically |
|---|---|---|---|
| AI Chatbot | Its the chatbot configuration, Replies to end users on connected channels | End user | Yes |
| AI Inbox Replies | Its the AI Reply configuration, generates reply suggestions inside the Inbox | Human agent | No |
The AI Chatbot replies automatically to end users on channels such as WhatsApp, Instagram, Facebook Messenger, and Webchat.
AI Inbox Replies helps human agents by generating reply suggestions inside the WOZTELL Inbox. The agent can review, edit, and send the reply manually.
The Channel is the communication channel where the assistant will reply to users.
Supported channels:
The Environment is a space within a channel that allows you to separate testing, development, and production.
If no additional environment has been created, the default environment will be used.
The Prefix Name is the identifying name of the assistant configuration connected to a specific channel or environment.
Examples:
AI-CHATBOT-WHATSAPP-PRESALES
AI-CHATBOT-INSTAGRAM-SUPPORT
AI-CHATBOT-WEBCHAT-FAQ
AI-CHATBOT-FACEBOOK-LEADSConnect links the assistant to the selected channel and activates it.
Disconnect unlinks the assistant from the selected channel.
Go Live is the moment when the assistant becomes active on a real channel and starts replying to end users.
The cost of using OpenAI mainly depends on the model, the tokens processed, and the tools used.
A hallucination is a response generated by the model that may sound correct but is not based on real or confirmed information.
If the answer is not available in the Knowledge Base or confirmed documentation, do not invent it. Ask the user to contact the WOZTELL team or escalate to a human agent.Grounding means basing the assistant’s responses on specific sources, such as Knowledge Base documents, Web Search results, or data returned by a Function Call.
Logs are technical records that show events, requests, and errors generated during the assistant’s operation.
They are used to diagnose problems related to:
| Error | Likely cause | Recommended solution |
|---|---|---|
| API Key not accepted | Key copied incorrectly, revoked, or invalid | Generate a new API Key and save it in WOZTELL |
| Model not available | The OpenAI account does not have access to the model | Review permissions, project, and billing in OpenAI |
| File Search not working | There are no files, the Vector Store is incorrect, or the tool is not active | Upload files, select the correct Vector Store, and enable File Search |
| The bot is not responding | Channel not connected, invalid API Key, or configuration error | Review the channel, API Key, logs, and assistant configuration |
The new WOZTELL AI chatbots, powered by OpenAI and Responses API, make it possible to create more flexible assistants that can operate across multiple channels and use tools such as File Search, Web Search, and Function Call.
This new architecture progressively replaces the previous configurations based on Assistants API. For this reason, customers who are still using legacy assistants must migrate before August 26, 2026.
Correctly configuring the model, prompt, Knowledge Base, tools, channel, and escalation rules is essential to create an assistant that is reliable, useful, and cost-controlled.