Question 1

What is Samuel?

Accepted Answer

Samuel is a free, open-source voice-first AI companion for macOS — wake-word activated ("Hey Samuel"), speaks back in under half a second, sees your screen and hears your system audio when you allow it, drives any Mac app, browses the web like a human, and writes his own tools on demand using GPT-5.5. Released under the MIT license.

Question 2

How is Samuel different from ChatGPT, Siri, or Alexa?

Accepted Answer

ChatGPT lives in a browser tab and only sees what you paste; Siri and Alexa run scripted commands; meeting tools like Granola or Otter summarize after the call. Samuel is the only one you can just talk to, in real time, about whatever just happened on your screen or in your audio — wake word in, voice out, sub-500 ms — and the only one that writes brand-new tools for itself when you ask for something he can't yet do.

Question 3

What can I use Samuel for?

Accepted Answer

"What did they just say?" mid-meeting, podcast, or lecture · live in-call assist for sales, support, and interviews · hands-free Mac control for RSI or VoiceOver users · ambient language learning while watching anime, K-drama, foreign news, or YouTube · meeting summarization without a bot joining · voice-controlled web browsing ("show me my Gmail") · self-building AI tools by voice ("build me a weather widget") · ambient monitoring ("tell me when you hear X").

Question 4

How fast does Samuel respond?

Accepted Answer

The voice loop takes roughly half a second end-to-end, from wake word in to reply out; that is the latency of the OpenAI Realtime API. Tasks that need a screen read or audio recall add a couple of seconds. Deep reasoning (e.g., writing a new tool with GPT-5.5) takes 3–8 s, but Samuel narrates what he's doing while he works so you're never left wondering.

Question 5

Can Samuel actually do things, or only answer?

Accepted Answer

He can do things. Samuel drives any macOS app via the Accessibility tree — clicks, types, scrolls, switches tabs, opens apps — and falls back to GPT-5.5 visual computer-use when an app's accessibility info is thin. You choose how aggressive he is: background_workspace (zero-touch ambient), observe_only (read-only), ask_before_action (asks before writes), or takeover (full keyboard and mouse).

Question 6

Does Samuel really write his own tools?

Accepted Answer

Yes. When you ask for something Samuel doesn't already do — for example, a weather widget — he generates the code with GPT-5.5, has it reviewed by GPT-4o-mini, validates it, and installs it without restarting. If the new tool breaks later (because an external API changed, say), he diagnoses the failure and patches the code automatically. Maximum two repair attempts; after that he explains in plain language what he needs from you.
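
The bounded repair behaviour described above can be pictured with a small sketch. This is not Samuel's actual code: the names `run_with_repair`, `run`, and `patch` are hypothetical stand-ins for "execute the tool" and "ask the model for a patched version". Only the limit of two repair attempts comes from the answer itself.

```python
from dataclasses import dataclass
from typing import Callable, Optional

MAX_REPAIR_ATTEMPTS = 2  # the limit stated in the answer above


@dataclass
class RepairResult:
    ok: bool        # True if the tool ran without error
    attempts: int   # number of repair attempts that were made
    code: str       # the last version of the tool's code
    message: str    # plain-language status for the user


def run_with_repair(
    code: str,
    run: Callable[[str], Optional[str]],   # hypothetical: returns an error string, or None on success
    patch: Callable[[str, str], str],      # hypothetical: returns patched code given (code, error)
    max_attempts: int = MAX_REPAIR_ATTEMPTS,
) -> RepairResult:
    """Run a tool and, on failure, patch and retry at most `max_attempts` times."""
    error = run(code)
    attempts = 0
    while error is not None and attempts < max_attempts:
        attempts += 1
        code = patch(code, error)
        error = run(code)
    if error is None:
        return RepairResult(True, attempts, code, "The tool is working.")
    return RepairResult(
        False, attempts, code,
        f"Still failing after {attempts} repair attempts: {error}",
    )
```

In this sketch the failure message returned after the final attempt is where a plain-language explanation to the user would go.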

Question 7

Can Samuel translate or interpret in real time?

Accepted Answer

Yes. While you watch foreign-language content (anime, news, lectures, YouTube), you can ask any question by voice and Samuel answers within roughly half a second without pausing the video. This works for Japanese, Spanish, Mandarin, French, Korean, German, and any other language the underlying models support. The same flow works for live meetings.

Question 8

Can I ask Samuel about audio he just heard?

Accepted Answer

Yes. Once audio listening is allowed, Samuel keeps a rolling local audio buffer running silently. When you ask "what did they just say?" / "translate the last 30 seconds" / "teach me the words from that clip", he ffmpeg-trims the tail of the buffer to your window, transcribes it with gpt-4o-transcribe, and answers. Your question is the boundary — no polling cadence, no auto-pause/resume keystroke fights, and zero transcription cost while idle.
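
As a rough illustration of the trimming step, the sketch below builds an ffmpeg command that keeps only the last N seconds of a buffer file. It assumes the rolling buffer is a single file in a format that supports stream copy (for example WAV); the file names and the helper names `build_trim_command` and `trim_tail` are hypothetical, and Samuel's real invocation may differ. The `-sseof` input option, which seeks relative to the end of the file, is a standard ffmpeg option.

```python
import subprocess
from pathlib import Path


def build_trim_command(buffer_path: str, out_path: str, seconds: int) -> list[str]:
    """Return an ffmpeg argument list that keeps only the last `seconds` of audio."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it already exists
        "-sseof", f"-{seconds}",  # input option: seek to `seconds` before end of file
        "-i", buffer_path,
        "-c", "copy",             # copy the stream without re-encoding
        out_path,
    ]


def trim_tail(buffer_path: str, out_path: str, seconds: int) -> Path:
    """Run ffmpeg (must be installed and on PATH) and return the clip path."""
    subprocess.run(build_trim_command(buffer_path, out_path, seconds), check=True)
    return Path(out_path)
```

For "translate the last 30 seconds", a call such as `trim_tail("buffer.wav", "clip.wav", 30)` would produce the clip that is then sent for transcription.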

Question 9

How does Samuel browse the web?

Accepted Answer

Three tiers, chosen automatically. Quick search via SerpAPI for "look up X" requests. Deep research via OpenAI Responses API with web search for "find more details" requests, returned with cited sources. Real browser automation via Playwright for any login-required site (Gmail, GitHub, your bank, internal tools) — Samuel opens a visible Chromium window, you sign in once, and he reads and clicks through the page like a human.
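
To make the three-tier idea concrete, here is a deliberately simple routing sketch. The keyword lists and the function `choose_tier` are assumptions made for illustration only; the answer above does not say how Samuel actually picks a tier, and a real router would more plausibly use a model-based classifier than substring matching.

```python
from enum import Enum


class Tier(Enum):
    QUICK_SEARCH = "quick_search"    # "look up X" requests
    DEEP_RESEARCH = "deep_research"  # "find more details" requests
    BROWSER = "browser"              # login-required sites


# Hypothetical keyword lists, chosen only to mirror the examples in the answer.
LOGIN_SITES = ("gmail", "github", "bank")
DEEP_PHRASES = ("find more details", "more details", "research")


def choose_tier(request: str) -> Tier:
    """Pick a browsing tier from the wording of a spoken request."""
    text = request.lower()
    if any(site in text for site in LOGIN_SITES):
        return Tier.BROWSER
    if any(phrase in text for phrase in DEEP_PHRASES):
        return Tier.DEEP_RESEARCH
    return Tier.QUICK_SEARCH
```

Checking login-required sites first reflects the idea that a request such as "show me my Gmail" cannot be served by a search API at all, so it should never fall through to the cheaper tiers.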

Question 10

Which operating systems does Samuel support?

Accepted Answer

The v0.1.0 release is a 376 MB DMG for Apple Silicon Macs running macOS 14 (Sonoma) or newer. An Intel Mac build, a Windows port, and a Linux port are on the public roadmap — drop your email in the notify section and we'll let you know when yours is ready.

Question 11

Is Samuel free?

Accepted Answer

Yes — every beta feature available today will stay free forever. That's a commitment, not a trial period. Samuel is open source under the MIT license, and the entire current capability set — voice conversation, ambient screen and audio, browser automation, plugin generation, auto-repair, memory — is yours to keep. You only pay OpenAI for model usage (wake-word listening ~$0.006/min, ambient assistance ~$0.02–0.05/min, voice conversation at standard Realtime API rates). Plugins and browser automation run locally and cost nothing.
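
A quick back-of-the-envelope calculation shows what those per-minute figures mean over a session. The rates are the approximate ones quoted above; the function names are hypothetical, and actual OpenAI billing depends on current pricing and usage, so treat the results as rough estimates.

```python
from decimal import Decimal

# Approximate per-minute rates quoted in the answer above (USD).
WAKE_WORD_PER_MIN = Decimal("0.006")
AMBIENT_PER_MIN = (Decimal("0.02"), Decimal("0.05"))  # low and high estimate


def wake_word_cost(minutes: int) -> Decimal:
    """Estimated cost of wake-word listening for the given number of minutes."""
    return WAKE_WORD_PER_MIN * minutes


def ambient_cost_range(minutes: int) -> tuple[Decimal, Decimal]:
    """Estimated (low, high) cost of ambient assistance for the given minutes."""
    low, high = AMBIENT_PER_MIN
    return low * minutes, high * minutes
```

At these rates, an hour of wake-word listening comes to about $0.36, and an hour of ambient assistance to roughly $1.20 to $3.00.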

Question 12

Do I need an OpenAI API key to try Samuel?

Accepted Answer

No — v0.1.0 ships with a free trial proxy so you can use Samuel without bringing your own key on first launch. When the trial credits are used up, paste your own OpenAI API key in Settings → API Key for unlimited use; the app then talks to OpenAI directly and never contacts the proxy again. Samuel itself is free forever; the key only pays OpenAI for the model calls.

Question 13

Is my data private?

Accepted Answer

Memory, preferences, skills, plugins, and API keys are stored locally in ~/.samuel/. Browser sessions run locally via Playwright. Screen captures and audio are sent to OpenAI only while a feature is active, and every privacy-sensitive surface (continuous listening, continuous screen watching) requires an explicit Allow on a consent card the first time it flips on — no auto-approve countdown for those two. Toggle screen watching and audio listening off at any time in Settings, or say "stop listening to my speakers."

Question 14

What models does Samuel use?

Accepted Answer

OpenAI Realtime API for voice conversation (~500 ms), GPT-5.5 with reasoning for plugin code generation and visual computer control (3–8 s), GPT-4o Vision for screen understanding (3–5 s), GPT-4o-mini for code review and trigger classification (~1 s), and gpt-4o-transcribe for high-fidelity audio recall (3–10 s).

Question 15

How do I install Samuel?

Accepted Answer

Download the DMG (376 MB), open it, and drag Samuel into Applications. On first launch, right-click Samuel and choose Open to bypass the "Apple cannot verify" notice — notarization is on the roadmap. The free trial proxy lets you start without an API key; for unlimited use, paste your own OpenAI key in Settings.

An AI that lives with you.
And grows with you.

Real Japanese. Real time.

You don't open Samuel. You just talk to him.

He sees what you see.

He hears what you hear.

He answers in your voice's pause.

Ask for anything.
He'll build the ability if it doesn't exist.

Stop switching tabs.
Just talk.

A new shape for AI.

The ceiling isn't what we shipped.
It's what you'll ask for.

FAQ

Bring Samuel home.
