Skip to content
← All content

Created from a single voice note with Agent Craft

X (Twitter)

I have this thing with words. Specific words. If I can't name…

I have this thing with words. Specific words. If I can't name something precisely, I don't trust that I actually understand it. So when I started building Agent Craft and kept trying to explain what we were capturing from someone's voice, I kept hitting a wall with the language available to me. "Speech patterns" felt clinical. "Verbal habits" felt like a therapy session. "Tone" was too vague. None of it landed on the actual thing I was pointing at. The thing I was pointing at: the way a person says "essentially" three times in a minute without noticing. The way they trail off with "and so on" when they assume you're following. The filler word that only shows up when they're confident, not when they're nervous. The weird specific phrase they've coined that no one else uses but everyone who knows them immediately recognises. All of that. Together. As a fingerprint. I started calling them earisms. I don't know exactly where it came from. I wasn't sitting down trying to coin a term. I was in the middle of explaining the capture process to someone and the word just came out, and I remember pausing because it felt right in a way the other words hadn't. Your earisms. Just the nature of how you speak. That's the thing people misunderstand about what Agent Craft does with a voice note. There's a rather common and actually pretty misguided assumption that we're just transcribing speech and tidying it up. We're not. The transcription is almost incidental. What actually happens underneath is that we're reading the text, identifying patterns, building a picture of how this specific person phrases things, what words they reach for, how they answer questions, what examples they default to. All of that becomes the fingerprint. The earisms are the data. I started using the word in internal conversations. Then in product discussions. Then in demos. Nobody ever asked me to clarify what I meant by it. They just got it immediately, which told me the word was doing real work. Quality is the first thing that pops into my head when I think about why this matters. If you lose the earisms in the output, you've lost the person. The content becomes generic. It sounds like it was written by a committee that had read a lot of content but had never actually met you. And people who follow you, who've read your posts, who know your voice, they feel that absence. Maybe they can't name it, but they feel it. That's the gap I kept identifying as I built this. Not the quantity problem. Not the scheduling problem. The voice problem. The way people would hand content off to a tool or a writer and get something back that was competent and completely bloodless. Earisms are what make you sound like yourself. I think about this with my own writing. I use "essentially" a lot. I frame explanations by naming the misconception first. I gravitate toward the word "essentially" when I'm trying to collapse a complex idea down to its core. These aren't choices I make consciously in the moment. They're just how I think on the page. If you took those things out of my content and replaced them with cleaner, more neutral phrasing, you'd have something that read fine and felt like a stranger wrote it. The word earisms has stuck around. It's in how we talk about the product internally. It's the shorthand for the thing we're actually trying to preserve when we capture someone's voice and translate it into content. Not their opinions, not their topics, not their format preferences. Those matter too, but they're easier to name and easier to replicate. The earisms are the hard part. They're also the part that makes the whole thing worth doing. If your voice isn't in your content, what's the point of posting under your name?

James GoddardJun 3, 2026Published to X — @JamesGodda75737View original ↗

More content from Agent Craft