← Blog

Hands-Free Hive Inspection: Why Voice Notes Beat the Notebook

By Toly · April 28, 2026 · 5 min read

I have tried every way to take inspection notes that you can think of. Pocket notebook in a sandwich bag. Waterproof Rite-in-the-Rain. Phone screen with a stylus. A clipboard zip-tied to the truck. An app form with thirty fields. A dictated voice memo straight to my phone's stock recorder app.

They all break in the same place: the moment between observing something and recording it. The bees are loud, the gloves are sticky, and you have about six seconds before the next thing demands your attention. Anything that needs two free hands or any screen interaction loses that window.

This post is about why voice-first logging is the only method I have found that survives that window, and what it took to make voice good enough to be the system of record.

What are the failure modes of paper?

Paper survives the apiary fine — propolis is forgiving, and a Rite-in-the-Rain notebook can take a season of abuse. The failure mode is on the other side: you have to transcribe it later, and you will not.

Even when you do, the structure of what you wrote down is whatever your hand had time for. "Hive 3 ok, queen seen, supersedure cell?" is the kind of note you write at the hive, and a month later you have no idea what it means. Was the question mark "I think I saw one" or "I am looking at one"? Did you follow up? You cannot tell. The notebook captured the observation but not the context.

What about phone screens?

Phone screens are worse. The propolis problem is the same as paper, except now there is also touch capacitance to fight. Capacitive touch through nitrile is hit-or-miss; through propolis-coated nitrile, it is closer to "miss."

Apps with form-based logging compound the problem. Every field is a tap, every dropdown is a scroll, every comment is a soft-keyboard struggle. Five fields per inspection times forty hives is two hundred touches. You will not do them. You will tap one field — usually "queen seen" — and call it an inspection.

So why has voice not been the answer for years already?

Two reasons that turned out to be both real and solvable:

  1. Generic dictation cannot handle beekeeping vocabulary. "Supersedure" becomes "super seizure," "varroa" becomes "vera," "nuc" becomes "nuke," "propolis" becomes either "propellers" or some indignity I do not want to write here. Your transcript becomes garbage and you give up.
  2. The audio went to the cloud immediately, or did not happen at all. Stock voice memo apps assume reliable network. Most apiaries do not have reliable network. So the recording either fails silently or stops working the moment you walk into a dead zone.

Both of these are solvable. Neither is solved by the speech-to-text APIs themselves — you have to do real work on top of them.

What does it take to make voice work for beekeeping?

Three things, all of which we had to build deliberately.

  1. Prime the speech model with beekeeping vocabulary. The Whisper API takes a prompt parameter that is essentially a list of unusual words to bias the recognizer toward. We pass it a curated list of beekeeping terms — varroa, supersedure, propolis, nuc, drone laying queen, queen rearing, apivar, oxalic — and the recognition rate on those words goes from "frustrating" to "you can rely on it."
  2. Post-correct what slipped past. Even with priming, some terms still come back wrong. We run a deterministic correction layer over the transcript that catches the residue: "super seizure" → "supersedure," "vera" → "varroa," "nuke" → "nuc," "drone laying clean" → "drone laying queen." This is unglamorous code and it is the difference between a transcript you trust and one you do not.
  3. Save the audio locally first; transcribe later. The recording lands in a local database on the phone the moment you start it. Transcription queues for whenever the network reappears. You can finish four hives in a dead zone, drive home, and watch the transcripts populate in your kitchen.

Does this actually work at scale?

Yes, with one caveat: voice does not replace the part of your brain that notices what to notice. It replaces the part that has to remember what it noticed five hours later. Those are different jobs and only the second one has been broken.

In practice, the workflow is short. Tap the NFC tag on a hive. Hear the app say "recording for Lavender." Talk through what you see. Tap the tag again to stop. Move to the next hive. The recording, the structured data, the action items, and the follow-up reminders all flow from that one narration. You do not write anything down. You do not type anything. You do not look at a screen.

Manual entry is still there for the days you don't feel like talking, or for the evening review on the web portal — every voice-extractor field is also a manual-entry field, so you can pick whichever path matches the moment. If you want to see what gets extracted from a narration, the features page walks through the workflow end-to-end. If you are skeptical about whether voice transcription is good enough for the words beekeepers actually use, the Why WhisperBee page covers the vocabulary tuning.

What I would do differently if I were starting again

Skip the notebook stage entirely. The hours I spent transcribing my own handwriting were the worst hours of beekeeping I had, and they were optional. Anything that records the moment of observation in a way you can search later is better than anything that needs to be transcribed.

That said: try voice on a single yard for two weeks before you commit. Two weeks is enough time to find your own friction points (where you put the phone, what you say first, how you handle a bad recording) and short enough that nothing is sunk if it does not fit the way you work.

The free tier covers two hives, which is more than enough to do the test. Sign up and try it on the next inspection day.

About the author

Toly is the founder of WhisperBee and an active beekeeper, writing about the parts of beekeeping that show up between inspections — record keeping, mite pressure, the workflow at the hive — and the tooling decisions that come out of running an apiary in real conditions.

More: all posts, why we built WhisperBee, RSS.