2026-05-23·4 min read

Private Interview Transcription: Source Confidentiality, On-Device AI

Journalists protect sources. Lawyers protect client communications. Researchers protect participant identities. Sending interview audio to cloud transcription services puts all of that at risk. LocalRAG! v3.0 transcribes interviews entirely on your iPhone or Android — the audio never leaves the device. WhisperKit transcribes, AI cleans up, and you can search across hours of recordings with AI Q&A. All on-device. All private.

Why cloud transcription fails sensitive interviews

Cloud transcription services (Otter, Rev, Fireflies) require uploading audio to their servers. For investigative journalism, the source might be at risk if their voice ends up in a third-party database. For legal interviews, attorney-client privilege can be at issue. For research interviews under IRB protocols, uploading violates participant consent. Even with strong vendor security promises, the audio existing on third-party servers is the privacy risk. The only real solution: never upload.

How LocalRAG! solves this

LocalRAG! transcribes interview audio (MP3, M4A, WAV, AAC, FLAC) entirely on your device with WhisperKit. An LLM cleanup pass removes filler words to produce clean, readable transcripts — suitable for direct quotation. The transcript is indexed locally; you can ask the AI to find quotes about specific topics, summarize what the interviewee said about a subject, or compare statements across multiple interviews. Nothing is uploaded.

1🎤

Record the interview

Use any voice recorder app. iOS’s built-in Voice Memos exports M4A directly. Import the file into LocalRAG!.

2🔒

On-device transcription

WhisperKit runs locally. The audio file stays in app sandbox — OS-level isolation. No network calls for the transcription step.

3🔍

Search and quote

“What did the source say about the deadline?” — LocalRAG! returns the cleaned transcript excerpt with timestamp. Copy verbatim for quotation.

Why journalists, lawyers, and researchers choose LocalRAG!

📰

Source protection

Investigative journalism’s first rule: protect your sources. Audio never leaves your device. No cloud breach, subpoena, or insider threat can expose your recordings.

⚖️

Privileged communications

Attorney-client interviews stay privileged. No third-party processor means no third-party discoverability — a meaningful posture for sensitive legal work.

🔋

IRB-friendly research

For human subjects research under IRB protocols, on-device transcription often meets stricter consent forms (“data will not be shared with third parties”).

📝

Clean, quotable transcripts

Raw Whisper output has fillers; LocalRAG’s LLM cleanup pass produces transcripts that read like edited speech. Quote directly with citation timestamps.

Example questions you can ask

“Find the moment the source confirmed the document existed”

AI scans the transcript for the confirmation moment and returns the exact quote with timestamp.

“What did the witness say about the meeting on March 14?”

Searches for March 14 references and returns the relevant statements verbatim.

“Summarize what the interviewee said about the company culture”

Returns a clean summary with citations to the specific exchanges.

“Compare what Interview A and Interview B said about leadership”

With both interviews in one Collection, AI cross-references and presents differences side-by-side.

Verdict

When source confidentiality and privilege matter, on-device is the only acceptable answer. LocalRAG! v3.0 brings WhisperKit-quality transcription and AI Q&A to the workflows that demand absolute privacy. Your sources, clients, and research participants deserve that protection.

FAQ

Is on-device transcription as accurate as cloud transcription?

WhisperKit base model is ~85% accurate on clear English speech and ~75% on Japanese. Cloud services using larger models can reach 90-95%, but you trade privacy for those points. v3.1 will introduce optional Whisper small/medium downloads for users who need higher accuracy on-device.

Can I produce verbatim transcripts for legal proceedings?

LocalRAG!’s LLM cleanup pass intentionally polishes the text — it removes fillers and restructures incomplete sentences. For legal proceedings requiring verbatim records, you should disable cleanup (settings option in v3.0) or work from the raw Whisper output, which preserves disfluencies.

How do I handle multi-speaker interviews?

Currently LocalRAG! transcribes all speakers as one stream. Speaker diarization (“Speaker 1, Speaker 2”) is on the roadmap for v4.0 — we’re evaluating on-device diarization models that don’t require cloud processing.

Try LocalRAG! Free

Free tier with 5 questions per day. No account required.

← Home