Journalists protect sources. Lawyers protect client communications. Researchers protect participant identities. Sending interview audio to cloud transcription services puts all of that at risk. LocalRAG! v3.0 transcribes interviews entirely on your iPhone or Android — the audio never leaves the device. WhisperKit transcribes, AI cleans up, and you can search across hours of recordings with AI Q&A. All on-device. All private.
Cloud transcription services (Otter, Rev, Fireflies) require uploading audio to their servers. For investigative journalism, the source might be at risk if their voice ends up in a third-party database. For legal interviews, attorney-client privilege can be at issue. For research interviews under IRB protocols, uploading violates participant consent. Even with strong vendor security promises, the audio existing on third-party servers is the privacy risk. The only real solution: never upload.
LocalRAG! transcribes interview audio (MP3, M4A, WAV, AAC, FLAC) entirely on your device with WhisperKit. An LLM cleanup pass removes filler words to produce clean, readable transcripts — suitable for direct quotation. The transcript is indexed locally; you can ask the AI to find quotes about specific topics, summarize what the interviewee said about a subject, or compare statements across multiple interviews. Nothing is uploaded.
Use any voice recorder app. iOS’s built-in Voice Memos exports M4A directly. Import the file into LocalRAG!.
WhisperKit runs locally. The audio file stays in app sandbox — OS-level isolation. No network calls for the transcription step.
“What did the source say about the deadline?” — LocalRAG! returns the cleaned transcript excerpt with timestamp. Copy verbatim for quotation.
Investigative journalism’s first rule: protect your sources. Audio never leaves your device. No cloud breach, subpoena, or insider threat can expose your recordings.
Attorney-client interviews stay privileged. No third-party processor means no third-party discoverability — a meaningful posture for sensitive legal work.
For human subjects research under IRB protocols, on-device transcription often meets stricter consent forms (“data will not be shared with third parties”).
Raw Whisper output has fillers; LocalRAG’s LLM cleanup pass produces transcripts that read like edited speech. Quote directly with citation timestamps.
AI scans the transcript for the confirmation moment and returns the exact quote with timestamp.
Searches for March 14 references and returns the relevant statements verbatim.
Returns a clean summary with citations to the specific exchanges.
With both interviews in one Collection, AI cross-references and presents differences side-by-side.
When source confidentiality and privilege matter, on-device is the only acceptable answer. LocalRAG! v3.0 brings WhisperKit-quality transcription and AI Q&A to the workflows that demand absolute privacy. Your sources, clients, and research participants deserve that protection.
WhisperKit base model is ~85% accurate on clear English speech and ~75% on Japanese. Cloud services using larger models can reach 90-95%, but you trade privacy for those points. v3.1 will introduce optional Whisper small/medium downloads for users who need higher accuracy on-device.
LocalRAG!’s LLM cleanup pass intentionally polishes the text — it removes fillers and restructures incomplete sentences. For legal proceedings requiring verbatim records, you should disable cleanup (settings option in v3.0) or work from the raw Whisper output, which preserves disfluencies.
Currently LocalRAG! transcribes all speakers as one stream. Speaker diarization (“Speaker 1, Speaker 2”) is on the roadmap for v4.0 — we’re evaluating on-device diarization models that don’t require cloud processing.
Try LocalRAG! Free
Free tier with 5 questions per day. No account required.