guideJanuary 27, 2026·4 min read

System Audio Capture on macOS: How It Actually Works

A non-technical explanation of how meeting recorders capture audio without joining your call.

The Magic Behind Invisible Recording

When a meeting recorder captures your call without joining it, what's actually happening?

The answer involves something called "system audio"—and macOS has specific ways to access it.

Let's break it down without getting too technical.


What Is System Audio?

System audio is everything playing through your computer's speakers or headphones.

  • Zoom call audio ✓
  • YouTube video ✓
  • Notification sounds ✓
  • Spotify ✓

It's the mixed output of all applications, delivered to your audio output device.

When you record system audio, you're capturing everything your computer is playing—including the meeting.


How macOS Handles This

For years, recording system audio on Mac was complicated. You needed third-party tools that installed virtual audio devices (like Loopback).

In 2022, Apple introduced ScreenCaptureKit—a built-in framework that lets apps capture screen and audio with proper permissions.

Modern meeting recorders use ScreenCaptureKit. They request permission once, then can capture system audio natively.

No virtual devices. No kernel extensions. No complicated setup.


Why This Matters for Meetings

Because your meeting audio (what you hear from other participants) is part of system audio, capturing it is straightforward.

The recorder isn't "joining" Zoom or Meet. It's not a participant. It's just recording what your computer is playing.

From the meeting's perspective, nothing is different. No bot joins. No participant list changes. The recording happens locally on your machine.


What Gets Captured

  • Meeting audio (other participants)
  • Anything else playing (music, videos)
  • System sounds

Most meeting recorders filter for just the meeting application to avoid recording unrelated audio. But technically, everything goes through the same output.

Tip: Close Spotify before your call if you don't want background music in your transcript.


What About Your Voice?

System audio is what you hear, not what you say.

Your voice comes from your microphone, which is a separate input.

Most recorders capture both: 1. Microphone input (your voice) 2. System audio output (their voices)

Then they mix these into a single recording, transcribe it, and identify speakers.


The Permission Model

When you first run a ScreenCaptureKit app, macOS asks for permission:

"[App] would like to record your screen and audio."

This is the same permission you give for screen recording. Once granted, the app can capture system audio.

You can revoke this in System Preferences → Privacy & Security → Screen Recording.


Quality Considerations

System audio capture is generally high quality because you're recording the digital audio stream directly. No microphone noise, no room acoustics.

  • Your internet connection (affects meeting audio quality)
  • Speaker audio quality (depends on remote participants' mics)
  • Application mixing (if multiple apps are playing)

For most meetings, quality is excellent. Better than recording the room with a physical mic.


Why Not Everyone Does This

Bot-based recorders have one advantage: they work from any device.

System audio capture requires the recording app running locally on your Mac. If you're joining from a phone or different computer, you need the app there too.

But if your meetings happen from your Mac (which, for most remote workers, they do), local capture is simpler and more private than sending a bot.

No bot announcements. No awkward explanations. Just recording.

Eddie

Eddie

Founder, Magnative

Never forget what a client told you

Magnative auto-records every call and files transcripts to your Google Drive client folders. So your AI assistant actually knows your client history.