Admin

Live OpenAI monthly cost data unavailable — showing local estimates.

Jobs today

Cost today

· DB estimate

Daily auto-suspend cap left

Self-imposed cap (DAILY_PLATFORM_BUDGET_USD)

Monthly auto-suspend cap left

Live OpenAI ~ Estimate

Job history

No jobs recorded yet. Process an audio file to see history here.

About this build

Transcription backend

Active backend for new transcription jobs. Vintage mode is an override that forces local large-v3 + vocal separation regardless of the provider — tuned for old / low-quality recordings (cassette transfers, AM radio, music with strong instrumentals). See local_transcription.md for the sizing matrix.

Quick presets:

Provider (TRANSCRIPTION_PROVIDER)

openai calls the OpenAI transcription API (cloud, fast, good on speech). faster-whisper runs Whisper locally (no API call, slower on CPU).

Vocal separation (Demucs) (ENABLE_VOCAL_SEPARATION)

Extracts the vocal stem from the mix before transcription. Strongly recommended for music — without it Whisper hallucinates random languages on instrumental-heavy tracks. First use downloads ~3 GB of model weights (one-time).

Demucs model (VOCAL_SEPARATION_MODEL)

htdemucs (v4 hybrid transformer) is the speed/quality default.

Allow vintage mode for users (ALLOW_VINTAGE_MODE)

When off (default), the vintage toggle is hidden in the UI and any vintage=true sent via the API is silently ignored. Enable only on instances with a local faster-whisper + Demucs stack powerful enough to run it.

Vintage mode (VINTAGE_MODE)

Forces local large-v3 + vocal separation regardless of the provider. ~6× realtime on CPU — best run as a one-off via --vintage on the CLI for individual jobs, not as a default.
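The interaction between ALLOW_VINTAGE_MODE, VINTAGE_MODE and a per-job vintage=true request can be sketched as follows. This is a minimal illustration, not the application's actual code: the function name resolve_vintage and its parameters are hypothetical, and it assumes the global VINTAGE_MODE override applies whether or not users are allowed to request vintage mode themselves.

```python
def resolve_vintage(allow_vintage_mode: bool, vintage_mode: bool, requested: bool) -> bool:
    """Return True when a job should run in vintage mode.

    allow_vintage_mode -> ALLOW_VINTAGE_MODE (per-user toggle permitted)
    vintage_mode       -> VINTAGE_MODE (global override)
    requested          -> vintage=true sent by the user or the API client
    """
    if vintage_mode:
        # Assumption: the global override wins regardless of user permission.
        return True
    # A user request is silently ignored unless the instance allows it.
    return allow_vintage_mode and requested
```

With both settings off, resolve_vintage(False, False, True) returns False, which matches the "silently ignored" behaviour described above.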

Auto-retry degraded transcripts with vocal separation (AUTO_RETRY_ON_DEGRADED)

When the first pass returns a degraded transcript (segments cover less than 30 % of the audio, the typical sign of hallucination on instrumental-heavy music), the same audio is automatically re-run with Demucs forced on. The user is charged credits exactly once, regardless of the retry. When off, the user keeps the degraded transcript and sees a warning banner instead. The retry adds wall-clock time only when the first pass actually failed.
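The retry flow can be sketched as a single wrapper around the transcription call. Everything here is illustrative: transcribe_with_retry, the callable signature, and the (text, coverage) tuple are hypothetical names, and credit accounting is left to the caller.

```python
from typing import Callable, Tuple

# (transcript text, fraction of the audio covered by segments, 0.0 to 1.0)
Transcript = Tuple[str, float]


def transcribe_with_retry(
    transcribe: Callable[[bool], Transcript],  # argument: force vocal separation?
    auto_retry: bool = True,                   # AUTO_RETRY_ON_DEGRADED
    threshold: float = 0.30,                   # DEGRADED_COVERAGE_THRESHOLD
) -> Tuple[Transcript, bool]:
    """Run one pass and retry at most once with vocal separation forced on.

    Returns (transcript, degraded). Billing is not handled here, so the
    caller can charge once no matter how many passes ran.
    """
    first = transcribe(False)
    if first[1] >= threshold:
        return first, False
    if not auto_retry:
        return first, True  # caller shows the warning banner
    second = transcribe(True)
    return second, second[1] < threshold
```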

Music mode: always use vocal separation (MUSIC_MODE_FORCE_VOCAL_SEPARATION)

When a job is submitted with Audio type = Music track, automatically run Demucs vocal separation even if the global "Vocal separation" toggle is off. Recommended on: hallucinations triggered by guitars and other instruments are the #1 cause of poor music transcription. Disable only if Demucs is not installed or the server is very resource-limited.

Music mode: always use whisper-1 for timestamps (MUSIC_MODE_FORCE_WHISPER1)

When a music job is processed, override the configured transcription model to whisper-1 regardless of the global model setting. Recommended on — whisper-1 is the only OpenAI model that returns per-segment timestamps. Without timestamps, LRC and SRT files cannot be time-synced. Disable only if you intentionally want gpt-4o-transcribe on music jobs.
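A minimal sketch of how the two music-mode overrides could combine with the global settings. The Settings dataclass, its default values, and effective_job_options are hypothetical names chosen for illustration; only the setting names in the comments come from this page.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Settings:
    # Defaults below are placeholders, not the application's real defaults.
    openai_transcription_model: str = "gpt-4o-mini-transcribe"  # OPENAI_TRANSCRIPTION_MODEL
    enable_vocal_separation: bool = False                       # ENABLE_VOCAL_SEPARATION
    music_mode_force_vocal_separation: bool = True              # MUSIC_MODE_FORCE_VOCAL_SEPARATION
    music_mode_force_whisper1: bool = True                      # MUSIC_MODE_FORCE_WHISPER1


def effective_job_options(settings: Settings, is_music: bool) -> Tuple[str, bool]:
    """Return (model, use_vocal_separation) for one job."""
    model = settings.openai_transcription_model
    separate = settings.enable_vocal_separation
    if is_music:
        if settings.music_mode_force_whisper1:
            model = "whisper-1"  # needed for per-segment timestamps (LRC/SRT)
        if settings.music_mode_force_vocal_separation:
            separate = True
    return model, separate
```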

Degraded coverage threshold (DEGRADED_COVERAGE_THRESHOLD)

Fraction of audio that must be covered by transcript segments before the result is considered acceptable. Below this ratio the auto-retry safety net fires. Default 0.30 (30 %) is speech-optimised. Raise to 0.50 on music-first deployments to catch more hallucinations (complementary to the blip-pattern detector which works regardless of this value).
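One way to compute the coverage ratio is to take the union of all segment intervals and divide by the audio duration. This is an assumption about how coverage is measured, not a description of the application's detector; coverage_ratio and is_degraded are hypothetical helpers.

```python
from typing import Iterable, Tuple


def coverage_ratio(segments: Iterable[Tuple[float, float]], duration: float) -> float:
    """Fraction of the audio covered by (start, end) segments, in seconds.

    Overlapping segments are merged so no span is counted twice.
    """
    if duration <= 0:
        return 0.0
    covered = 0.0
    cur_start = cur_end = None
    for start, end in sorted(segments):
        start, end = max(0.0, start), min(duration, end)  # clamp to the file
        if end <= start:
            continue
        if cur_end is None or start > cur_end:
            if cur_end is not None:
                covered += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            cur_end = max(cur_end, end)  # extend the current merged span
    if cur_end is not None:
        covered += cur_end - cur_start
    return covered / duration


def is_degraded(segments, duration: float, threshold: float = 0.30) -> bool:
    """True when coverage falls below DEGRADED_COVERAGE_THRESHOLD."""
    return coverage_ratio(segments, duration) < threshold
```

For a 100-second track with segments covering 40 seconds, the result passes at the 0.30 default but counts as degraded at 0.50.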

faster-whisper settings (used by the local provider and by vintage mode).

Model size (FASTER_WHISPER_MODEL_SIZE)

Device (FASTER_WHISPER_DEVICE)

Compute type (FASTER_WHISPER_COMPUTE_TYPE)

Model cache directory (FASTER_WHISPER_MODEL_DIR)

Leave blank to use the default HuggingFace cache. On first use, the chosen model is downloaded (~150 MB to ~3 GB depending on size).
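The four settings above map naturally onto the constructor arguments of faster-whisper's WhisperModel. The sketch below only builds the arguments from environment variables; the helper name faster_whisper_args and the fallback values ("large-v3", "cpu", "int8") are assumptions made for illustration, not the application's defaults.

```python
import os
from typing import Any, Dict, Mapping, Tuple


def faster_whisper_args(env: Mapping[str, str] = os.environ) -> Tuple[str, Dict[str, Any]]:
    """Return (model_size, keyword arguments) built from the env settings."""
    model_size = env.get("FASTER_WHISPER_MODEL_SIZE", "large-v3")  # assumed fallback
    kwargs: Dict[str, Any] = {
        "device": env.get("FASTER_WHISPER_DEVICE", "cpu"),             # assumed fallback
        "compute_type": env.get("FASTER_WHISPER_COMPUTE_TYPE", "int8"),  # assumed fallback
    }
    model_dir = env.get("FASTER_WHISPER_MODEL_DIR", "").strip()
    if model_dir:
        # Blank means "use the default HuggingFace cache", so the key is omitted.
        kwargs["download_root"] = model_dir
    return model_size, kwargs
```

With faster-whisper installed, the result can be passed on as WhisperModel(model_size, **kwargs).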

OpenAI

API Key (OPENAI_API_KEY)

Current:

Admin API Key (OPENAI_ADMIN_API_KEY)

Optional. Admin key with api.usage.read scope — enables live billing data on the dashboard. Create one at platform.openai.com/api-keys.

Current:

Transcription model (OPENAI_TRANSCRIPTION_MODEL)

whisper-1 is whisper-large-v2 under the hood — fine for clean speech but hallucinates random languages on music with strong instrumentals (the "Take On Me as Russian" failure mode). gpt-4o-mini-transcribe is the cost/quality sweet spot. gpt-4o-transcribe is best on accents and difficult audio. Pricing changes — check openai.com/api/pricing.

Fallback model (OPENAI_TRANSCRIPTION_FALLBACK_MODEL)

Optional secondary model used if the primary fails (transient API errors, model outage). Pick a different model from the primary so a model-specific outage doesn't take both down.
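The fallback behaviour amounts to one extra attempt with a different model. A minimal sketch, assuming the actual API call is wrapped in a callable that takes a model name; transcribe_with_fallback and that callable are hypothetical, and a real implementation would likely catch narrower exception types.

```python
from typing import Callable, Optional


def transcribe_with_fallback(
    call: Callable[[str], str],  # hypothetical: runs one transcription with the given model
    primary: str,                # OPENAI_TRANSCRIPTION_MODEL
    fallback: Optional[str],     # OPENAI_TRANSCRIPTION_FALLBACK_MODEL (may be empty)
) -> str:
    """Try the primary model, then the fallback once if the primary raises."""
    try:
        return call(primary)
    except Exception:
        # No fallback configured, or it is the same model: nothing else to try.
        if not fallback or fallback == primary:
            raise
        return call(fallback)
```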

Max file size (MB) (OPENAI_MAX_FILE_MB)

Hard upload limit of the OpenAI transcription API (25 MB). Do not raise above 25 unless the API limit changes.

Budget guardrails

Costs shown on the dashboard are pulled live from the OpenAI API when the key has api.usage.read (admin) permissions. Otherwise they fall back to local estimates based on the rate below.

Monthly budget (USD) (MONTHLY_PLATFORM_BUDGET_USD)

Daily budget (USD) (DAILY_PLATFORM_BUDGET_USD)

Transcription cost per minute (USD) (TRANSCRIPTION_COST_PER_MINUTE_USD)

OpenAI whisper-1 is billed at $0.006 per minute of audio. Update here if pricing changes.

$ / min
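The local estimate and the "cap left" figures reduce to simple arithmetic. The sketch below assumes cost scales linearly with audio duration and that a cap is reached when spend equals the budget; the function names are hypothetical.

```python
def estimate_cost_usd(duration_seconds: float, cost_per_minute: float = 0.006) -> float:
    """Local cost estimate for one job (rate: TRANSCRIPTION_COST_PER_MINUTE_USD)."""
    return duration_seconds / 60.0 * cost_per_minute


def cap_left(budget_usd: float, spent_usd: float) -> float:
    """Remaining budget before auto-suspend, never negative."""
    return max(0.0, budget_usd - spent_usd)


def should_suspend(budget_usd: float, spent_usd: float) -> bool:
    """Assumption: new jobs are suspended once spend reaches the budget."""
    return spent_usd >= budget_usd
```

At the default rate, a 10-minute file is estimated at $0.06.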

Auto-transcoding

Convert oversized files to MP3 via ffmpeg before sending to OpenAI.

Enable auto-transcode (TRANSCODE_OVERSIZED)

Transcoding bitrate (TRANSCODE_BITRATE_KBPS)

128 kbps mono is sufficient for Whisper.
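A sketch of the kind of ffmpeg invocation this setting implies, plus a rough size estimate for the output. The helper names are hypothetical, and the command assumes an ffmpeg build that includes the libmp3lame encoder; the application's real command line may differ.

```python
from typing import List


def ffmpeg_transcode_cmd(src: str, dst: str, bitrate_kbps: int = 128) -> List[str]:
    """Build an ffmpeg command that produces a mono MP3 at a fixed bitrate."""
    return [
        "ffmpeg", "-y",              # overwrite the output file if it exists
        "-i", src,
        "-vn",                       # drop any video or cover-art stream
        "-ac", "1",                  # downmix to mono
        "-c:a", "libmp3lame",
        "-b:a", f"{bitrate_kbps}k",  # TRANSCODE_BITRATE_KBPS
        dst,
    ]


def estimated_size_mb(duration_seconds: float, bitrate_kbps: int = 128) -> float:
    """Approximate output size in MB (1 MB = 1,000,000 bytes), ignoring container overhead."""
    return duration_seconds * bitrate_kbps * 1000 / 8 / 1_000_000
```

The list can be passed to subprocess.run(cmd, check=True). Under this estimate, roughly 26 minutes of audio fit within 25 MB at 128 kbps.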

Audio limits

Max audio duration (MAX_AUDIO_DURATION_SECONDS)

Files longer than this are rejected before transcription.

seconds

Application

Output directory (DEFAULT_OUTPUT_DIR)

Log level (LOG_LEVEL)

Enable debug output (ENABLE_DEBUG_OUTPUT)

Transactional email

Disabled (no-op logger)

Provider (EMAIL_PROVIDER)

Leave empty to disable real email delivery; messages are only logged. Changes apply immediately on save.

Resend API key (RESEND_API_KEY)

Leave blank to keep the current value (currently: ).

From address (EMAIL_FROM)

From name (EMAIL_FROM_NAME)

Reply-To (EMAIL_REPLY_TO)

Public base URL (PUBLIC_BASE_URL)

Used to build verify / reset links inside emails. In production this must be the public HTTPS URL.

Reset token TTL (PASSWORD_RESET_TTL_MINUTES)

minutes
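How PUBLIC_BASE_URL and PASSWORD_RESET_TTL_MINUTES could be used together is sketched below. The /reset-password path, the token query parameter, and the function names are hypothetical; they only illustrate why the base URL must be publicly reachable.

```python
from datetime import datetime, timedelta, timezone
from urllib.parse import urlencode


def build_reset_link(public_base_url: str, token: str) -> str:
    """Join the public base URL with a (hypothetical) reset path and token."""
    base = public_base_url.rstrip("/")
    return f"{base}/reset-password?{urlencode({'token': token})}"


def is_expired(issued_at: datetime, ttl_minutes: int, now: datetime) -> bool:
    """True once the token is older than PASSWORD_RESET_TTL_MINUTES."""
    return now >= issued_at + timedelta(minutes=ttl_minutes)
```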

Admin alert email (ADMIN_ALERT_EMAIL)

Require email verification (ENABLE_EMAIL_VERIFICATION)

Audio Recognition / AudD

Privacy notice: AudD sends a short audio sample (default 12 s) to an external provider. Enable this only if your privacy policy and terms disclose external processing. The full uploaded file is never sent.
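The sample sent to AudD can be thought of as a short window cut from the file. The sketch below assumes the position setting is a percentage of the total duration marking where the window starts; that interpretation and the sample_window helper are assumptions, not confirmed behaviour.

```python
from typing import Tuple


def sample_window(duration_s: float, position_pct: float, sample_s: float = 12.0) -> Tuple[float, float]:
    """Return (start, end) in seconds for the sample, clamped to the file length."""
    length = min(sample_s, duration_s)            # short files yield a shorter sample
    start = duration_s * position_pct / 100.0
    start = max(0.0, min(start, duration_s - length))
    return start, start + length
```

For a 200-second track at 50 %, the window is 100 s to 112 s; only that slice would leave the server.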

AudD enabled (AUDD_ENABLED)

API token (AUDD_API_TOKEN)

Current:

Default audio mode (AUDD_DEFAULT_AUDIO_MODE)

Run for "auto" mode (AUTO_RECOGNITION_FOR_AUTO_MODE)

Premium-only mode

Admin override (test)

Sample duration (s)

Primary position (%)

Enable fallback sample

Fallback position (%)

Max requests per job

Timeout (ms)

Retry count

Sample format

Sample bitrate

Sample rate (Hz)

Sample channels

Store raw AudD response (debug only)

Cost per 1,000 reqs (USD)

Credit cost per recognition

Daily soft limit

Monthly soft limit

Per-user daily limit

Per-user monthly limit

AudD usage

Today

This month

Success rate

Avg latency

Status this month

No recognitions this month.

Recent failures

None.

Last 100 events

Time	Event	Job	Latency	Detail

VPS Health

Auto-refresh (10s)

CPU

RAM

Disk

Recent logs

last 100 lines · View full log →

No log entries.

Job Debugger

Search any job by UUID to view full details: model used, timestamps_mode, transcription attempts, audit logs, etc.

Search details

User ID

Action

Audit log

No audit events recorded yet.

Timestamp	Action	Job ID	User ID	Details

User:

Balance

Reserved

Available

Adjust credits (+ or −)

Note (optional)

Recent transactions

Date	Type	Amount	Note

Users

No users found.

Email	Role	Status	Credits	Trials used	Joined	Last login

Dev Tools

Simulate password reset without email service. Only available when ENABLE_EMAIL_VERIFICATION=false.

New password

Free Trial Controls

Free trials paused

When paused, no new free trial jobs will be accepted platform-wide.

Status:

Block an IP

Blocked IPs ()

No IP is currently blocked.

IP	Reason	Blocked at	Actions

Sent today

Failed today

Sent this month

Failed this month

By category — last 30 days

No emails in the last 30 days.

Category	Sent	Failed

Recent failures

Email log

No email log entries match these filters.

When	Category	Recipient	Status	Latency	Detail

🔍 Debugging API calls:

• Use the API Calls filter to see all v1 API requests (POST, GET /segments, etc.)
• Use the Transcriptions filter to debug job processing (_run_job START/SUCCESS/FAILED)
• Use the Timestamps filter to verify model selection (whisper-1 vs gpt-4o-*)
• The search bar also supports regex: try has_timestamps= to find timestamp metadata in responses
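The same kind of regex search can be reproduced offline on a downloaded log file. The helper below is a generic sketch; the log line format used in the usage example is invented for illustration and may not match the real application logs.

```python
import re
from typing import Iterable, List


def filter_log_lines(lines: Iterable[str], pattern: str) -> List[str]:
    """Keep only the lines that match the given regular expression."""
    rx = re.compile(pattern)
    return [line for line in lines if rx.search(line)]


# Hypothetical log lines, for illustration only.
sample = [
    "INFO _run_job START job=1",
    "INFO response model=whisper-1 has_timestamps=True",
    "INFO response model=gpt-4o-transcribe has_timestamps=False",
]
matches = filter_log_lines(sample, r"has_timestamps=(True|False)")
```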

Level

Lines

Quick filters

Auto-refresh (5s)

Download current

Older rotations:

Application logs

Log file not found at . Run a request to generate the first lines, then refresh.

No entries match the current filters.
