[T]extFromTrack

Admin login

Sign in with your administrator account

← Back to app |

Admin

Live OpenAI monthly cost data unavailable — showing local estimates.

Jobs today

Cost today

· DB estimate

Daily auto-suspend cap left

Self-imposed cap (DAILY_PLATFORM_BUDGET_USD)

Monthly auto-suspend cap left

Live OpenAI ~ Estimate

Job history

Loading…
No jobs recorded yet. Process an audio file to see history here.

About this build

Transcription backend

Active backend for new transcription jobs. Vintage mode is an override that forces local large-v3 + vocal separation regardless of the provider — tuned for old / low-quality recordings (cassette transfers, AM radio, music with strong instrumentals). See local_transcription.md for the sizing matrix.

Quick presets:

openai calls Whisper API (cloud, fast, good on speech). faster-whisper runs Whisper locally (no API call, slower on CPU).

Extracts the vocal stem from the mix before transcription. Strongly recommended for music — without it Whisper hallucinates random languages on instrumental-heavy tracks. First use downloads ~3 GB of model weights (one-time).

htdemucs (v4 hybrid transformer) is the speed/quality default.

When off (default), the vintage toggle is hidden in the UI and any vintage=true sent via the API is silently ignored. Enable only on instances with a local faster-whisper + Demucs stack powerful enough to run it.

Forces local large-v3 + vocal separation regardless of the provider. ~6× realtime on CPU — best run as a one-off via --vintage on the CLI for individual jobs, not as a default.

When the first pass returns a degraded transcript (Whisper covered < 30 % of the audio — the textbook hallucination on instrumental-heavy music), automatically re-run the same audio with Demucs forced on. The user pays credits exactly once regardless of the retry. Off → leaves the user with a degraded transcript and a warning banner instead. Adds wall-clock time only when the first pass actually fell over.

When a job is submitted with Audio type = Music track, automatically run Demucs vocal separation even if the global "Vocal separation" toggle is off. Recommended on — guitar and instrument hallucinations are the #1 cause of poor music transcription. Only disable if Demucs is not installed or on very resource-limited servers.

When a music job is processed, override the configured transcription model to whisper-1 regardless of the global model setting. Recommended on — whisper-1 is the only OpenAI model that returns per-segment timestamps. Without timestamps, LRC and SRT files cannot be time-synced. Disable only if you intentionally want gpt-4o-transcribe on music jobs.

Fraction of audio that must be covered by transcript segments before the result is considered acceptable. Below this ratio the auto-retry safety net fires. Default 0.30 (30 %) is speech-optimised. Raise to 0.50 on music-first deployments to catch more hallucinations (complementary to the blip-pattern detector which works regardless of this value).

faster-whisper settings (used by the local provider and by vintage mode).

Leave blank to use the default HuggingFace cache. On first use, the chosen model is downloaded (~150 MB to ~3 GB depending on size).

OpenAI

Current:

Optional. Admin key with api.usage.read scope — enables live billing data on the dashboard. Create one at platform.openai.com/api-keys.

Current:

whisper-1 is whisper-large-v2 under the hood — fine for clean speech but hallucinates random languages on music with strong instrumentals (the "Take On Me as Russian" failure mode). gpt-4o-mini-transcribe is the cost/quality sweet spot. gpt-4o-transcribe is best on accents and difficult audio. Pricing changes — check openai.com/api/pricing.

Optional secondary model used if the primary fails (transient API errors, model outage). Pick a different model from the primary so a model-specific outage doesn't take both down.

OpenAI Whisper hard limit. Do not raise above 25 unless the API limit changes.

MB

Budget guardrails

Costs shown on the dashboard are pulled live from the OpenAI API when the key has api.usage.read (admin) permissions. Otherwise they fall back to local estimates based on the rate below.

$
$

OpenAI Whisper is $0.006/min. Update here if pricing changes.

$ / min

Auto-transcoding

Convert oversized files to MP3 via ffmpeg before sending to OpenAI.

Enable auto-transcode (TRANSCODE_OVERSIZED)

128 kbps mono is sufficient for Whisper.

Audio limits

Files longer than this are rejected before transcription.

seconds

Application

Enable debug output (ENABLE_DEBUG_OUTPUT)

Transactional email

Disabled (no-op logger)

Empty disables real email — messages are only logged. Save applies immediately.

Leave blank to keep the current value (currently: ).

Used to build verify / reset links inside emails. Production must be the public HTTPS URL.

minutes
Require email verification (ENABLE_EMAIL_VERIFICATION)

Audio Recognition / AudD

Privacy notice: AudD sends a short audio sample (default 12 s) to an external provider. Enable this only if your privacy policy and terms disclose external processing. The full uploaded file is never sent.

AudD enabled (AUDD_ENABLED)

Current:

Run for "auto" mode (AUTO_RECOGNITION_FOR_AUTO_MODE)
Premium-only mode
Admin override (test)
Enable fallback sample
Store raw AudD response (debug only)

AudD usage

Today

This month

Success rate

Avg latency

Status this month

  • No recognitions this month.

Recent failures

  • None.

Last 100 events

Time Event Job Latency Detail

VPS Health

CPU

RAM

Disk

Recent logs

last 100 lines · View full log →
Loading…
No log entries.

Job Debugger

Search any job by UUID to view full details: model used, timestamps_mode, transcription attempts, audit logs, etc.

Audit log

Loading…
No audit events recorded yet.
Timestamp Action Job ID User ID Details

User:

Balance

Reserved

Available

Recent transactions

Date Type Amount Note

Users

Loading…
No users found.
Email Role Status Credits Trials used Joined Last login

Dev Tools

Simulate password reset without email service. Only available when ENABLE_EMAIL_VERIFICATION=false.

Free Trial Controls

Free trials paused

When paused, no new free trial jobs will be accepted platform-wide.

Status:

Block an IP

Blocked IPs ()

No IP is currently blocked.
IP Reason Blocked at Actions

Sent today

Failed today

Sent this month

Failed this month

By category — last 30 days

No emails in the last 30 days.
Category Sent Failed

Recent failures

Email log

Loading…
No email log entries match these filters.
When Category Recipient Status Latency Detail
🔍 Debugging API calls:
  • • Use API Calls filter to see all v1 API requests (POST, GET /segments, etc)
  • • Use Transcriptions filter to debug job processing (_run_job START/SUCCESS/FAILED)
  • • Use Timestamps filter to verify model selection (whisper-1 vs gpt-4o-*)
  • • Search bar also supports regex: try has_timestamps= to find timestamp metadata in responses
Download current
Older rotations:

Application logs

Loading…
Log file not found at . Run a request to generate the first lines, then refresh.
No entries match the current filters.