vCon WTF Documentation

Welcome to the vCon WTF documentation! This library provides comprehensive support for converting between various transcription provider formats and the standardized IETF World Transcription Format (WTF).

Overview

vCon WTF is a Python library that standardizes transcription data across multiple providers. It supports bidirectional conversion between provider-specific formats and the IETF World Transcription Format (WTF), enabling interoperability and consistency in transcription workflows.

Key Features

  • Multi-Provider Support: Convert between 6 major transcription providers

  • Bidirectional Conversion: Provider format ↔ WTF format

  • Cross-Provider Testing: Consistency, performance, and quality validation

  • CLI Tool: Rich command-line interface with progress bars

  • Comprehensive Validation: Robust WTF format validation

  • Extensible Architecture: Easy to add new providers

Supported Providers

  • Whisper (OpenAI) - High-quality speech recognition

  • Deepgram - Real-time and batch transcription

  • AssemblyAI - Advanced AI transcription with speaker diarization

  • Rev.ai - Professional transcription services

  • Canary (NVIDIA) - Hugging Face integration

  • Parakeet (NVIDIA) - Hugging Face integration

Quick Start

Install the library:

pip install vcon-wtf

Convert a transcription:

from wtf_transcript_converter.providers import WhisperConverter

converter = WhisperConverter()
wtf_doc = converter.convert_to_wtf(whisper_data)
print(wtf_doc.transcript.text)

Use the CLI:

vcon-wtf to-wtf input.json --provider whisper --output result.wtf.json

Cross-Provider Testing:

vcon-wtf cross-provider all input.json --output-dir reports/

Indices and tables