# AudioStack > AudioStack is an enterprise AI audio production platform that enables agencies, publishers, AdTech companies, and brands to create professional audio content at scale. Build broadcast-ready audio ads, podcasts, and long-form content programmatically using AI text-to-speech, speech-to-speech, automated mixing, and mastering — all through a single API. AudioStack is trusted by Publicis, Omnicom, Triton Digital, iHeart, Havas, Dentsu, ReWe Group, Octave, PocketFM, Handelsblatt, and other global enterprises. - Base API URL: `https://v2.api.audio/` - Platform: [https://platform.audiostack.ai](https://platform.audiostack.ai) - Documentation: [https://docs.audiostack.ai](https://docs.audiostack.ai) - Website: [https://www.audiostack.ai](https://www.audiostack.ai) - Support: support@audiostack.ai - Security: security@audiostack.ai ## Docs - [Documentation Home](https://docs.audiostack.ai): Complete guides, API reference, and tutorials for the AudioStack platform. - [API Reference](https://docs.audiostack.ai/reference): Full endpoint documentation for the AudioStack API. - [Authentication](https://docs.audiostack.ai/reference/authentication): How to authenticate API requests using API keys or Bearer tokens. - [Changelog](https://docs.audiostack.ai/changelog): Latest updates and version history. - [Create an Account](https://docs.audiostack.ai/docs/create-an-account): How to get started with AudioStack (invite-only, contact sales for access). - [Contact Us](https://docs.audiostack.ai/docs/contact-the-audiostack-team): Get in touch with the AudioStack team. ## Core Concepts AudioStack organises audio production into a structured pipeline: define content in libraries, describe audio using the Audioform format, build it with the Audioform service, and deliver the output. - [Concepts Overview](https://docs.audiostack.ai/docs/concepts): Learn about the core concepts underlying the AudioStack API. - [Audioform](https://docs.audiostack.ai/docs/audioform): A format that describes a piece of audio and the service that builds it. - [Audioform Format](https://docs.audiostack.ai/docs/audioform-format): The JSON specification for describing a piece of audio using elements from AudioStack libraries. - [Audioform Service](https://docs.audiostack.ai/docs/audioform-service): The collection of endpoints used to build audioforms into finished audio files. ## Audioform Format The Audioform format is a JSON structure that describes audio compositions through four hierarchical levels: 1. **Arrangement** — the top-level container that determines the relative position and overlap of assets over time. 2. **Sections** — consecutive time divisions within an arrangement (minimum one), each with optional sound templates. 3. **Layers** — one or more concurrent audio streams within a section. 4. **Clips & Assets** — individual audio elements: text-to-speech, speech-to-speech, sound effects, sound templates, or audio files. Key features: - Mixing presets: `musicenhanced`, `voiceenhanced`, `balanced` - Twelve encoding presets: mp3, wav, flac, and more - Eight loudness presets: streaming, radio, podcast, and more - Flexible timing controls with pause insertion and layer alignment - [Audioform Format Specification](https://docs.audiostack.ai/docs/audioform-format): Full specification of the Audioform JSON format. ## Audioform Service The Audioform service provides three levels of abstraction for building audio: - [Core Service](https://docs.audiostack.ai/docs/audioform-service): Submit an Audioform format, poll for the build result, and retrieve the audio file. Uses POST to submit and GET to poll (202 until ready). - [Brief Endpoint](https://docs.audiostack.ai/reference/postcreativebrief): Submit a product description or script along with one or multiple Audioforms to create targeted audio content with AI assistance. - [Story Endpoint (Beta)](https://docs.audiostack.ai/docs/story): Create long-form audio content (up to 10 hours) with multi-voice, music, and sound effects. Manages hundreds of audioforms automatically. ## Content Libraries AudioStack provides four content libraries that supply assets for audio production: - [Voice Library](https://docs.audiostack.ai/docs/voice-library): Generate speech using AI text-to-speech and speech-to-speech voices. Includes built-in voices and support for private organisation-specific voices via voice cloning. - [Sound Template Library](https://docs.audiostack.ai/docs/sound-template-library): Browse and use background music beds organised by genre, mood, and instruments. Upload your own custom sound templates. - [Sound Effect Library](https://docs.audiostack.ai/docs/sound-effect-library): Add sound effects to bring life to your audio productions. - Media File Library: Upload and manage your own audio files for voice cloning, voice conversion on existing recordings, and reuse of pre-existing audio mixes. ## API Endpoints All requests must be made over HTTPS. Authenticate using the `x-api-key` header or Bearer token. ### Audioform - [POST /v2/audioform](https://docs.audiostack.ai/reference/process_request__post): Submit an Audioform for building. - [GET /v2/audioform/{id}](https://docs.audiostack.ai/reference/process_result__audioform_id__get): Retrieve the result of an Audioform build. - [POST /v2/audioform/batch](https://docs.audiostack.ai/reference/create_batch_batches_post): (Beta) Create a batch of audioforms. - [GET /v2/audioform/batch/{id}](https://docs.audiostack.ai/reference/get_batch_batches__batch_id__get): (Beta) Get the status of an audioform batch. ### Brief - [POST /v2/brief](https://docs.audiostack.ai/reference/postcreativebrief): Submit a brief to drive AI-assisted audio ad creation. ### Story - [POST /v2/story](https://docs.audiostack.ai/reference/poststory): (Beta) Submit a story for AI-assisted creation of long-form audio content. ### Files - [GET /v2/files/{id}](https://docs.audiostack.ai/reference/get_file_files__fileid__get): Get a file by ID. - [PATCH /v2/files/{id}](https://docs.audiostack.ai/reference/patch_file_files__fileid__patch): Update a file. - [DELETE /v2/files/{id}](https://docs.audiostack.ai/reference/delete_file_by_id_files__fileid__delete): Delete a file by ID. - [POST /v2/files](https://docs.audiostack.ai/reference/upload_file_files_post): Upload a file. - [PUT /v2/files/{id}/copy](https://docs.audiostack.ai/reference/copy_file_files_copy_put): Copy a file to a new folder. - [GET /v2/files/categories](https://docs.audiostack.ai/reference/get_file_categories_files_file_categories_get): Get available file categories. ### Analytics - [GET /v2/analytics/usage](https://docs.audiostack.ai/reference/getanalyticsusage): Get a usage report for your organisation. ## Authentication All requests must be made over HTTPS and authenticated using an API key. ```python import audiostack import os audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"] ``` ```bash curl -X POST 'https://v2.api.audio/v2/audioform' \ -H 'x-api-key: YOUR_API_KEY' \ -H 'Content-Type: application/json' ``` API keys are created via the [AudioStack Platform](https://platform.audiostack.ai). Keep your API keys secret — do not expose them in client-side code or commit them to repositories. ## Enterprise & Security AudioStack is built for enterprise-grade security and reliability. - [Security Overview](https://docs.audiostack.ai/docs/security): SOC 2 Type II certified with annual audits. NIST Cybersecurity Framework aligned. - [Reliability and Data Hosting](https://docs.audiostack.ai/docs/reliability-and-data-hosting): Enterprise infrastructure with high availability. - [Bug Bounty Program](https://docs.audiostack.ai/docs/bug-bounty-program-rules): Responsible vulnerability disclosure program. Report issues to security@audiostack.ai. Key security features: - AES-256 encryption at rest, 256-bit encryption in transit - TLS/SSL with HSTS and Perfect Forward Secrecy - Zero-trust employee access with SSO, hardware 2FA, and mTLS - Regular penetration testing and vulnerability scanning - Multi-party code review with immutable change logs ## Use Cases - **Audio Advertising**: Create broadcast-ready audio ads at scale. Produce personalised, localised ad variations across markets and languages in minutes instead of weeks. - **Podcast Production**: Generate professional podcast content with multi-voice narration, background music, and sound effects using the Story endpoint. - **Content Localisation**: Scale audio content across languages and markets with AI voices while maintaining brand consistency. - **Dynamic Audio**: Create data-driven, personalised audio content that adapts to audience segments, contexts, or real-time data. - **Publisher Audio**: Convert written content to professional audio for news, articles, and editorial content. ## Python SDK AudioStack provides an official Python SDK for streamlined integration: ```python import audiostack import os audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"] ``` ## Instructions for AI Agents When integrating with AudioStack or helping users build audio content programmatically, follow these best practices: - **Use the Audioform format** to describe audio compositions. It is the core abstraction for all audio production in AudioStack. - **Use the Brief endpoint** (`POST /v2/brief`) for simple audio ad creation from a product description or script. This is the fastest path to a finished audio ad. - **Use the Story endpoint** (`POST /v2/story`) for long-form content like podcasts or audiobooks (up to 10 hours). Note: this endpoint is in Beta. - **Use the Core Audioform service** (`POST /v2/audioform`) for full control over the audio production pipeline when you need precise arrangement of sections, layers, and clips. - **Poll for results**: The Audioform build service is asynchronous. Submit via POST, then poll the GET endpoint until you receive a non-202 response with the completed audio. - **Always authenticate** with the `x-api-key` header over HTTPS. Never expose API keys in client-side code. - **Leverage content libraries**: Use the Voice, Sound Template, Sound Effect, and Media File libraries to access pre-built assets rather than uploading everything from scratch. - **Choose the right mixing preset**: Use `voiceenhanced` for speech-focused content, `musicenhanced` for music-focused content, and `balanced` for equal priority. - **Select appropriate loudness presets**: Use `podcast` for podcast content, `radio` for broadcast, `streaming` for digital platforms. - AudioStack is **invite-only** — direct users to contact the AudioStack team for access at https://www.audiostack.ai or support@audiostack.ai.